Loading
2014. 11. 14. 00:46 - Phil lee

Hadoop TopNWordCount Informations

Hadoop TopNWordCount Informations

Various Application Hadoop Version 2Code

New Version Tutorial

 

TreeMap Information

 

 

How to connect jobs

 

The number of map tasks

  • 64MB / 3.3GB

 

http://gb-solutions.org/web/guest/wiki/-/wiki/Main/MapReduce+Job%EB%93%A4%20%EC%97%B0%EA%B2%B0%20%ED%95%98%EA%B8%B0

 

Chained Reducers

 

How to improve performance

 

Various Drivers

 

희석이 블로그 - http://blog.naver.com/hisukdory

 

Actually, in-Mapper function's performance is better than a standard combiner.

Install Single Node

Make Jar file

Multiple inputs

SortedByValues

Creating a jar file

  • rm -rf TopWordCount_classes
  • mkdir TopWordCount_classes
    javac -classpath /usr/lib/hadoop/hadoop-core-1.2.1.jar -d TopWordCount_classes TopWordCount.java
  • jar -cvf TopWordCount.jar -C ./TopWordCount_classes/ .
  • class=./$1-class
  • rm -rf $class
  • mkdir $class
  • javac -cp "/usr/lib/hadoop/*:/usr/lib/hadoop/lib/*" -d ./$1-class ./$1.java
  • jar -cvf $1.jar -C ./$1-class/ .

 

 

Running on Hadoop

 

  • hadoop jar TopWordCount.jar TopWordCount input.txt outputIn
  • hadoop fs -get outputIn/part-00000 TopWordCount.txt