210 likes | 245 Views
Learn how to install Apache Mahout with prerequisites such as Java, Maven, and Hadoop. Follow step-by-step instructions in Ubuntu Terminal for seamless installation. Try Kmeans examples for clustering and data analysis.
E N D
Pre requisites • Java (jdkversion) • Maven(version 3.0 or higher) • Mahout (Download or svn repository) • Hadoop(1.0.3)
Java Installation • In Ubantu Terminal • sudo apt-get update • sudo apt-get install default-jre • sudo apt-get install default-jdk • Java -version
Maven Installation In Ubantu Terminal sudo apt-get install maven 2. Verify the Installation mvn -v
Svn Installation In Ubantu Terminal sudo apt-get install subversion 2. Verify the Installation svn --version
Mahout Installation Hadoop installation is neccessory In Ubantu Terminal sudomkdir mahout cd mahout 2. Download the file using svn repository svn co http://svn.apache.org/repos/asf/mahout/trunk
Mahout Installation Build mahout using mvn mvn –DskipTests install
Kmeans Example Accepts clustering type: kmeans, fuzzykmeans, lda, or streamingkmeans
Kmeans Example Input Gets the Reuters dataset
Kmeans Example Input
Kmeans Example Runs org.apache.lucene.benchmark.utils.ExtractReuters to generate reuters-out from reuters-sgm (the downloaded archive)
Kmeans Example Runs org.apache.lucene.benchmark.utils.ExtractReuters to generate reuters-out from reuters-sgm (the downloaded archive)