Wednesday, August 7, 2013

Mahout data mining approach

I want to use Mahout for data mining and have several questions:
(1) How is Mahout mainly used? From the command line, or some other way?
(2) While Mahout is running, I want to monitor the execution. How can I do that? Is there an existing monitoring tool, or what technology should I use?
(3) For the mining process, is it enough to prepare the data and apply the algorithm to it?
------ Solution ---------------------------------------- ----

1. Mahout is an Apache project, mainly used for data analysis and mining.
2. I do not know of other options; I mainly use it on MapReduce. You can use Ganglia to monitor Hadoop MapReduce jobs. That monitoring has nothing to do with Mahout itself.
3. You first need to design the data model, the ETL, and the application workflow. Even unsupervised clustering such as k-means still needs to be trained on the data.
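
To make point 3 a bit more concrete, here is a minimal sketch of what "training" k-means involves: assign each point to its nearest centroid, recompute the centroids, and repeat. This is plain single-machine Java on one-dimensional data, not Mahout's API. The class name `KMeansSketch`, the sample points, and the starting centroids are all made up for illustration. Mahout's own k-means is typically run as Hadoop jobs over vectors that were prepared beforehand, which this sketch does not try to reproduce.

```java
import java.util.Arrays;

public class KMeansSketch {

    // Runs a fixed number of k-means iterations on 1-D points.
    // Returns the final centroids; the input arrays are not modified.
    static double[] cluster(double[] points, double[] initialCentroids, int iterations) {
        double[] centroids = initialCentroids.clone();
        for (int it = 0; it < iterations; it++) {
            double[] sum = new double[centroids.length];
            int[] count = new int[centroids.length];
            // Assignment step: each point goes to its nearest centroid.
            for (double p : points) {
                int best = nearest(p, centroids);
                sum[best] += p;
                count[best]++;
            }
            // Update step: move each centroid to the mean of its points.
            // A centroid with no points keeps its previous position.
            for (int c = 0; c < centroids.length; c++) {
                if (count[c] > 0) {
                    centroids[c] = sum[c] / count[c];
                }
            }
        }
        return centroids;
    }

    // Index of the centroid closest to p (ties go to the lower index).
    static int nearest(double p, double[] centroids) {
        int best = 0;
        for (int c = 1; c < centroids.length; c++) {
            if (Math.abs(p - centroids[c]) < Math.abs(p - centroids[best])) {
                best = c;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Hypothetical data: two obvious groups, around 1.5 and around 9.5.
        double[] points = {1.0, 1.5, 2.0, 9.0, 9.5, 10.0};
        double[] centroids = cluster(points, new double[]{0.0, 5.0}, 10);
        System.out.println(Arrays.toString(centroids));
    }
}
```

The data preparation mentioned above (data model, ETL) is what produces the `points` array here; in a real job that step is usually most of the work.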
------ Solution -------- ------------------------------------
See "Mahout in Action". You can use the command line, and Mahout can also be integrated into a Java program.
------ For reference only -------------------------------------- -
The Java programs in "Mahout in Action" are not distributed, right?
