A book: Web Crawling and Data Mining with Apache Nutch
Recently I am reading a book <Web Crawling and Data Mining with Apache Nutch>, http://www.packtpub.com/web-crawling-and-data-mining-with-apache-nutch/book, it is really a great book. And I get help in my project.
In my project I need to crawl the web content and do the data analyst. From the book I can know how to use and integrate Nutch and Solr frameworks to implement it.
If you have similiar case, recommand to read this book.
posted on 2014-02-03 13:14 paulwong 閱讀(498) 評論(0) 編輯 收藏 所屬分類: HADOOP