Found in 4 comments on Hacker News
thecal · 2021-06-03 · Original thread
Sorry, I agree with the GP. This was a popular book for learning ML with Weka (which is still around):

There is also the Knowledge Discovery in Databases (KDD) term which is still around via:

agbell · 2009-11-21 · Original thread
I recommend starting with Weka and this great book:
gtani · 2009-10-26 · Original thread
some other helpful books:

- Data Mining, by Witten and Frank; describes basics with rigor, including how to use Weka, which they wrote

a couple java-based books from Manning:

- Collective Intelligence in Action (by Satnam Alag) and

- Algorithms of the Intelligent Web (Marmanis, Babenko)


gtani · 2009-06-30 · Original thread
spot on. OP: Are you asking how basic tf-idf works, or is there something you can't get lucene / SOLR / sphinx / tsearch to do easily?

nevertheless, here are some good background materials (search amazon on "data mining")

Also, Collective Intelligence in Action by Satnam Alag is quite good (a lot of java code to wade through, tho)
