| Random Link ¯\_(ツ)_/¯ | ||
| Mar 1, 2014 | » | [ToDo] Mining of Massive Datasets
2 min; updated Sep 5, 2022
Data Mining What is Data Mining? Statistical Limits of Data Mining Things Useful to Know Outline of the book MapReduce and the New Software Stack Distributed File Systems MapReduce Algorithms Using MapReduce Extensions to MapReduce The Communication Cost Model Complexity Theory for MapReduce Finding Similar Items Applications of Near-Neighbor Search Shingling of Documents Similarity-Preserving Summaries of Sets Locality-Sensitive Hashing for Documents Distance Measures The Theory of Locality-Sensitive Functions LSH Families for Other Distance Measures Applications of Locality-Sensitive Hashing Methods for High Degrees of Similarity Mining Data Streams... |