MapReduce is an algorithm for processing huge datasets on certain kinds of distributable problems using a large number of nodes
1
vote
0answers
72 views
CSMR for large-scale text-prcessing
I'm working on a project for large-scale text-processing, which is a first implementation of the basic idea of CSMR. CSMR is an algorithm that measures the similarity between documents by calculating ...