We have been tasked with creating a simple search engine that searches a small set of .html documents (probably stored locally) for a given text string (not an exact occurrence, just similarity) and sorts the results by their similarity to the query and by PageRank (we have to implement the PageRank algorithm as well, but that's not the point of this question). To make the task easier, we are allowed to use any function or library that can do the text search, but I don't know of any. Is there a function or library that can do this? PHP is preferred, but we can work with Java or C++ as well.

Thanks a lot!

put on hold as off-topic by EdChum, Mark Baker, Jay Blanchard, Borgleader, Dark Falcon 19 hours ago

This question appears to be off-topic. The users who voted to close gave this specific reason:

  • "Questions asking us to recommend or find a book, tool, software library, tutorial or other off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it." – EdChum, Mark Baker, Jay Blanchard, Borgleader, Dark Falcon
If this question can be reworded to fit the rules in the help center, please edit the question.

    
Please read stackoverflow.com/help/on-topic –  Epodax 19 hours ago
    
For PHP: exec('grep -rl "yourstring" ') –  Daan 19 hours ago
    
lucene.apache.org/core –  Dark Falcon 19 hours ago
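
For the similarity-ranking part of the question, here is a minimal sketch in Java (one of the languages the question accepts) of one common approach: TF-IDF weighting with cosine similarity, blended with a precomputed PageRank value per document. This is only an illustration, not what a library such as Lucene does internally. The class name `SimpleSearch`, the method names, the regex-based tag stripping, and the `alpha` blending weight are all assumptions made for the example; the PageRank values are assumed to be computed elsewhere and scaled to a range comparable with the similarity (0 to 1).

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper class: names and scoring formula are illustrative assumptions.
class SimpleSearch {

    // Crude tag stripping plus lowercase word split; a real HTML parser would be more robust.
    static List<String> tokenize(String html) {
        String text = html.replaceAll("<[^>]*>", " ").toLowerCase();
        List<String> tokens = new ArrayList<>();
        for (String t : text.split("[^a-z0-9]+")) {
            if (!t.isEmpty()) tokens.add(t);
        }
        return tokens;
    }

    // Raw term counts for one document or query.
    static Map<String, Double> termFrequencies(List<String> tokens) {
        Map<String, Double> tf = new HashMap<>();
        for (String t : tokens) tf.merge(t, 1.0, Double::sum);
        return tf;
    }

    // Smoothed inverse document frequency: log((1 + N) / (1 + df)) + 1.
    static Map<String, Double> inverseDocumentFrequencies(List<Map<String, Double>> docs) {
        Map<String, Double> df = new HashMap<>();
        for (Map<String, Double> doc : docs) {
            for (String term : doc.keySet()) df.merge(term, 1.0, Double::sum);
        }
        Map<String, Double> idf = new HashMap<>();
        int n = docs.size();
        for (Map.Entry<String, Double> e : df.entrySet()) {
            idf.put(e.getKey(), Math.log((1.0 + n) / (1.0 + e.getValue())) + 1.0);
        }
        return idf;
    }

    // Multiplies term counts by IDF; terms unknown to the corpus are dropped.
    static Map<String, Double> weight(Map<String, Double> tf, Map<String, Double> idf) {
        Map<String, Double> w = new HashMap<>();
        for (Map.Entry<String, Double> e : tf.entrySet()) {
            Double i = idf.get(e.getKey());
            if (i != null) w.put(e.getKey(), e.getValue() * i);
        }
        return w;
    }

    // Cosine similarity of two sparse vectors; 0 if either vector is empty.
    static double cosine(Map<String, Double> a, Map<String, Double> b) {
        double dot = 0, na = 0, nb = 0;
        for (Map.Entry<String, Double> e : a.entrySet()) {
            na += e.getValue() * e.getValue();
            Double other = b.get(e.getKey());
            if (other != null) dot += e.getValue() * other;
        }
        for (double v : b.values()) nb += v * v;
        return (na == 0 || nb == 0) ? 0.0 : dot / Math.sqrt(na * nb);
    }

    // One score per document: alpha * similarity + (1 - alpha) * pageRank (assumed blend).
    static double[] score(List<String> htmlDocs, double[] pageRank, String query, double alpha) {
        List<Map<String, Double>> tfs = new ArrayList<>();
        for (String d : htmlDocs) tfs.add(termFrequencies(tokenize(d)));
        Map<String, Double> idf = inverseDocumentFrequencies(tfs);
        Map<String, Double> q = weight(termFrequencies(tokenize(query)), idf);
        double[] scores = new double[htmlDocs.size()];
        for (int i = 0; i < scores.length; i++) {
            scores[i] = alpha * cosine(q, weight(tfs.get(i), idf)) + (1 - alpha) * pageRank[i];
        }
        return scores;
    }
}
```

Calling `SimpleSearch.score(docs, pageRank, "some query", 0.7)` returns one score per document; sorting the document indices by descending score gives the result order. The weighted sum is just one way to combine the two signals, and the right value of `alpha` would have to be tuned for the assignment.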
