The search tag has no wiki summary.
0
votes
0answers
13 views
Sharding / indexing strategy for multi-faceted search
I'm currently thinking about our database structure and how we modify it for scale. Specifically, we're thinking about using ElasticSearch to provide our search functionality.
One common pattern with ...
-1
votes
2answers
78 views
Web Search for a Hard Drive [on hold]
Here is the situation. Our organization has a fair amount of data in the form of documents, images, videos stored on a intranet server.
We need to be able to expose these documents via some sort of ...
-3
votes
0answers
60 views
What are the main differences between Lucene and Google search engines? [closed]
There are more and more companies and organisations which build their own searching service using open source search engine like Lucene. What are the main differences between Lucene and Google search ...
-1
votes
3answers
88 views
Find all lines segments intersections
I have a collection of lines segments, represented by an array.
Ex: [3,7,13,6,9] is 4 line segments: [(3,7)(7,13)] , [(7,13)(13,6)] , [(13,6)(6,9)] , ([6,9)(9,3)]
I want to find all the lines ...
0
votes
2answers
95 views
Using lucene and sql server togheter. Newbie needs directions [closed]
Basically the whole thing can be explained simply:
I need to index one or more SQL Server 2005 databases with lucene so I can search the various records.
I found a lot of examples and documentation ...
0
votes
3answers
204 views
A* search for Sudoku
I have a homework problem for an Artificial Intelligence course that I am having trouble answering.
Consider solving the Sudoku problem using A* search. The start state
has some number of cells ...
1
vote
2answers
141 views
Search algorithm
I would like to create a site where users can post articles with the following optional parts:
A title
Contents (text)
Categories
Keywords
Articles will be stored in mongodb and the site will be ...
2
votes
3answers
178 views
How to combine search words? AND or OR?
I have a basic search in my webpage. When I designed it, I chose to combine the search box inputs with OR. For example: A search for foo bar will be translated to foo OR bar, so every entry which ...
0
votes
2answers
92 views
How can I conceptually model a craigslist search?
I'm trying to understand how something like this works, but I'm inexperienced and I'm trying to understand how the process would work.
Say you have ten categories, a thousand zip codes, and ten ...
0
votes
0answers
36 views
Solving point in interval queries
There are n intervals given by starting (a[1], a[2], ..., a[n]) and ending points (b[1], b[2], ..., b[n]) and m queries of the form: given an integer x find the indices of the intervals which contain ...
2
votes
1answer
151 views
Use a search box that calls on a JSON file? [closed]
I use a JSON file to populate several drop down lists.
The format is:
{
"value" :"lightyear",
"name" :"Light Year(yl)"
},
{
"value" :"astronomicalUnit",
...
1
vote
2answers
212 views
Finding duplicate files? [duplicate]
I am going to be developing a program that detects duplicate files and I was wondering what the best/fastest method would be to do this? I am more interested in what the best hash algorithm would be ...
1
vote
0answers
42 views
How to map the english dictionary to UNSPSC codes?
Is there a db which maps the words from the english dictionary to the UNSPSC codes?(http://www.unspsc.org/)
My problem is the following:
I am building a search system. And the customer searches for ...
1
vote
2answers
385 views
Implementing search over large data set, PHP or Mysql stored procedure?
I'm building an Online Food Ordering System with PHP and MYSQL, One of the feature of the application is to allow users to search for the restaurants by typing the area name.
I would like to know ...
0
votes
3answers
105 views
Binary Search Programming implementation
Binary Search, as we all know requires the elements to be sorted. But we have to take care of unsorted elements too, in the worst case. If the input size is very large, is it a good idea to sort the ...
0
votes
0answers
123 views
How to find sites of similar interests (programming, DIY projects etc ) [duplicate]
I am a programmer and looking for good sites for reading/learning/glancing. Example:
I follow TechCrunch and similar sites
I follow blogs of some programmers/major companies that interest me
I use ...
0
votes
0answers
68 views
Considerations for beginning work on a unified search
I have become interested in creating a unified search for a corporate asset management database. My goal is to allow users to submit queries like:
stuff in building 3210
stuff in building 3210 owned ...
8
votes
1answer
208 views
heuristic for searching through non-perfectly sorted data
Given sorted data, the search solution is obvious. Given unsorted data, sensible options are sort then search or just linear search.
This question is about what to do if the data is somewhat sorted, ...
2
votes
4answers
397 views
Querystring Advanced Search where there are about 20 search fields
I am creating an advanced search page where there are about 20 search fields for a user to filter their search. My question deals with the query string, Is it standard web development practice to have ...
0
votes
0answers
91 views
Inverted indices with minimum frequency counts
I'm trying to find a tool (database system, library, etc.) which can help me with a variation of the inverted index/posting list intersection problem. The variation is that a record matches only if a ...
0
votes
2answers
283 views
What is a good algorithm for priority allocation of work duties?
I am currently doing a project (in PHP) that has the following requirements:
There is a list of people, sorted in a certain priority. Work should be allocated to them by this priority. e.g. If the ...
2
votes
1answer
95 views
Auto-completion or Suggest
How does Google or amazon implement the auto-suggestion at their search box. I am looking for the most used algorithm with technology stack.
PS: I have searched over the net and found this and this ...
6
votes
1answer
217 views
What is “the right way” to do search on a website?
I'm talking the kind of search that auto-suggests your query as you type, the way Google does, the way Wikipedia does, the way Stack Exchange suggests other questions as you type the title, etc. And ...
-1
votes
1answer
184 views
Address search from large text file
Basically I want to develop a Address lookup(part of my project) using C# (and I can use SQL if necessary). I have a very large text file which have all the UK address and postcodes. Addresses needs ...
2
votes
2answers
128 views
doing a full permutation search and replace on a string
I'm writing an app that does something like a custom number (license) plate generator tool where if I ask for the plate "robin" it will suggest I try:
r0bin
rob1n
r0b1n
Are there any published ...
8
votes
1answer
380 views
Good library for search text tokenization
Looking to tokenize some text in the same or similar way in which a search engine would do it.
The reason we are doing this is so that we can run some statistical analysis on the tokens. The language ...
-1
votes
1answer
75 views
Verify uniqueness of new content
I'm working on a review site, where there is a minor issue with almost duplicate reviews across items. Just a few words are changed. It would be very nice to be able to uncover these duplicates before ...
5
votes
2answers
252 views
Is it possible (and practical) to search a string for arbitrary-length repeating patterns?
I've recently developed a huge interest in cryptography, and I'm exploring some of the weaknesses of ECB-mode block ciphers. A common attack scenario involves encrypted cookies, whose fields can be ...
3
votes
3answers
368 views
Data structure: sort and search effectively
I need to have a data structure with say 4 keys . I can sort on any of these keys. What data structure can I opt for? Sorting time should be very little.
I thought of a tree, but it will be only help ...
3
votes
1answer
201 views
What technology/algorithm should be used to abstract meaning or keywords from a passage of text?
Hi and thanks for looking!
Background
I have a project wherein I need to abstract meaning from a passage of text to determine what the text is seeking and then match that text to a list of search ...
1
vote
2answers
106 views
Fuzzy search for a sub-string without tokens
Let's say I have the following lines:
Lorem ipsum dolor sit amet, (tag) consectetur adipiscing elit.
Phasellus congue nisi vel lorem dignissim tristique. (tag)
Etiam vulputate lacus nec velit ...
1
vote
1answer
135 views
DB technology for efficient search in tabular data?
We have a repository of tables. Around 200 tables, each table can be thousands of rows, all tables are originally in Excel sheets.
Each table has a different scheme. All data is text or numbers.
We ...
11
votes
8answers
1k views
Find a “hole” in a list of numbers
What is the fastest way to find the first (smallest) integer that doesn't exist in a given list of unsorted integers (and that is greater than the list's smallest value)?
My primitive approach is ...
1
vote
1answer
301 views
Is there a more efficient way to filter large arrays than preg_match()?
I have a log that our web application builds. Each month it contains around 16,000 entries of a string with about the average sentence worth of text.
To filter/search through these in our admin panel ...
1
vote
1answer
574 views
Tineye.com search algorithm?
I was wondering how does tineye carry a search. Does it store all the images and then extracts exif data? Which in turn must be stored in a database and queried against. So probably it is using some ...
7
votes
1answer
181 views
How important is index size when searching?
My company has recently began using Apache Solr to search its data. As we learn how to use it we have gone down the path of indexing multiple fields to get the results we need. Most of these are ...
-2
votes
1answer
109 views
How to effectivelly search for reusable code? [duplicate]
Possible Duplicate:
Is there an alternative to Google Code Search?
Google code search has been shut down. How to effectivelly search for common functions and algorithms now?
-1
votes
2answers
172 views
Custom search engine [closed]
I have made a web application and now I would like to add a search option to it.
Since it is a learning project, I can't afford to buy some Google license. That is why I consider creating my own ...
2
votes
2answers
193 views
Find feature in NetBeans (or even Eclipse, or others) excluding comments
Is it possible to use the find feauture in NetBeans or other Java IDE excluding commented code from the results? How??
Given a project I'd like to find all the occurrences of a word without getting ...
3
votes
4answers
382 views
Search algorithm with co-ordinate (x,y) hints?
I am about to start writing a UI view where many small text items are laid-out over the view and when the user hovers over a text item a dynamically-generated image is displayed (like a tooltip), ...
0
votes
2answers
119 views
Suggestions for search methodology requested
I am throwing this question out to the 'greater creativity' for some suggestions on search methodology that might solve this problem. I have come up with a somewhat contrived example to attempt to ...
19
votes
1answer
352 views
How should HTML data formats be applied in everyday situations?
Given Google's shift towards a stronger focus on page markup data, how do the data formats used in Schema.org work alongside those for Microformats? How do these (and other specs) compliment each ...
3
votes
2answers
212 views
Best algorithm to correlate similar articles
which is the best way to correlate and group similar articles?
I mean something like Google News, which groups under a single topic different articles from different sources.
I'm not interested in ...
2
votes
1answer
103 views
Doubt regarding search engine/plugin(One present on the website itself)
I am new to web development and trying to study various types of websites as case study. Right now my focus is on how search engines works for an eCommerce website.
I know basic functioning for a ...
3
votes
6answers
183 views
How to search the web for programming related solutions? [closed]
I have the impression that Google has become unusable when searching for programming related questions.
Example: I'm Googling for
XML-RPC Redstone Cookie
I'm expecting results where all three ...
34
votes
8answers
2k views
How to Properly Google for C
The problem with trying to use Google to find tutorials or answers for the C programming language is that C is not an expressive enough name to narrow down the searches. Even coupled with keywords ...
6
votes
1answer
445 views
Space complexity of Iterative Deepening DFS
We read on Wikipedia > Iterative deepening depth-first search that
The space complexity of IDDFS is O(bd), where b is the branching factor and d is the depth of shallowest goal.
Wikipedia also ...
6
votes
3answers
487 views
Looking for good books about the theory behind search engines [closed]
I am working on a project that requires that I understand different techniques used by search engines for the web.
I have a strong scientific and development background, so I am not afraid of highly ...
6
votes
1answer
131 views
How to determine the source of a request in a distributed service system?
Map/Reduce is a great concept for sorting large quantities of data at once. What to do if you have small parts of data and you need to reduce it all the time?
Simple example - choosing a service for ...
4
votes
2answers
150 views
Why the decline in search traffic for popular programming languages?
Is there any solid evidence behind the reasons for decline in search volume for popular programming languages?
Could this possibly be due to improvements in finding necessary information (no need to ...