The tag has no wiki summary.

learn more… | top users | synonyms

0
votes
0answers
13 views

Sharding / indexing strategy for multi-faceted search

I'm currently thinking about our database structure and how we modify it for scale. Specifically, we're thinking about using ElasticSearch to provide our search functionality. One common pattern with ...
-1
votes
2answers
78 views

Web Search for a Hard Drive [on hold]

Here is the situation. Our organization has a fair amount of data in the form of documents, images, videos stored on a intranet server. We need to be able to expose these documents via some sort of ...
-3
votes
0answers
60 views

What are the main differences between Lucene and Google search engines? [closed]

There are more and more companies and organisations which build their own searching service using open source search engine like Lucene. What are the main differences between Lucene and Google search ...
-1
votes
3answers
88 views

Find all lines segments intersections

I have a collection of lines segments, represented by an array. Ex: [3,7,13,6,9] is 4 line segments: [(3,7)(7,13)] , [(7,13)(13,6)] , [(13,6)(6,9)] , ([6,9)(9,3)] I want to find all the lines ...
0
votes
2answers
95 views

Using lucene and sql server togheter. Newbie needs directions [closed]

Basically the whole thing can be explained simply: I need to index one or more SQL Server 2005 databases with lucene so I can search the various records. I found a lot of examples and documentation ...
0
votes
3answers
204 views

A* search for Sudoku

I have a homework problem for an Artificial Intelligence course that I am having trouble answering. Consider solving the Sudoku problem using A* search. The start state has some number of cells ...
1
vote
2answers
141 views

Search algorithm

I would like to create a site where users can post articles with the following optional parts: A title Contents (text) Categories Keywords Articles will be stored in mongodb and the site will be ...
2
votes
3answers
178 views

How to combine search words? AND or OR?

I have a basic search in my webpage. When I designed it, I chose to combine the search box inputs with OR. For example: A search for foo bar will be translated to foo OR bar, so every entry which ...
0
votes
2answers
92 views

How can I conceptually model a craigslist search?

I'm trying to understand how something like this works, but I'm inexperienced and I'm trying to understand how the process would work. Say you have ten categories, a thousand zip codes, and ten ...
0
votes
0answers
36 views

Solving point in interval queries

There are n intervals given by starting (a[1], a[2], ..., a[n]) and ending points (b[1], b[2], ..., b[n]) and m queries of the form: given an integer x find the indices of the intervals which contain ...
2
votes
1answer
151 views

Use a search box that calls on a JSON file? [closed]

I use a JSON file to populate several drop down lists. The format is: { "value" :"lightyear", "name" :"Light Year(yl)" }, { "value" :"astronomicalUnit", ...
1
vote
2answers
212 views

Finding duplicate files? [duplicate]

I am going to be developing a program that detects duplicate files and I was wondering what the best/fastest method would be to do this? I am more interested in what the best hash algorithm would be ...
1
vote
0answers
42 views

How to map the english dictionary to UNSPSC codes?

Is there a db which maps the words from the english dictionary to the UNSPSC codes?(http://www.unspsc.org/) My problem is the following: I am building a search system. And the customer searches for ...
1
vote
2answers
385 views

Implementing search over large data set, PHP or Mysql stored procedure?

I'm building an Online Food Ordering System with PHP and MYSQL, One of the feature of the application is to allow users to search for the restaurants by typing the area name. I would like to know ...
0
votes
3answers
105 views

Binary Search Programming implementation

Binary Search, as we all know requires the elements to be sorted. But we have to take care of unsorted elements too, in the worst case. If the input size is very large, is it a good idea to sort the ...
0
votes
0answers
123 views

How to find sites of similar interests (programming, DIY projects etc ) [duplicate]

I am a programmer and looking for good sites for reading/learning/glancing. Example: I follow TechCrunch and similar sites I follow blogs of some programmers/major companies that interest me I use ...
0
votes
0answers
68 views

Considerations for beginning work on a unified search

I have become interested in creating a unified search for a corporate asset management database. My goal is to allow users to submit queries like: stuff in building 3210 stuff in building 3210 owned ...
8
votes
1answer
208 views

heuristic for searching through non-perfectly sorted data

Given sorted data, the search solution is obvious. Given unsorted data, sensible options are sort then search or just linear search. This question is about what to do if the data is somewhat sorted, ...
2
votes
4answers
397 views

Querystring Advanced Search where there are about 20 search fields

I am creating an advanced search page where there are about 20 search fields for a user to filter their search. My question deals with the query string, Is it standard web development practice to have ...
0
votes
0answers
91 views

Inverted indices with minimum frequency counts

I'm trying to find a tool (database system, library, etc.) which can help me with a variation of the inverted index/posting list intersection problem. The variation is that a record matches only if a ...
0
votes
2answers
283 views

What is a good algorithm for priority allocation of work duties?

I am currently doing a project (in PHP) that has the following requirements: There is a list of people, sorted in a certain priority. Work should be allocated to them by this priority. e.g. If the ...
2
votes
1answer
95 views

Auto-completion or Suggest

How does Google or amazon implement the auto-suggestion at their search box. I am looking for the most used algorithm with technology stack. PS: I have searched over the net and found this and this ...
6
votes
1answer
217 views

What is “the right way” to do search on a website?

I'm talking the kind of search that auto-suggests your query as you type, the way Google does, the way Wikipedia does, the way Stack Exchange suggests other questions as you type the title, etc. And ...
-1
votes
1answer
184 views

Address search from large text file

Basically I want to develop a Address lookup(part of my project) using C# (and I can use SQL if necessary). I have a very large text file which have all the UK address and postcodes. Addresses needs ...
2
votes
2answers
128 views

doing a full permutation search and replace on a string

I'm writing an app that does something like a custom number (license) plate generator tool where if I ask for the plate "robin" it will suggest I try: r0bin rob1n r0b1n Are there any published ...
8
votes
1answer
380 views

Good library for search text tokenization

Looking to tokenize some text in the same or similar way in which a search engine would do it. The reason we are doing this is so that we can run some statistical analysis on the tokens. The language ...
-1
votes
1answer
75 views

Verify uniqueness of new content

I'm working on a review site, where there is a minor issue with almost duplicate reviews across items. Just a few words are changed. It would be very nice to be able to uncover these duplicates before ...
5
votes
2answers
252 views

Is it possible (and practical) to search a string for arbitrary-length repeating patterns?

I've recently developed a huge interest in cryptography, and I'm exploring some of the weaknesses of ECB-mode block ciphers. A common attack scenario involves encrypted cookies, whose fields can be ...
3
votes
3answers
368 views

Data structure: sort and search effectively

I need to have a data structure with say 4 keys . I can sort on any of these keys. What data structure can I opt for? Sorting time should be very little. I thought of a tree, but it will be only help ...
3
votes
1answer
201 views

What technology/algorithm should be used to abstract meaning or keywords from a passage of text?

Hi and thanks for looking! Background I have a project wherein I need to abstract meaning from a passage of text to determine what the text is seeking and then match that text to a list of search ...
1
vote
2answers
106 views

Fuzzy search for a sub-string without tokens

Let's say I have the following lines: Lorem ipsum dolor sit amet, (tag) consectetur adipiscing elit. Phasellus congue nisi vel lorem dignissim tristique. (tag) Etiam vulputate lacus nec velit ...
1
vote
1answer
135 views

DB technology for efficient search in tabular data?

We have a repository of tables. Around 200 tables, each table can be thousands of rows, all tables are originally in Excel sheets. Each table has a different scheme. All data is text or numbers. We ...
11
votes
8answers
1k views

Find a “hole” in a list of numbers

What is the fastest way to find the first (smallest) integer that doesn't exist in a given list of unsorted integers (and that is greater than the list's smallest value)? My primitive approach is ...
1
vote
1answer
301 views

Is there a more efficient way to filter large arrays than preg_match()?

I have a log that our web application builds. Each month it contains around 16,000 entries of a string with about the average sentence worth of text. To filter/search through these in our admin panel ...
1
vote
1answer
574 views

Tineye.com search algorithm?

I was wondering how does tineye carry a search. Does it store all the images and then extracts exif data? Which in turn must be stored in a database and queried against. So probably it is using some ...
7
votes
1answer
181 views

How important is index size when searching?

My company has recently began using Apache Solr to search its data. As we learn how to use it we have gone down the path of indexing multiple fields to get the results we need. Most of these are ...
-2
votes
1answer
109 views

How to effectivelly search for reusable code? [duplicate]

Possible Duplicate: Is there an alternative to Google Code Search? Google code search has been shut down. How to effectivelly search for common functions and algorithms now?
-1
votes
2answers
172 views

Custom search engine [closed]

I have made a web application and now I would like to add a search option to it. Since it is a learning project, I can't afford to buy some Google license. That is why I consider creating my own ...
2
votes
2answers
193 views

Find feature in NetBeans (or even Eclipse, or others) excluding comments

Is it possible to use the find feauture in NetBeans or other Java IDE excluding commented code from the results? How?? Given a project I'd like to find all the occurrences of a word without getting ...
3
votes
4answers
382 views

Search algorithm with co-ordinate (x,y) hints?

I am about to start writing a UI view where many small text items are laid-out over the view and when the user hovers over a text item a dynamically-generated image is displayed (like a tooltip), ...
0
votes
2answers
119 views

Suggestions for search methodology requested

I am throwing this question out to the 'greater creativity' for some suggestions on search methodology that might solve this problem. I have come up with a somewhat contrived example to attempt to ...
19
votes
1answer
352 views

How should HTML data formats be applied in everyday situations?

Given Google's shift towards a stronger focus on page markup data, how do the data formats used in Schema.org work alongside those for Microformats? How do these (and other specs) compliment each ...
3
votes
2answers
212 views

Best algorithm to correlate similar articles

which is the best way to correlate and group similar articles? I mean something like Google News, which groups under a single topic different articles from different sources. I'm not interested in ...
2
votes
1answer
103 views

Doubt regarding search engine/plugin(One present on the website itself)

I am new to web development and trying to study various types of websites as case study. Right now my focus is on how search engines works for an eCommerce website. I know basic functioning for a ...
3
votes
6answers
183 views

How to search the web for programming related solutions? [closed]

I have the impression that Google has become unusable when searching for programming related questions. Example: I'm Googling for XML-RPC Redstone Cookie I'm expecting results where all three ...
34
votes
8answers
2k views

How to Properly Google for C

The problem with trying to use Google to find tutorials or answers for the C programming language is that C is not an expressive enough name to narrow down the searches. Even coupled with keywords ...
6
votes
1answer
445 views

Space complexity of Iterative Deepening DFS

We read on Wikipedia > Iterative deepening depth-first search that The space complexity of IDDFS is O(bd), where b is the branching factor and d is the depth of shallowest goal. Wikipedia also ...
6
votes
3answers
487 views

Looking for good books about the theory behind search engines [closed]

I am working on a project that requires that I understand different techniques used by search engines for the web. I have a strong scientific and development background, so I am not afraid of highly ...
6
votes
1answer
131 views

How to determine the source of a request in a distributed service system?

Map/Reduce is a great concept for sorting large quantities of data at once. What to do if you have small parts of data and you need to reduce it all the time? Simple example - choosing a service for ...
4
votes
2answers
150 views

Why the decline in search traffic for popular programming languages?

Is there any solid evidence behind the reasons for decline in search volume for popular programming languages? Could this possibly be due to improvements in finding necessary information (no need to ...