0
votes
1 answer
26 views

Wrong number of total entities on all pages except the last when using gae search and cursors for pagination

I use GAE's Search API and I'm getting some strange results: the number of documents it reports is about ten times as many as there should be. I'm using an if test in mapreduce to check that entity is ...
0
votes
0 answers
34 views

How to order search results from the gae index by time

I'm using GAE's Search API, which works, but it can only order by date and not by datetime. So how do I order results by time? Must I store hours and minutes in the search index as numbers, i.e. ...
1
vote
0 answers
32 views

django orm search with postgres

For small searches in PostgreSQL, http://django-orm.readthedocs.org/en/latest/orm-pg-fulltext.html can be used easily, as shown in the docs. These are the steps I used to implement it - ...
1
vote
1 answer
43 views

sql injection in sqlite full text search

Consider an sqlite3 FTS4 table: c.execute("CREATE VIRTUAL TABLE docs USING fts4(content)"). Is the following safe from SQL injection, where txt contains a string? I am not sure if parameterised ...
0
votes
1 answer
46 views

Python : parse string to sub string for searching

My web site runs on GAE. I want to implement a tag input box like the one on SO, but search on GAE requires the user to enter a whole word to match. For example, hello world requires the user to enter world or ...
2
votes
1 answer
86 views

regular expression dictionary [Google type search and match with regular expressions]

EDIT: One of the main problems with the code below is due to storing regular expression objects in dictionaries, and how to access them to see if they can match another string. But I will still leave ...
4
votes
4 answers
108 views

How many times string appears in another string

I have a large static binary (10GB) that doesn't change. I want to be able to take as input small strings (15 bytes or fewer each) and then determine which string is the least frequent. I ...
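For illustration only, here is a naive baseline for the problem described in this question: memory-map the file and count each candidate string with repeated `find` calls. The function names `count_occurrences` and `least_frequent` are made up for this sketch, overlapping matches are counted, and every query rescans the whole file, so for a 10GB file a precomputed index would likely be needed in practice.

```python
import mmap

def count_occurrences(path, needle):
    """Count (possibly overlapping) occurrences of the bytes `needle` in a file."""
    count = 0
    # Memory-map the file read-only so it is not loaded into RAM all at once.
    # Note: mmap cannot map an empty file.
    with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
        pos = mm.find(needle)
        while pos != -1:
            count += 1
            # Restart one byte further on so overlapping matches are counted too.
            pos = mm.find(needle, pos + 1)
    return count

def least_frequent(path, needles):
    """Return the needle with the fewest occurrences (first one wins on ties)."""
    return min(needles, key=lambda n: count_occurrences(path, n))
```

Calling `least_frequent("data.bin", [b"foo", b"bar"])` (with `data.bin` as a placeholder path) would return whichever of the two byte strings appears less often.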
0
votes
1 answer
39 views

Python-Whoosh BufferedWriter does not commit to the disk

Here is an example in which I try to index a large collection with whoosh: schema = Schema(name=TEXT(stored=True), m=ID(stored=True), content=KEYWORD(stored=True)) ix = create_in("indexdir", schema) from ...
1
vote
1 answer
95 views

Python: regex a string with a wildcard in it

What I would like to do is to allow the user to input something such as: "Torsion ? Group", wildcard and all (the wildcard being a whole word), and have true or false returned depending on whether found ...
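One possible reading of this question, sketched under assumptions: each standalone `?` in the user's input stands for exactly one whole word, and everything else must match literally. The helper names `wildcard_to_regex` and `matches` are hypothetical, and the sketch assumes the pattern starts and ends with a word character or a `?`.

```python
import re

def wildcard_to_regex(pattern):
    """Compile a pattern where a standalone "?" means exactly one whole word."""
    # Escape literal tokens so user input cannot inject regex syntax.
    parts = [r"\w+" if tok == "?" else re.escape(tok) for tok in pattern.split()]
    # Tokens are separated by one or more whitespace characters.
    return re.compile(r"\b" + r"\s+".join(parts) + r"\b")

def matches(pattern, text):
    """Return True if the wildcard pattern occurs anywhere in the text."""
    return wildcard_to_regex(pattern).search(text) is not None
```

With this sketch, `matches("Torsion ? Group", "a Torsion Free Group")` is true, while `matches("Torsion ? Group", "Torsion Group")` is false because the wildcard must consume a word.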
0
votes
0 answers
443 views

how to get all “index data” using whoosh

I use whoosh for full text search, and I want to know how to get all the 'index data' that has been added. This is my main.py: import cgi,os from google.appengine.ext import webapp from ...
1
vote
1 answer
74 views

Pattern match in file

I am trying to find multiple matches in a file. I use the following code: f = open('/home/evi.nastou/Documenten/filename') text = f.read() #print text urls = re.findall(r"_8o _8r lfloat\" ...
0
votes
1 answer
26 views

ImportError while importing djorm_pgfulltext

I installed djorm-ext-pgfulltext using the command pip install djorm-ext-pgfulltext and tried to import it for full text search in Django: from djorm_pgfulltext.models import SearchManager ...
2
votes
2 answers
57 views

python advanced search library

I have around 80,000 text files and I want to be able to do an advanced search on them. Let's say I have two lists of keywords and I want to return all the files that include at least one of the ...
0
votes
1 answer
657 views

mongodb full text search advice multiple words

I am trying to have some form of fulltext search for one of my mongodb collections (a la flowdock). I create a _keywords entry for each document and populate it with lowercased words from the other ...
2
votes
3 answers
504 views

Full Text Search on Heroku, database and or indexer selection?

I am looking to implement (free as in beer) full text searching on a small application on Heroku (minimal number of users, limited dataset). However, I am struggling to find the best pattern for doing ...
