The text-processing tag has no wiki summary.
0
votes
2answers
157 views
How does Facebook strip html/apostrophes for XSS but also display it?
I'm not quite sure if this is a question for programmers.se rather than stackoverflow, but here goes. So Facebook [or any other large company] when given something like an apostrophe or html, can ...
2
votes
3answers
149 views
Domain-specific language for text search/processing?
I work for an organization that does a lot of work with government data. We have a couple of different projects where we've abstracted out common text search/manipulation operations into reusable ...
3
votes
4answers
825 views
How can I extract words from a sentence and determine what part of speech each is?
I want to write something that takes a sentence and identifies each word it contains and defines what part of speech each word is.
For example
Hello World, I am a sentence
would return this
...
3
votes
1answer
133 views
How to process an endless XML data stream
There is an endless data stream of XML messages (and "heartbeats"), that I receive via a telnet connection and through a site-to-site VPN IPsec tunnel.
I'm still pondering. What is the best/most ...
2
votes
6answers
1k views
Which programming language for text editing?
I need a programming language for text editing and processing (replace, formatting, regular expressions, string comparison, word processing, text analysis, etc.). Which programming language is more ...
6
votes
1answer
152 views
Finding occurrences of a useful words and phrases in strings
I am building an app that analyzes posts by people by pulling their Tweets and Facebook posts. I need to process all the posts and find useful phrases. What I mean by useful is that, any word or ...
-2
votes
1answer
57 views
How can I “bulk paste” a clipboard string of multi-line text into a readable ordered list? [closed]
How can I "bulk paste" a clipboard string of multi-line text into a readable ordered list?
I'm trying to demonstrate how to turn any string of multi-line text into an ordered list.
The script ...
8
votes
4answers
238 views
How should I implement a command processing application?
I want to make a simple, proof-of-concept application (REPL) that takes a number and then processes commands on that number.
Example:
I start with 1. Then I write "add 2", it gives me 3. Then I ...
6
votes
1answer
167 views
Tools for modelling data and workflows using structured text files
Consider a case when I want to try some idea of an application. But I want to avoid investing a lot of effort in coding UI/work flows/database schema etc before I see that it's going to be useful to ...
-1
votes
2answers
146 views
Can non-IT people learn and take advantage of regular expressions? [closed]
Often times, not-IT people has to deal with massive text data, clean it, filter it, modify it. Often times normal office tools like Excel lack the tools to make complex search and replace operations ...
2
votes
1answer
454 views
Best Practice - XML To Excel
I've to read a big XML file with a lot of information. Afterwards I extract the needed information (~20 Points(columns) / ~80 relevant Data (rows, some of them with subdatasets) and write them out in ...
0
votes
1answer
121 views
How to make a text search template? [closed]
I am not really sure what to call this, but I am looking for a way to have a "template" for my code to go by when searching for text.
I am working on a project where a summary for a piece of text is ...
3
votes
3answers
371 views
Separating words in a string
How do I separate words in a string?
In the following I have a random sample of words in a string extracted from text file with over a million words.
Here's the string:
"intervene Pockets ...
-1
votes
1answer
158 views
Algorithm to garble text based upon a weight [closed]
Say I have a weighted range of 0 - 10. 0 being no garbled, 10 being 100% garbled.
I'm looking for an algorithm that will garble plain text based upon this weight. The garbling doesn't need to be ...
4
votes
1answer
305 views
Text comparison algorithm using java-diff-utils
One of the features in our project is to implement a comparison algorithm between two versions of text and provide a % change between the two versions. While I was researching, I came across google ...