The primary purpose of XPath is to address parts of an XML document. It also provides basic facilities for manipulation of strings, numbers and booleans. XPath uses a compact, non-XML syntax. XPath operates on the abstract, logical structure of an XML document, rather than its surface syntax.
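As a small illustration of addressing parts of a document, here is a minimal sketch using Python's standard `xml.etree.ElementTree` module, which supports only a limited subset of XPath (path steps and simple predicates, but not XPath's string or number functions). The sample document, element names, and attribute names are invented for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, made up for illustration only.
XML = """
<library>
  <book id="b1" lang="en"><title>XPath Basics</title></book>
  <book id="b2" lang="de"><title>XML in der Praxis</title></book>
</library>
"""

root = ET.fromstring(XML)

# Path expression: every <title> that is a child of a <book> below the root.
titles = [t.text for t in root.findall(".//book/title")]

# Predicate: only <book> elements whose lang attribute equals 'en'.
english_ids = [b.get("id") for b in root.findall(".//book[@lang='en']")]

print(titles)
print(english_ids)
```

The expressions address nodes by their position in the logical tree (element names and attributes), not by the textual layout of the XML, which is the distinction the description above draws.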
6
votes
2 answers
166 views
SQL query to compare two XML columns
I need to compare two XML columns. The query currently runs against about 1000 rows of data and takes about 20 minutes. Is there any way I can improve this query to run faster?
...
1
vote
0 answers
102 views
Scrape google apps page and store application details in database
Below is a Python script which scrapes a specific Google apps URL, for example https://play.google.com/store/apps/details?id=com.mojang.minecraftpe, and saves the ...
2
votes
1 answer
159 views
Distinct List of attribute values from XML, using a pre-defined set of attribute names
I have an XML stream that looks like the following (attribute and node quantity, and node depth, reduced for demonstration purposes).
...
1
vote
1 answer
88 views
XML editing and merging
I have only just started working with XML files and PowerShell. While there are many ways to accomplish the same thing, I feel that I might be doing the following inefficiently.
This is all based on ...
8
votes
3 answers
158 views
XML splitting query very slow
I have a query to split a delimited string into multiple columns. The query is extremely slow and takes more than 5 minutes to handle 80444 rows. How can I improve execution time?
...
7
votes
1 answer
2k views
Converting partial XML (Node List) to a String
I need to extract part of an XML tree (everything under the root) and convert it to a string. (The resulting string will later be pasted inside ...
2
votes
1 answer
337 views
Use DOM and XPath to make some changes in HTML document
I intend to make some changes in an HTML document, such as removing, replacing, and appending some nodes.
I have several arrays with the same structure, like the following example:
...
3
votes
1 answer
214 views
Parsing large CSV in Perl
I am getting Out of Memory errors when I try to parse a large CSV file (2.5 GB). My computer has 32 GB of memory, but Perl uses all of it up. The CSV has 2 ...
1
vote
1 answer
1k views
Get InnerHTML, OuterHTML, and plain text of an element by ID or class
I do a fair amount of scraping, but I am by no means a good PHP programmer. I always struggle to get the inner HTML of elements using PHP, DOMDocument, and XPath.
I have cobbled together a couple of ...
3
votes
3 answers
104 views
Merging website info into an array
I'm scraping a title and a link from a website and placing them in a single array.
The $i variable seems a bit wrong to me, or is it perfectly fine?
...
1
vote
1 answer
335 views
Performance problem when getting text from a context node with VTD-XML
I have a structure like this:
...
3
votes
2 answers
540 views
Using Scrapy/Xpath to scrape ESPN for football (soccer) commentaries
The class takes a game ID from the ESPN soccer website as input. The code then has to grab the commentary, process it, and also grab the player names/IDs and create a small dictionary of those.
I ...
1
vote
1 answer
112 views
Handling XPath input from a user
I want to design an API which could handle XPath input from a user. I currently model the XPath input in the following way:
...
5
votes
3 answers
765 views
Returning a large number of values from a thread
I have an application in which a thread continuously queries an XML web service every 2 seconds. The returned XML is very big, and I am retrieving a lot of data from it using XPath and ...
2
votes
1 answer
5k views
Parsing and data presentation in Django
For some time now, I've been using a pattern which I think could be done more elegantly.
I have a lot of XML files that need to be represented in Django views. My current approach is like this:
I ...