Tagged Questions
0
votes
1answer
28 views
How can I get content of all scripts in HTML
I am making a Java program, which involves extraction of tags from a webpage.For parsing, i am using Jsoup, which is working fine.But there is some problem with number of tags in the downloaded ...
0
votes
0answers
26 views
Parsing Amazon URL to find book information
I am looking to parse amazon urls to find information about the author and the book. What is the best way to achieve this? Should I be using an html parser or is something better?
Thanks
0
votes
1answer
32 views
Highlight nodes covered by XPath
I want to highlight nodes covered by a given XPath in an HTML page source.
I looked to in HtmlUnit, could not find any thing in API.
At present, I am thinking of doing it following way:
Get XPath ...
0
votes
0answers
50 views
issue in parsing html in java
i want to parse an html and get the next tag value after a matched pattern.
what i am doing
Searching the pattern and find it.
now i want the next value to be printed .
My current code is:
public ...
1
vote
1answer
42 views
Parsing HTML in Android
I have html code that looks like
<td align="right"><select name="Hour">
<option >00</option>
<option >01</option>
<option >02</option>
<option ...
0
votes
0answers
31 views
Parsing and reading HTML content in a String as DOM
What is the best way to parse HTML content in a string? I need to read the content as DOM, what is the best toolkit to do it?
0
votes
1answer
17 views
Extract data between two specific text using Jericho
I am using Jericho to parse Html.
I have a html page in which I need to extract data between two specific text .
<table width="100%" align="left">
<tr><td>
...
0
votes
1answer
64 views
Html parsing text from TD Tag
I have my Html data
<table border='0' cellpadding='3' bgcolor="#CCCCCC" class="hostinfo_title2" width='100%' align="center">
<tr align='center' bgcolor="#ffffff">
...
1
vote
1answer
76 views
Parse HTML Web Page
I parse data from this web site using JSoup:
http://www.skore.com/en/soccer/england/premier-league/results/all/
I get the name of team and result, and I also need to the get name of scorer (it is ...
1
vote
1answer
69 views
Extracting relative links from a web page in proper format using Jsoup
I have parsed the outlinks of a web page which I am going to parse again using Jsoup. But the problem is that, the links are of the form: ../../../pincode/india/andaman-and-nicobar- islands/. In this ...
2
votes
1answer
55 views
JButton ActionListener Integration Issue
I am relatively new to Java and recently I have been working on a GUI based html parser.
The interface is simple, consisting of:
JTextField for entering a search term
JButton b1 to initiate the ...
1
vote
1answer
66 views
Remove desired tag from html using JTidy
I am using JTidy and xpath in parsing HTML, but for the time being parsing text causes me a little trouble because it may include b tag inside, so I don't want to loop over it's child nodes but ...
-1
votes
0answers
47 views
Custom HTML parsing using JAVA [duplicate]
I am in need of a HTML parser that only extract attributes of elements that have been clicked ( like firebug). I want to create one if there isn't anything already available. Any ideas or suggestions ...
-2
votes
1answer
35 views
How to get a specific string with Jsoup
How can I get the productId value (6030283523) using Jsoup? There is another way to do that?
I'm trying
Elements dis = doc.select("dl[class=cs-cfg] > dt").get(4).getAllElements();
for (Element el ...
3
votes
1answer
153 views
JSoup parsing invalid HTML with unclosed tags
Using JSoup inclusive the last release 1.7.2 there is a bug parsing invalid HTML with unclosed tags.
Example:
String tmp = "<a href='www.google.com'>Link<p>Error link</a>";
...