Tagged Questions
0
votes
1 answer
20 views
Python - Best way to parse specific, standardized information in PDF documents?
I am trying to parse these PDF "Arms Sale Notification" letters, found here:
http://www.dsca.mil/pressreleases/36-b/36b_index.htm
Here is a specific PDF document example, of a proposed arms sale to ...
0
votes
2 answers
15 views
Reportlab 2 or more pages per file
How can I generate a PDF with two or more pages with reportlab? I've been unable to find anything in the documentation.
4
votes
1 answer
46 views
Failing to export properly to svg and pdf with matplotlib
I am trying to save a .svg image from an interactive path editor based on this example: path_editor.py. Saving a PNG image works just fine, but when I save to SVG or PDF, the paths are not exported.
...
1
vote
0 answers
17 views
ArcGIS Python Map Book PDF not working blank PDF
The purpose of the code is to make a PDF map book that displays all of the large lakes in North America. I'm trying to run this code to make a map book but it gives me a blank PDF. How can I fix this? ...
1
vote
1 answer
44 views
Embed .SVG files into PDF using reportlab
I have written a script in Python that produces matplotlib graphs and puts them into a PDF report using reportlab.
I am having difficulty embedding SVG image files into my PDF file. I've had no ...
0
votes
1 answer
34 views
How to merge PDFs with only PHP and keep internal and external links
How to merge PDFs with only PHP and keep internal and external links?
A simple question, but I can't find a solution. There is no shell access, so pdftk and similar tools are not an option.
I'm looking for pure ...
0
votes
0 answers
16 views
Detecting corrupted PDFs in file uploads on heroku
At my company, we accept user-uploaded PDFs. A surprisingly high percentage of these uploads are corrupted. Is there a way we could detect this reliably?
The app runs on Python / Heroku. A call to the ...
0
votes
1 answer
45 views
Firefox produces unsearchable pdfs
I'm currently writing software for web automation using Selenium and AutoIt.
I've found a strange issue: for some pages, printing to PDF with Firefox produces unsearchable PDFs. I've tried ff ...
0
votes
0 answers
35 views
PDF with QWebView: missing refresh/repaint after loading
I use QWebView (Python 3.3 + PySide 1.1.2 + Qt 4.8) as a file viewer. Pictures, text, HTML, ... all work fine, but PDF has a display problem. I tested two possible approaches.
internal pdf viewer: after use ...
3
votes
1 answer
62 views
Insert PDF into Excel, Python
I wrote a Python script that I use to create a spreadsheet with various calculations. I also want to insert a PDF into the spreadsheet (a PDF is desirable, but a JPEG is usable). I've been flipping ...
0
votes
0 answers
33 views
xhtml2pdf not converting my webpage django python
I am using the example supplied by http://xhtml2pdf-base.googlecode.com/hg/doc/xhtml2pdf-en.html
I have used the command xhtml2pdf -s http://www.google.com/ in my terminal to convert a webpage to pdf. ...
1
vote
0 answers
67 views
EOF marker not found - How to fix in PyPDF and PyPDF2?
I'm attempting to combine a few PDF files into a single PDF file using Python. I've tried both PyPDF and PyPDF2 - on some files, they both throw this same error:
PdfReadError: EOF marker not found
...
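A possible first diagnostic for the error above, as a rough sketch only: check whether the file actually contains the `%%EOF` trailer near its end, which is the marker the error message says is missing. The function name `has_eof_marker` and the 1024-byte window are illustrative assumptions, not part of PyPDF or PyPDF2, and a file that passes this check can still be damaged in other ways.

```python
import os


def has_eof_marker(path, tail_bytes=1024):
    """Return True if b'%%EOF' appears in the last `tail_bytes` bytes of the file.

    Hypothetical helper for illustration; the window size is an assumption.
    """
    with open(path, "rb") as fh:
        # Find the file size, then jump back to the start of the tail window.
        fh.seek(0, os.SEEK_END)
        size = fh.tell()
        fh.seek(max(0, size - tail_bytes))
        return b"%%EOF" in fh.read()
```

Running this over each input before merging would at least separate truncated files from files that fail for some other reason.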
0
votes
0 answers
32 views
Getting Coordinates of Embedded PDF
I would like to embed a PDF in a webpage. When a user clicks on a certain coordinate within the embedded PDF, I would like to know what coordinate it corresponds to with respect to the PDF (and not ...
0
votes
2 answers
62 views
automatically extract text from pdf for many files
I have about 10,000 PDF files (conference papers), and I need to extract text from a certain section (such as the experimental section) of each paper and save it to a file.
Does anyone know a Java tool or some ...
1
vote
2 answers
100 views
matplotlib linewidth when saving a PDF
I have a figure with some fairly delicate features that are sensitive to linewidth. I want to save this figure as a PDF that can be easily printed (i.e. no scaling on the receiver's side, just ...