0
votes
1answer
20 views

Python - Best way to parse specific, standardized information in PDF documents?

I am trying to parse these PDF "Arms Sale Notification" letters, found here: http://www.dsca.mil/pressreleases/36-b/36b_index.htm Here is a specific PDF document example, of a proposed arms sale to ...
0
votes
2answers
15 views

Reportlab 2 or more pages per file

How can I generate a PDF with two or more pages with reportlab? I've been unable to find anything in the documentation.
4
votes
1answer
46 views

Failing to export properly to svg and pdf with matplotlib

I am trying to save a .svg image from an interactive path editor based on this example: path_editor.py. Saving a PNG image works just fine, but when I save to SVG or PDF, the paths are not exported. ...
1
vote
0answers
17 views

ArcGIS Python Map Book PDF not working blank PDF

The purpose of the code is to make a PDF map book that displays all of the large lakes in North America. I'm trying to run this code to make a map book but it gives me a blank PDF. How can I fix this? ...
1
vote
1answer
44 views

Embed .SVG files into PDF using reportlab

I have written a script in python that produces matplotlib graphs and puts them into a pdf report using reportlab. I am having difficulty embedding SVG image files into my PDF file. I've had no ...
0
votes
1answer
34 views

How to merge PDFs with only PHP and keep internal and external links

How to merge PDFs with only PHP and keep internal and external links? Simple question but I can't find a solution. It is no access to shell so pdftk and so on is unacceptable. I'm looking for pure ...
0
votes
0answers
16 views

Detecting corrupted PDFs in file uploads on heroku

At my company, we accept user-uploaded PDFs. A surprisingly high percent of these uploads are corrupted. Is there a way we could detect this reliably? The app runs on python / heroku. A call to the ...
0
votes
1answer
45 views

Firefox produces unsearchable pdfs

Currently I'm writing software for web automation using selenium and autoit. I've found a strange issue, that for some pages when printing to pdf with firefox I get unsearchable pdfs. I've tried ff ...
0
votes
0answers
35 views

PDF with QWebView: missing refresh/repaint after loading

I use the QWebView (python 3.3 + pyside 1.1.2 + Qt 4.8) as FileViewer. Picture, Text, HTML, ... all fine, but PDF has a display problem. I tested two possible ways. internal pdf viewer: after use ...
3
votes
1answer
62 views

Insert PDF into Excel, Python

I wrote a python script that I use to create a spreasheet with various calculations. I want to further insert a pdf into the spreadhseet (pdf is desireable, but a jpeg is useable). I've been flipping ...
0
votes
0answers
33 views

xhtml2pdf not converting my webpage django python

Am using this example supplied by http://xhtml2pdf-base.googlecode.com/hg/doc/xhtml2pdf-en.html I have used the command xhtml2pdf -s http://www.google.com/ in my terminal to convert a webpage to pdf. ...
1
vote
0answers
67 views

EOF marker not found - How to fix in PyPDF and PyPDF2?

I'm attempting to combine a few PDF files into a single PDF file using Python. I've tried both PyPDF and PyPDF2 - on some files, they both throw this same error: PdfReadError: EOF marker not found ...
0
votes
0answers
32 views

Getting Coordinates of Embedded PDF

I would like to embed a PDF in a webpage. When a user clicks on a certain coordinate within the embedded PDF, I would like to know what coordinate it corresponds to with respect to the PDF (and not ...
0
votes
2answers
62 views

automatically extract text from pdf for many files

I have about 10,000 of pdf files(conf papers) and I need to extract text from certain section (like the experimental section) of these papers and save in a file. Does anyone know a java tool or some ...
1
vote
2answers
100 views

matplotlib linewidth when saving a PDF

I have a figure with some fairly delicate features that are sensitive to linewidth. I want to save this figure as a PDF that can be easily printed (i.e. no scaling on the receiver's side, just ...

1 2 3 4 5 26
15 30 50 per page