Welcome to ftp.vim.org,
Hosted by ftp.nluug.nl Current directory: /ftp/os/Linux/distr/salix/i486/extra-15.0/source/python/python2-pdfminer/ |
Contents of README:PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes than text analysis. PDFMiner comes with two handy tools: pdf2txt.py and dumppdf.py. pdf2txt.py pdf2txt.py extracts text contents from a PDF file. It cannot recognize text drawn as images. It also extracts locations, font names/sizes, writing direction. It requires a password for password protected PDF documents. You cannot extract any text from a PDF document which does not have extraction permission. dumppdf.py dumppdf.py dumps the internal contents of a PDF file in pseudo-XML format. This program is primarily for debugging purposes, but it's also possible to extract some meaningful contents (e.g. images). |
Name Last modified Size
Parent Directory - README 27-May-2022 21:39 1.0K pdfminer-20140328.tar.gz 27-May-2022 21:39 3.9M python2-pdfminer.SlackBuild 11-Mar-2022 06:34 3.2K python2-pdfminer.info 27-May-2022 21:39 335 slack-desc 27-May-2022 21:39 1.1K
NLUUG - Open Systems. Open Standards
Become a member
and get discounts on conferences and more, see the NLUUG website!