Current directory: /ftp/os/Linux/distr/salix/i486/extra-15.0/source/python/python2-pdfminer/
Contents of README:
PDFMiner is a tool for extracting information from PDF documents. Unlike
other PDF-related tools, it focuses entirely on getting and analyzing
text data. PDFMiner allows one to obtain the exact location of text in a
page, as well as other information such as fonts or lines. It includes a
PDF converter that can transform PDF files into other text formats (such
as HTML). It has an extensible PDF parser that can be used for purposes
other than text analysis.

PDFMiner comes with two handy tools: pdf2txt.py and dumppdf.py.

pdf2txt.py

pdf2txt.py extracts the text content of a PDF file. It cannot recognize
text drawn as images. It also extracts text locations, font names and
sizes, and writing direction. It requires a password for
password-protected PDF documents. You cannot extract any text from a PDF
document that does not have extraction permission.
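
As a rough illustration of how pdf2txt.py might be driven from a script,
the sketch below builds a command line and hands it to subprocess. The
helper names (pdf2txt_command, run_pdf2txt) are hypothetical and not part
of PDFMiner. The flags -P (password), -o (output file) and -t (output
type) are taken from the usage message of the pdfminer-20140328 script as
best recalled; run pdf2txt.py without arguments to confirm them for your
installed version.

```python
import subprocess


def pdf2txt_command(pdf_path, password=None, output=None, fmt=None):
    """Build an argument list for pdf2txt.py without running it.

    Hypothetical helper; the flags are assumed from the script's usage text.
    """
    cmd = ["pdf2txt.py"]
    if password is not None:
        cmd += ["-P", password]   # password for a protected document
    if output is not None:
        cmd += ["-o", output]     # write to a file instead of stdout
    if fmt is not None:
        cmd += ["-t", fmt]        # output type, e.g. "text" or "html"
    cmd.append(pdf_path)
    return cmd


def run_pdf2txt(pdf_path, **options):
    """Run pdf2txt.py and return whatever it prints to stdout.

    Assumes pdf2txt.py is on PATH; raises CalledProcessError on failure.
    """
    return subprocess.check_output(pdf2txt_command(pdf_path, **options))
```

For example, run_pdf2txt("manual.pdf", fmt="html", output="manual.html")
would attempt an HTML conversion, assuming pdf2txt.py is installed and
manual.pdf exists.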

dumppdf.py

dumppdf.py dumps the internal contents of a PDF file in pseudo-XML
format. This program is primarily for debugging purposes, but it's also
possible to extract some meaningful contents (e.g. images).
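
A similar sketch for dumppdf.py is shown below. The helper name
dumppdf_command is hypothetical. The flags -a (dump all objects), -i
(comma-separated object IDs) and -P (password) are assumed from the
pdfminer-20140328 usage message and should be checked against the
installed script before relying on them.

```python
import subprocess


def dumppdf_command(pdf_path, object_ids=None, password=None):
    """Build an argument list for dumppdf.py without running it.

    Hypothetical helper; the flags are assumed from the script's usage text.
    """
    cmd = ["dumppdf.py"]
    if password is not None:
        cmd += ["-P", password]
    if object_ids:
        # Dump only the listed objects, e.g. [3, 7] becomes "-i 3,7".
        cmd += ["-i", ",".join(str(i) for i in object_ids)]
    else:
        # No IDs given: ask for every object in the file.
        cmd.append("-a")
    cmd.append(pdf_path)
    return cmd


def run_dumppdf(pdf_path, **options):
    """Run dumppdf.py and return its pseudo-XML output from stdout."""
    return subprocess.check_output(dumppdf_command(pdf_path, **options))
```

Dumping a few selected objects keeps the pseudo-XML output manageable
when debugging a large file; dumping everything is the simpler default.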

Icon   Name                          Last modified      Size
[DIR]  Parent Directory                                 -
[TXT]  README                        27-May-2022 21:39  1.0K
[   ]  pdfminer-20140328.tar.gz      27-May-2022 21:39  3.9M
[TXT]  python2-pdfminer.SlackBuild   11-Mar-2022 06:34  3.2K
[   ]  python2-pdfminer.info         27-May-2022 21:39  335
[TXT]  slack-desc                    27-May-2022 21:39  1.1K