Show openSUSE:Slowroll:Base:1 / python-pdfminer.six

Overview Repositories Revisions Requests Users Attributes Meta

PDF parser and analyzer

https://github.com/pdfminer/pdfminer.six

Fork of PDFMiner using six for Python3 compatibility.

PDFMiner is a tool for extracting information from PDF documents.
Unlike other PDF-related tools, it focuses entirely on getting
and analyzing text data. PDFMiner allows to obtain the exact
location of texts in a page, as well as other information such
as fonts or lines. It includes a PDF converter that can transform
PDF files into other text formats (such as HTML). It has an
extensible PDF parser that can be used for other purposes instead
of text analysis.

Developed at devel:languages:python
Sources inherited from project openSUSE:Factory
3 derived packages
Derived Packages
devel:languages:python

devel:languages:python:backports

home:dirkmueller:AL:TW
Cancel
Download package
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Slowroll:Base:1/python-pdfminer.six && cd $_
Create Badge

Build Results
RPM Lint

Refresh

Source Files

Filename	Size	Changed
pdfminer.six-20231228.tar.gz	0010951846 10.4 MB	9 months ago
python-pdfminer.six.changes	0000005215 5.09 KB	9 months ago
python-pdfminer.six.spec	0000003009 2.94 KB	9 months ago

Latest Revision

Ana Guerrero (anag+factory) accepted request 1137442 from

Dirk Mueller (dirkmueller) 9 months ago (revision 8)

- update to 20231228:
  * Removed Support for Python 3.6 and 3.7
  * Output converter for the hOCR format
  * Font name aliases for Arial, Courier New and Times New Roman
  * Documentation on why special characters can sometimes not be
    extracted
  * Storing Bezier path and dashing style of line in LTCurve
  * Broken CI/CD pipeline by setting upper version limit for
    black, mypy, pip and setuptools
  * `flake8` failures
  * `ValueError` when bmp images with 1 bit channel are decoded
  * `ValueError` when trying to decrypt empty metadata values
  * Sphinx errors during building of documentation
  * `TypeError` when getting default width of font
  * Installing typing-extensions on Python 3.6 and 3.7
  * `TypeError` in cmapdb.py when parsing null characters
  * Color "convenience operators" now (per spec) also set color
    space
  * `ValueError` when extracting images, due to breaking changes
    in Pillow
  * Small typo's and issues in the documentation
  * Ignore non-Unicode cmaps in TrueType fonts
  * Using non-hardcoded version string and setuptools-git-
    versioning to enable installation from source and building on
    Python 3.12
  * Usage of `if __name__ == "__main__"` where it was only
    intended for testing purposes
- drop import-from-non-pythonpath-files.patch (upstream)

  - Option to disable boxes flow layout analysis when using pdf2txt

Places

PDF parser and analyzer

Edit Package python-pdfminer.six

Source Files

Latest Revision

Comments 0

Places