Port details |
- py-pdfminer.six PDF parser and analyzer
- 20220319 textproc =0 20220319Version of this port present on the latest quarterly branch.
- Maintainer: kai@FreeBSD.org
- Port Added: 2018-07-28 12:06:03
- Last Update: 2023-09-29 14:19:56
- Commit Hash: 3754fc5
- Also Listed In: python
- License: MIT
- WWW:
- https://github.com/pdfminer/pdfminer.six
- Description:
- We fathom PDF
Pdfminer.six is a community maintained fork of the original PDFMiner. It is a
tool for extracting information from PDF documents. It focuses on getting and
analyzing text data. Pdfminer.six extracts the text from a page directly from
the sourcecode of the PDF. It can also be used to get the exact location, font
or color of the text.
It is built in a modular way such that each component of pdfminer.six can be
replaced easily. You can implement your own interpreter or rendering device
that uses the power of pdfminer.six for other purposes than text analysis.
Features:
- Parse, analyze, and convert PDF documents.
- PDF-1.7 specification support. (well, almost)
- CJK languages and vertical writing scripts support.
- Various font types (Type1, TrueType, Type3, and CID) support.
- Basic encryption (RC4) support.
- Outline (TOC) extraction.
- Tagged contents extraction.
- Automatic layout analysis.
- ¦ ¦ ¦ ¦
- Manual pages:
- FreshPorts has no man page information for this port.
- pkg-plist: as obtained via:
make generate-plist - There is no configure plist information for this port.
- Dependency lines:
-
- ${PYTHON_PKGNAMEPREFIX}pdfminer.six>0:textproc/py-pdfminer.six@${PY_FLAVOR}
- Conflicts:
- CONFLICTS_INSTALL:
- py??-pdfminer
- py??-pdfminer3k
- Conflicts Matches:
-
There are no Conflicts Matches for this port. This is usually an error.
- To install the port:
- cd /usr/ports/textproc/py-pdfminer.six/ && make install clean
- To add the package, run one of these commands:
- pkg install textproc/py-pdfminer.six
- pkg install py39-pdfminer.six
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above. NOTE: This is a Python port. Instead of py39-pdfminer.six listed in the above command, you can pick from the names under the Packages section.- PKGNAME: py39-pdfminer.six
- Package flavors (<flavor>: <package>)
- distinfo:
- TIMESTAMP = 1649529385
SHA256 (pdfminer.six-20220319.tar.gz) = eff2ce0abeaa4df94dc3461f70eab104487c7b4a2b3c7e9fd0aeec6c5f44d6a6
SIZE (pdfminer.six-20220319.tar.gz) = 7356602
Packages (timestamps in pop-ups are UTC):
- Dependencies
- NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
- Build dependencies:
-
- py39-setuptools>=63.1.0 : devel/py-setuptools@py39
- python3.9 : lang/python39
- Test dependencies:
-
- python3.9 : lang/python39
- Runtime dependencies:
-
- py39-chardet>=0 : textproc/py-chardet@py39
- py39-pillow>0 : graphics/py-pillow@py39
- py39-cryptography>=41.0.3_1,1 : security/py-cryptography@py39
- py39-setuptools>=63.1.0 : devel/py-setuptools@py39
- python3.9 : lang/python39
- This port is required by:
- for Run
-
- editors/setzer
- textproc/py-ocrmypdf
-
Deleted ports which required this port:
- * - deleted ports are only shown under the This port is required by section. It was harder to do for the Required section. Perhaps later...
Configuration Options:
- ===> The following configuration options are available for py39-pdfminer.six-20220319:
IMAGE=on: Enable support for image extraction
===> Use 'make config' to modify these settings
- Options name:
- textproc_py-pdfminer.six
- USES:
- python
- FreshPorts was unable to extract/find any pkg message
- Master Sites:
|
Commit History - (may be incomplete: for full details, see links to repositories near top of page) |
Commit | Credits | Log message |
20220319 29 Sep 2023 14:19:56 |
Po-Chuan Hsieh (sunpoet) |
*: Replace USES=pycryptography* with USE_PYTHON=cryptography{,_build,_test}
- Introduce USE_PYTHON=cryptography{,_build,_test}
- Switch all 96 ports from USES=pycryptography to with
USE_PYTHON=cryptography{,_build,_test}
- Remove Mk/Uses/pycryptography.mk
PR: 273727
Approved by: tcberner (portmgr)
Exp-run by: antoine |
20220319 22 Jul 2023 07:46:38 |
Tobias C. Berner (tcberner) |
framework: convert tree to use USES=pycryptography
A future commit will update to security/py-cryptography will introduce a
rust dependency.
PR: 254853 |
20220319 27 Jun 2023 19:34:34 |
Rene Ladan (rene) |
all: remove explicit versions in USES=python for "3.x+"
The logic in USES=python will automatically convert this to 3.8+ by
itself.
Adjust two ports that only had Python 3.7 mentioned but build fine
on Python 3.8 too.
finance/quickfix: mark BROKEN with PYTHON
libtool: compile: c++ -DHAVE_CONFIG_H -I. -I../.. -I -I. -I.. -I../.. -I../C++
-DLIBICONV_PLUG -DPYTHON_MAJOR_VERSION=3 -Wno-unused-variable
-Wno-maybe-uninitialized -O2 -pipe -DLIBICONV_PLUG -fstack-protector-strong
-fno-strict-aliasing -DLIBICONV_PLUG -Wall -ansi
-Wno-unused-command-line-argument -Wpointer-arith -Wwrite-strings
-Wno-overloaded-virtual -Wno-deprecated-declarations -Wno-deprecated -std=c++0x
-MT _quickfix_la-QuickfixPython.lo -MD -MP -MF
.deps/_quickfix_la-QuickfixPython.Tpo -c QuickfixPython.cpp -fPIC -DPIC -o
.libs/_quickfix_la-QuickfixPython.o
warning: unknown warning option '-Wno-maybe-uninitialized'; did you mean
'-Wno-uninitialized'? [-Wunknown-warning-option]
QuickfixPython.cpp:175:11: fatal error: 'Python.h' file not found
^~~~~~~~~~
1 warning and 1 error generated.
Reviewed by: portmgr, vishwin, yuri
Differential Revision: <https://reviews.freebsd.org/D40568> |
20220319 11 Jan 2023 15:58:34 |
Dmitry Marakasov (amdmi3) |
*/*: rename CHEESESHOP to PYPI in MASTER_SITES
PR: 267994
Differential revision: D37518
Approved by: bapt |
07 Sep 2022 21:58:51 |
Stefan Eßer (se) |
Remove WWW entries moved into port Makefiles
Commit b7f05445c00f has added WWW entries to port Makefiles based on
WWW: lines in pkg-descr files.
This commit removes the WWW: lines of moved-over URLs from these
pkg-descr files.
Approved by: portmgr (tcberner) |
20220319 07 Sep 2022 21:10:59 |
Stefan Eßer (se) |
Add WWW entries to port Makefiles
It has been common practice to have one or more URLs at the end of the
ports' pkg-descr files, one per line and prefixed with "WWW:". These
URLs should point at a project website or other relevant resources.
Access to these URLs required processing of the pkg-descr files, and
they have often become stale over time. If more than one such URL was
present in a pkg-descr file, only the first one was tarnsfered into
the port INDEX, but for many ports only the last line did contain the
port specific URL to further information.
There have been several proposals to make a project URL available as
a macro in the ports' Makefiles, over time.
(Only the first 15 lines of the commit message are shown above ) |
20220319 10 Apr 2022 10:42:24 |
Kai Knoblich (kai) |
textproc/py-pdfminer.six: Update to 20220319
Changelog:
https://github.com/pdfminer/pdfminer.six/releases/tag/20220319
MFH: 2022Q2 |
20211012 21 Mar 2022 16:40:31 |
Kai Knoblich (kai) |
textproc/py-pdfminer.six: Update to 20211012
* Switch back to PyPI as the sdist is there available again.
* Remove the test suite as it makes only sense for upstream testing in
its current condition and therefore isn't supplied with the sdist.
* Update pkg-descr to match reality.
Changelog:
https://github.com/pdfminer/pdfminer.six/blob/develop/CHANGELOG.md#20211012
PR: 262101
Reported by: rsmith@xs4all.nl |
20201018 25 Nov 2021 21:40:11 |
Stefan Eßer (se) |
*/*: Remove redundant '-*' from CONFLICTS definitions
The conflict checks compare the patterns first against the package
names without version (as reported by "pkg query "%n"), then - if
there was no match - agsinst the full package names including the
version (as reported by "pkg query "%n-%v").
Approved by: portmgr (blanket) |
20201018 07 Apr 2021 08:09:01 |
Mathieu Arnold (mat) |
One more small cleanup, forgotten yesterday.
Reported by: lwhsu |
20201018 06 Apr 2021 14:31:07 |
Mathieu Arnold (mat) |
Remove # $FreeBSD$ from Makefiles. |
20201018 30 Nov 2020 12:59:56 |
kai |
textproc/py-pdfminer.six: Update to 20201018
Changelog:
https://github.com/pdfminer/pdfminer.six/blob/20201018/CHANGELOG.md |
20200726 22 Oct 2020 17:56:11 |
sunpoet |
Update to 20200726
Changes: https://github.com/pdfminer/pdfminer.six/releases
PR: 250118
Submitted by: sunpoet (myself)
Approved by: maintainer (timeout, 18 days) |
20191110 13 Jun 2020 12:28:07 |
kai |
textproc/py-pdfminer.six: Update to 20191110
* Use this release for a while as it's the last one that supports Python 2.7
which is still needed by textproc/scancode-toolkit.
* Backport two patches to fix the runtime of CLI scripts.
* Remove "dos2unix" and "shebangfix" macros as they're no longer required.
Changelog since 20181108:
https://github.com/pdfminer/pdfminer.six/blob/20191110/CHANGELOG.md
MFH: 2020Q2 (blanket: bugfix release) |
20181108 03 Jul 2019 07:43:35 |
kai |
textproc/py-pdfminer.six: Update to 20181108
* Switch to GitHub for a while as no tarballs of the current version are
available at PyPI.
* Add a test target to make future QA somewhat easier.
Changelog:
https://github.com/pdfminer/pdfminer.six/releases/tag/20181108 |
20170720_1 25 May 2019 17:24:04 |
tobik |
textproc/py-pdfminer.six: Drop nop and misspelled DOS2UNIXGLOB |
20170720_1 10 Apr 2019 06:18:32 |
antoine |
Bump a few PORTREVISIONs after r498529
With hat: portmgr |
20170720 01 Feb 2019 18:55:18 |
kai |
Change my maintainer email address to my new one at the FreeBSD project.
Reviewed by: miwi (mentor)
Approved by: miwi (mentor)
Differential Revision: https://reviews.freebsd.org/D19050 |
20170720 28 Jul 2018 12:05:28 |
miwi |
PDFMiner.six is a fork of PDFMiner using six for Python 2 + 3 compatibility.
PDFMiner is a tool for extracting information from PDF documents. Unlike other
PDF-related tools, it focuses entirely on getting and analyzing text data.
PDFMiner allows one to obtain the exact location of text in a page, as well as
other information such as fonts or lines. It includes a PDF converter that can
transform PDF files into other text formats (such as HTML). It has an extensible
PDF parser that can be used for other purposes than text analysis.
Features:
- Parse, analyze, and convert PDF documents.
- PDF-1.7 specification support. (well, almost)
- CJK languages and vertical writing scripts support. (Only the first 15 lines of the commit message are shown above ) |