notbugAs an Amazon Associate I earn from qualifying purchases.
Want a good read? Try FreeBSD Mastery: Jails (IT Mastery Book 15)
Want a good monitor light? See my photosAll times are UTC
Ukraine
Port details
py-textract Extract text from any document
1.6.5_9 textproc on this many watch lists=2 search for ports that depend on this port Find issues related to this port Report an issue related to this port View this port on Repology. pkg-fallout 1.6.5_8Version of this port present on the latest quarterly branch.
Maintainer: DtxdF@disroot.org search for ports maintained by this maintainer
Port Added: 2022-10-25 20:51:06
Last Update: 2025-03-08 04:05:21
Commit Hash: 06a08e6
People watching this port, also watch:: jdictionary, py311-Automat, py311-python-gdsii, py311-PyOpenGL, p5-Sane
Also Listed In: python
License: MIT
WWW:
https://github.com/deanmalmgren/textract
Description:
textract provides a single interface for extracting content embedded from Word documents, PowerPoint presentations, PDFs and much more, which can be used for further textual analysis and visualization.
Homepage    cgit ¦ Codeberg ¦ GitHub ¦ GitLab ¦ SVNWeb - no subversion history for this port

Manual pages:
FreshPorts has no man page information for this port.
pkg-plist: as obtained via: make generate-plist
There is no configure plist information for this port.
Dependency lines:
  • ${PYTHON_PKGNAMEPREFIX}textract>0:textproc/py-textract@${PY_FLAVOR}
To install the port:
cd /usr/ports/textproc/py-textract/ && make install clean
To add the package, run one of these commands:
  • pkg install textproc/py-textract
  • pkg install py311-textract
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
NOTE: This is a Python port. Instead of py311-textract listed in the above command, you can pick from the names under the Packages section.
PKGNAME: py311-textract
Package flavors (<flavor>: <package>)
  • py311: py311-textract
distinfo:
TIMESTAMP = 1659835075 SHA256 (textract-1.6.5.tar.gz) = 68f0f09056885821e6c43d8538987518daa94057c306679f2857cc5ee66ad850 SIZE (textract-1.6.5.tar.gz) = 17871

Packages (timestamps in pop-ups are UTC):
py39-textract
ABIaarch64amd64armv6armv7i386powerpcpowerpc64powerpc64le
FreeBSD:13:latest--------
FreeBSD:13:quarterly------1.6.5_7-
FreeBSD:14:latest-------1.6.5_6
FreeBSD:14:quarterly------1.6.5_7-
FreeBSD:15:latest--n/a-n/a---
 

py311-textract
ABIaarch64amd64armv6armv7i386powerpcpowerpc64powerpc64le
FreeBSD:13:latest1.6.5_91.6.5_9-1.6.5_81.6.5_9---
FreeBSD:13:quarterly1.6.5_81.6.5_8-1.6.5_81.6.5_8---
FreeBSD:14:latest1.6.5_81.6.5_9-1.6.5_81.6.5_9---
FreeBSD:14:quarterly1.6.5_81.6.5_8-1.6.5_81.6.5_8---
FreeBSD:15:latest1.6.5_91.6.5_9n/a-n/a---
 

Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
  1. py311-setuptools>=63.1.0 : devel/py-setuptools@py311
  2. python3.11 : lang/python311
Test dependencies:
  1. python3.11 : lang/python311
Runtime dependencies:
  1. py311-argcomplete>=1.10.0 : devel/py-argcomplete@py311
  2. py311-chardet>=3 : textproc/py-chardet@py311
  3. py311-six>1.12.0 : devel/py-six@py311
  4. antiword>0 : textproc/antiword
  5. py311-beautifulsoup>=4.8.0 : www/py-beautifulsoup@py311
  6. py311-docx2txt>=0.8 : textproc/py-docx2txt@py311
  7. ffmpeg>0 : multimedia/ffmpeg
  8. flac>0 : audio/flac
  9. jpeg-turbo>0 : graphics/jpeg-turbo
  10. lame>0 : audio/lame
  11. py311-libxml2>0 : textproc/py-libxml2@py311
  12. libxslt>=1.1.15 : textproc/libxslt
  13. py311-extract-msg>=0.29 : textproc/py-extract-msg@py311
  14. poppler-utils>0 : graphics/poppler-utils
  15. py311-python-pptx>=0.6.18 : textproc/py-python-pptx@py311
  16. pstotext>0 : print/pstotext
  17. sox>0 : audio/sox
  18. py311-speechrecognition>=3.8.1 : audio/py-speechrecognition@py311
  19. py311-xlrd>=1.2.0 : textproc/py-xlrd@py311
  20. tesseract>0 : graphics/tesseract
  21. unrtf>0 : textproc/unrtf
  22. python3.11 : lang/python311
There are no ports dependent upon this port

Configuration Options:
===> The following configuration options are available for py311-textract-1.6.5_9: ANTIWORD=on: DOC document support BEAUTIFULSOUP=on: HTML parsing library DOCX2TXT=on: DOCX document support LIBXML2=on: Python interface for XML parser library LIBXSLT=on: XML stylesheet transformation library MSG=on: MS Outlook MSG file format support PPTX=on: MS PowerPoint PPTX presentations support PS=on: PostScript document support SPREADSHEET=on: XLS and XLSX spreadsheet support UNRTF=on: RTF document support ====> Options available for the group AUDIO FFMPEG=on: FFmpeg support (WMA, AIFF, AC3, APE...) FLAC=on: FLAC lossless audio codec support LAME=on: LAME MP3 audio encoder support POCKETSPHINX=off: Interface to CMU Sphinxbase and Pocketsphinx SOX=on: Command-line audio processing tool SPEECH_RECOGNITION=on: Python library for performing speech recognition ====> Options available for the group OCR JPEG_TURBO=on: SIMD-accelerated JPEG codec TESSERACT=on: Commercial quality open source OCR engine ====> PDF document support PDFMINER=off: PDF parser and analyzer PDFTOTEXT=on: Extract text from a PDF document ===> Use 'make config' to modify these settings
Options name:
textproc_py-textract
USES:
python
FreshPorts was unable to extract/find any pkg message
Master Sites:
Expand this list (2 items)
Collapse this list.
  1. https://files.pythonhosted.org/packages/source/t/textract/
  2. https://pypi.org/packages/source/t/textract/
Collapse this list.

Number of commits found: 14

Commit History - (may be incomplete: for full details, see links to repositories near top of page)
CommitCreditsLog message
1.6.5_9
08 Mar 2025 04:05:21
commit hash: 06a08e69f3acb846ad8252945f53c3a2d211e303commit hash: 06a08e69f3acb846ad8252945f53c3a2d211e303commit hash: 06a08e69f3acb846ad8252945f53c3a2d211e303commit hash: 06a08e69f3acb846ad8252945f53c3a2d211e303 files touched by this commit
Charlie Li (vishwin) search for other commits by this committer
python: bump all USE_PYTHON=distutils consumers after RUN_DEPENDS removal

Any missed ports, feel free to bump.

Any ports that need setuptools at runtime can have the devel/py-setuptools
manually added back to RUN_DEPENDS, but understand that this practice
is deprecated; see CHANGES for details.
1.6.5_8
01 Mar 2024 23:56:15
commit hash: d2f834ba51ec40ef69d7e9cc117db32fa572fddbcommit hash: d2f834ba51ec40ef69d7e9cc117db32fa572fddbcommit hash: d2f834ba51ec40ef69d7e9cc117db32fa572fddbcommit hash: d2f834ba51ec40ef69d7e9cc117db32fa572fddb files touched by this commit
Tobias C. Berner (tcberner) search for other commits by this committer
graphics/poppler: bump consumers of graphics/poppler

Bump after rupdate in 478df79a3071b399f648107456cf371587e84a3f
1.6.5_7
03 Jan 2024 07:18:40
commit hash: 672f0463c70387a32340a83f7ccce244f16f6743commit hash: 672f0463c70387a32340a83f7ccce244f16f6743commit hash: 672f0463c70387a32340a83f7ccce244f16f6743commit hash: 672f0463c70387a32340a83f7ccce244f16f6743 files touched by this commit
Tobias C. Berner (tcberner) search for other commits by this committer
graphics/poppler: bump revision of consumers
1.6.5_6
27 Jun 2023 19:34:34
commit hash: 3d9a815d9c5acbb71f4bb07738bdeab4879feacbcommit hash: 3d9a815d9c5acbb71f4bb07738bdeab4879feacbcommit hash: 3d9a815d9c5acbb71f4bb07738bdeab4879feacbcommit hash: 3d9a815d9c5acbb71f4bb07738bdeab4879feacb files touched by this commit
Rene Ladan (rene) search for other commits by this committer
all: remove explicit versions in USES=python for "3.x+"

The logic in USES=python will automatically convert this to 3.8+ by
itself.

Adjust two ports that only had Python 3.7 mentioned but build fine
on Python 3.8 too.

finance/quickfix: mark BROKEN with PYTHON

libtool: compile:  c++ -DHAVE_CONFIG_H -I. -I../.. -I -I. -I.. -I../.. -I../C++
-DLIBICONV_PLUG -DPYTHON_MAJOR_VERSION=3 -Wno-unused-variable
-Wno-maybe-uninitialized -O2 -pipe -DLIBICONV_PLUG -fstack-protector-strong
-fno-strict-aliasing -DLIBICONV_PLUG -Wall -ansi
-Wno-unused-command-line-argument -Wpointer-arith -Wwrite-strings
-Wno-overloaded-virtual -Wno-deprecated-declarations -Wno-deprecated -std=c++0x
-MT _quickfix_la-QuickfixPython.lo -MD -MP -MF
.deps/_quickfix_la-QuickfixPython.Tpo -c QuickfixPython.cpp  -fPIC -DPIC -o
.libs/_quickfix_la-QuickfixPython.o
warning: unknown warning option '-Wno-maybe-uninitialized'; did you mean
'-Wno-uninitialized'? [-Wunknown-warning-option]
QuickfixPython.cpp:175:11: fatal error: 'Python.h' file not found
          ^~~~~~~~~~
1 warning and 1 error generated.

Reviewed by:	portmgr, vishwin, yuri
Differential Revision:	<https://reviews.freebsd.org/D40568>
1.6.5_6
14 May 2023 05:58:14
commit hash: edbe24d784c93c2986306a2301fc945311d4f8f4commit hash: edbe24d784c93c2986306a2301fc945311d4f8f4commit hash: edbe24d784c93c2986306a2301fc945311d4f8f4commit hash: edbe24d784c93c2986306a2301fc945311d4f8f4 files touched by this commit
Tobias C. Berner (tcberner) search for other commits by this committer
graphics/poppler: bump dependencies
1.6.5_5
25 Apr 2023 15:17:15
commit hash: feb1fa34f58ea796656b86a81c2a2996b0b03c96commit hash: feb1fa34f58ea796656b86a81c2a2996b0b03c96commit hash: feb1fa34f58ea796656b86a81c2a2996b0b03c96commit hash: feb1fa34f58ea796656b86a81c2a2996b0b03c96 files touched by this commit
Christian Weisgerber (naddy) search for other commits by this committer
audio/opus: bump consumers after update to 1.4
1.6.5_4
20 Apr 2023 04:07:00
commit hash: 32edee2a202f6b2356da77e54b231cb5783f5dc2commit hash: 32edee2a202f6b2356da77e54b231cb5783f5dc2commit hash: 32edee2a202f6b2356da77e54b231cb5783f5dc2commit hash: 32edee2a202f6b2356da77e54b231cb5783f5dc2 files touched by this commit
Tobias C. Berner (tcberner) search for other commits by this committer
graphics/poppler: bump consumers after update to 23.04

graphics/poppler was updated in 06339c451266f5843e53bd6406c81a89eedd4ab1
1.6.5_3
30 Jan 2023 13:02:41
commit hash: f5e6e814ea2d98fce4ac3a7823687360e7f01a50commit hash: f5e6e814ea2d98fce4ac3a7823687360e7f01a50commit hash: f5e6e814ea2d98fce4ac3a7823687360e7f01a50commit hash: f5e6e814ea2d98fce4ac3a7823687360e7f01a50 files touched by this commit
Po-Chuan Hsieh (sunpoet) search for other commits by this committer
textproc/py-textract: Add NO_ARCH

- While I'm here, fix indent

Approved by:	portmgr (blanket)
1.6.5_3
30 Jan 2023 12:59:34
commit hash: 4d98b344a91a4b5db62dbb75ea1364470387f1f2commit hash: 4d98b344a91a4b5db62dbb75ea1364470387f1f2commit hash: 4d98b344a91a4b5db62dbb75ea1364470387f1f2commit hash: 4d98b344a91a4b5db62dbb75ea1364470387f1f2 files touched by this commit
Po-Chuan Hsieh (sunpoet) search for other commits by this committer
audio/py-speechrecognition: Update to 3.9.0

- Update PORTNAME: use lowercase
- Change MASTER_SITES from GitHub to PYPI
- Update version requirement of RUN_DEPENDS
- Take maintainership

Changes:	https://github.com/Uberi/speech_recognition/releases
1.6.5_3
11 Jan 2023 15:58:34
commit hash: 77d68471aded6907dbbcb5ef50d557e4c69db656commit hash: 77d68471aded6907dbbcb5ef50d557e4c69db656commit hash: 77d68471aded6907dbbcb5ef50d557e4c69db656commit hash: 77d68471aded6907dbbcb5ef50d557e4c69db656 files touched by this commit
Dmitry Marakasov (amdmi3) search for other commits by this committer
*/*: rename CHEESESHOP to PYPI in MASTER_SITES

PR:			267994
Differential revision:	D37518
Approved by:		bapt
1.6.5_3
09 Jan 2023 12:37:17
commit hash: 6f417247b1d3bbc704090a0ded4895a2850308d2commit hash: 6f417247b1d3bbc704090a0ded4895a2850308d2commit hash: 6f417247b1d3bbc704090a0ded4895a2850308d2commit hash: 6f417247b1d3bbc704090a0ded4895a2850308d2 files touched by this commit
Tobias C. Berner (tcberner) search for other commits by this committer
graphics/poppler: bump dependencies

Follow-up to 9b78681895a5a5b7225299242098f7f2f27d959c
1.6.5_2
08 Dec 2022 05:45:34
commit hash: e3d436085f99fe8675433c2b8753d3f484b19ebfcommit hash: e3d436085f99fe8675433c2b8753d3f484b19ebfcommit hash: e3d436085f99fe8675433c2b8753d3f484b19ebfcommit hash: e3d436085f99fe8675433c2b8753d3f484b19ebf files touched by this commit
Tobias C. Berner (tcberner) search for other commits by this committer
graphics/poppler: bump dependencies
1.6.5_1
08 Nov 2022 05:07:17
commit hash: 3a7ab00b4f8502014a5c42f058bb27f2127b424dcommit hash: 3a7ab00b4f8502014a5c42f058bb27f2127b424dcommit hash: 3a7ab00b4f8502014a5c42f058bb27f2127b424dcommit hash: 3a7ab00b4f8502014a5c42f058bb27f2127b424d files touched by this commit
Tobias C. Berner (tcberner) search for other commits by this committer
graphics/poppler: bump PORTREVISION of dependencies

- after update to 22.11 in d01d0d73b169
1.6.5
25 Oct 2022 20:49:12
commit hash: b6e6388dab6dd78e37adebf738e568997db6d15acommit hash: b6e6388dab6dd78e37adebf738e568997db6d15acommit hash: b6e6388dab6dd78e37adebf738e568997db6d15acommit hash: b6e6388dab6dd78e37adebf738e568997db6d15a files touched by this commit
Li-Wen Hsu (lwhsu) search for other commits by this committer
Author: Jesús Daniel Colmenares Oviedo
Add textproc/py-textract: Extract text from any document

textract provides a single interface for extracting content embedded
from Word documents, PowerPoint presentations, PDFs and much more,
which can be used for further textual analysis and visualization.

WWW: https://github.com/deanmalmgren/textract

PR:		265768

Number of commits found: 14