| Port details | 
	| 
py-html5-parser: Fast implementation of the HTML 5 parsing spec for Python
Version: 0.4.12_4
Category: www
Version of this port present on the latest quarterly branch: 0.4.12_4
Maintainer: madpilot@FreeBSD.org
Port Added: 2017-07-31 16:22:30
Last Update: 2025-09-15 13:55:40
Commit Hash: a6233e5
People watching this port, also watch: jdictionary, py311-Automat, py311-python-gdsii, py311-PyOpenGL, p5-Sane
Also Listed In: python
License: APACHE20
WWW: https://html5-parser.readthedocs.io/
Description:
A fast implementation of the HTML 5 parsing spec for Python. Parsing
is done in C using a variant of the gumbo parser. The gumbo parse
tree is then transformed into an lxml tree, also in C, yielding
parse times that can be a thirtieth of the html5lib parse times.
That is a speedup of 30x. This differs, for instance, from the gumbo
python bindings, where the initial parsing is done in C but the
transformation into the final tree is done in python.
Manual pages: FreshPorts has no man page information for this port.
pkg-plist: as obtained via: make generate-plist
There is no configure plist information for this port.
USE_RC_SUBR (Service Scripts): no SUBR information found for this port
Dependency lines:
${PYTHON_PKGNAMEPREFIX}html5-parser>0:www/py-html5-parser@${PY_FLAVOR}
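The dependency line is a template that the ports framework expands per Python flavor. The sketch below only imitates that expansion with Python's `string.Template` to show what the line resolves to; the real substitution is done by make, and the helper name `expand_dependency` is hypothetical.

```python
from string import Template

# Dependency line exactly as listed for this port.
DEPENDENCY_LINE = "${PYTHON_PKGNAMEPREFIX}html5-parser>0:www/py-html5-parser@${PY_FLAVOR}"


def expand_dependency(flavor: str) -> str:
    """Expand the dependency line for one Python flavor, e.g. 'py311'."""
    # Assumption: the package name prefix is the flavor plus a hyphen, which
    # matches the PKGNAME py311-html5-parser listed for the py311 flavor.
    values = {"PYTHON_PKGNAMEPREFIX": f"{flavor}-", "PY_FLAVOR": flavor}
    return Template(DEPENDENCY_LINE).substitute(values)


print(expand_dependency("py311"))
# prints: py311-html5-parser>0:www/py-html5-parser@py311
```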
To install the port:
cd /usr/ports/www/py-html5-parser/ && make install clean
To add the package, run one of these commands:
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
pkg install www/py-html5-parser
pkg install py311-html5-parser
NOTE: This is a Python port. Instead of py311-html5-parser listed in the above command, you can pick from the names under the Packages section.
PKGNAME: py311-html5-parser
Package flavors (<flavor>: <package>)
py311: py311-html5-parser
distinfo:
TIMESTAMP = 1700496447
SHA256 (html5-parser-0.4.12.tar.gz) = 3d7f89841aa48b976311f43863178c34c141abcf1dd45b67a7339e61cffe5306
SIZE (html5-parser-0.4.12.tar.gz) = 270861 
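The distinfo values let a downloaded distfile be checked by size and SHA-256 digest. The sketch below shows one way to do that check by hand with the standard library; the function names are illustrative, and the ports framework has its own checksum handling that this does not replace.

```python
import hashlib
import os
import re

# distinfo content as listed for this port.
DISTINFO = """\
TIMESTAMP = 1700496447
SHA256 (html5-parser-0.4.12.tar.gz) = 3d7f89841aa48b976311f43863178c34c141abcf1dd45b67a7339e61cffe5306
SIZE (html5-parser-0.4.12.tar.gz) = 270861
"""


def parse_distinfo(text: str) -> dict:
    """Map each file name to its recorded {'SHA256': ..., 'SIZE': ...} values."""
    entries: dict = {}
    pattern = r"^(SHA256|SIZE) \((.+?)\) = (\S+)\s*$"
    for match in re.finditer(pattern, text, re.MULTILINE):
        key, name, value = match.groups()
        entries.setdefault(name, {})[key] = value
    return entries


def verify_distfile(path: str, expected: dict) -> bool:
    """Check a local file's size and SHA-256 digest against distinfo values."""
    # Cheap size comparison first; hashing is skipped on a mismatch.
    if os.path.getsize(path) != int(expected["SIZE"]):
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large distfiles are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected["SHA256"]
```

With a local copy of the tarball, `verify_distfile(path, parse_distinfo(DISTINFO)["html5-parser-0.4.12.tar.gz"])` returns `True` only if both the size and the digest match.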
Packages (timestamps in pop-ups are UTC):
 
Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
py311-lxml>=4.9.2 : devel/py-lxml@py311
pkgconf>=1.3.0_1 : devel/pkgconf
py311-setuptools>=63.1.0 : devel/py-setuptools@py311
python3.11 : lang/python311
Test dependencies:
python3.11 : lang/python311
Runtime dependencies:
python3.11 : lang/python311
Library dependencies:
libxml2.so : textproc/libxml2
This port is required by:
for Build
deskutils/calibre
for Run
deskutils/calibre
www/py-mechanize
Configuration Options:
     No options to configure
Options name: www_py-html5-parser
USES: gnome pkgconfig python
FreshPorts was unable to extract/find any pkg message
Master Sites: | 
| Commit History - (may be incomplete: for full details, see links to repositories near top of page) | 
| Commit | Credits | Log message | 
|---|
| 0.4.12_4 15 Sep 2025 13:55:40
 
       | Hiroki Tagato (tagattie)  | */*: switch dependency from devel/py-lxml5 to devel/py-lxml
Currently, there are two versions of the Python XML processing library
that conflict with each other in the ports tree, namely:
- devel/py-lxml5 (now at version 5.4.0)
- devel/py-lxml  (now at version 6.0.1)
To avoid the situation that some ports depend on py-lxml5 and some
others do on py-lxml (by switching dependencies individually), this
commit switches the dependencies from py-lxml5 to py-lxml at once.
Additional note: There are still two ports (textproc/py-rdflib and
www/py-draftjs-exporter) depending on py-lxml5 since they limit upper
version to less than 6.
PR:		289491
Approved by:	Michiel van Baak Jansen <michiel@vanbaak.eu>, arrowd,
		crees, madpilot, delphij, marcus, nivit, kai,
		skreuzer, fluffy, bofh, thierry, stephen, sunpoet,
		0mp, Eric Camachat <eric@camachat.org> | 
| 0.4.12_3 21 Jul 2025 08:00:55
 
       | Hiroki Tagato (tagattie)  | */*: update dependency on devel/py-lxml to devel/py-lxml5 (2nd attempt)
This is a follow up to the commit 230fb2661c78, which updated some
ports' dependencies on devel/py-lxml to devel/py-lxml5. It was an
attempt to follow the dependency change of
www/py-beautifulsoup. However, the switch was incomplete and broke
some other ports.
It has turned out that the ports depending on devel/py-lxml (at
version 4.9.3) do not limit their dependencies' upper bound to 4.x
except for one (devel/py-pymaven-patch). So updating them to
devel/py-lxml5 (at version 5.4.0) should cause no harm.
This commit switches the dependencies of all the ports (except
devel/py-pymaven-patch) to devel/py-lxml5 to avoid potential conflicts.
Co-authored-by: Daniel Engberg <diizzy@FreeBSD.org>
PR:		287144, 288047
Reported by:	makc (via ports-committers),
		vvd (PR 288047),
		diizzy (PR 287144)
Approved by:	portmgr (chase dependency change, unbreak build)
Fixes:		230fb2661c78 (*/*: update dependency on devel/py-lxml to devel/py-lxml5) | 
| 0.4.12_2 30 Jun 2025 09:04:22
 
         | Baptiste Daroussin (bapt)  | libxml2: chase libxml soversion bump | 
| 0.4.12_1 08 Mar 2025 04:05:21
 
       | Charlie Li (vishwin)  | python: bump all USE_PYTHON=distutils consumers after RUN_DEPENDS removal
Any missed ports, feel free to bump.
Any ports that need setuptools at runtime can have the devel/py-setuptools
manually added back to RUN_DEPENDS, but understand that this practice
is deprecated; see CHANGES for details. | 
| 0.4.12 20 Nov 2023 17:55:57
 
       | Guido Falsi (madpilot)  | www/py-html5-parser: Update to 0.4.12 | 
| 0.4.11 27 Jun 2023 19:34:34
 
       | Rene Ladan (rene)  | all: remove explicit versions in USES=python for "3.x+"
The logic in USES=python will automatically convert this to 3.8+ by
itself.
Adjust two ports that only had Python 3.7 mentioned but build fine
on Python 3.8 too.
finance/quickfix: mark BROKEN with PYTHON
libtool: compile:  c++ -DHAVE_CONFIG_H -I. -I../.. -I -I. -I.. -I../.. -I../C++
-DLIBICONV_PLUG -DPYTHON_MAJOR_VERSION=3 -Wno-unused-variable
-Wno-maybe-uninitialized -O2 -pipe -DLIBICONV_PLUG -fstack-protector-strong
-fno-strict-aliasing -DLIBICONV_PLUG -Wall -ansi
-Wno-unused-command-line-argument -Wpointer-arith -Wwrite-strings
-Wno-overloaded-virtual -Wno-deprecated-declarations -Wno-deprecated -std=c++0x
-MT _quickfix_la-QuickfixPython.lo -MD -MP -MF
.deps/_quickfix_la-QuickfixPython.Tpo -c QuickfixPython.cpp  -fPIC -DPIC -o
.libs/_quickfix_la-QuickfixPython.o
warning: unknown warning option '-Wno-maybe-uninitialized'; did you mean
'-Wno-uninitialized'? [-Wunknown-warning-option]
QuickfixPython.cpp:175:11: fatal error: 'Python.h' file not found
          ^~~~~~~~~~
1 warning and 1 error generated.
Reviewed by:	portmgr, vishwin, yuri
Differential Revision:	<https://reviews.freebsd.org/D40568> | 
| 0.4.11 12 Apr 2023 15:02:42
 
       | Guido Falsi (madpilot)  | www/py-html5-parser: Update to 0.4.11
- Align minimum dependency requirement to what upstream uses in CI
  script. | 
| 0.4.10_2 11 Jan 2023 15:58:34
 
       | Dmitry Marakasov (amdmi3)  | */*: rename CHEESESHOP to PYPI in MASTER_SITES
PR:			267994
Differential revision:	D37518
Approved by:		bapt | 
| 07 Sep 2022 21:58:51 
       | Stefan Eßer (se)  | Remove WWW entries moved into port Makefiles
Commit b7f05445c00f has added WWW entries to port Makefiles based on
WWW: lines in pkg-descr files.
This commit removes the WWW: lines of moved-over URLs from these
pkg-descr files.
Approved by:		portmgr (tcberner) | 
| 0.4.10_2 07 Sep 2022 21:10:59
 
       | Stefan Eßer (se)  | Add WWW entries to port Makefiles
It has been common practice to have one or more URLs at the end of the
ports' pkg-descr files, one per line and prefixed with "WWW:". These
URLs should point at a project website or other relevant resources.
Access to these URLs required processing of the pkg-descr files, and
they have often become stale over time. If more than one such URL was
present in a pkg-descr file, only the first one was transferred into
the port INDEX, but for many ports only the last line did contain the
port specific URL to further information.
There have been several proposals to make a project URL available as
a macro in the ports' Makefiles, over time.
(Only the first 15 lines of the commit message are shown above  ) | 
| 0.4.10_2 10 Apr 2022 19:11:41
 
       | Charlie Li (vishwin)  | textproc/libxml2: bump all LIB_DEPENDS consumers
This is a separate commit to facilitate easier cherry-picking for
quarterly.
PR: 262853, 262940, 262877, 263126
Approved by: fluffy (mentor) | 
| 0.4.10_1 26 Mar 2022 08:27:27
 
       | Matthias Fechner (mfechner)  | textproc/libxml2: bump all dependencies
This should make sure that all dependent ports will pick
up the new version committed with a13ec21cd733f67a9fc0dc00ab45268bdc236246 | 
| 0.4.10 22 Sep 2021 16:05:52
 
       | Guido Falsi (madpilot)  | www/py-html5-parser: Update to 0.4.10 | 
| 0.4.9 07 Apr 2021 08:09:01
 
       | Mathieu Arnold (mat)  | One more small cleanup, forgotten yesterday.
Reported by:	lwhsu | 
| 0.4.9 06 Apr 2021 14:31:07
 
       | Mathieu Arnold (mat)  | Remove # $FreeBSD$ from Makefiles. | 
| 0.4.9 28 Dec 2020 23:02:15
 
     | antoine  | Drop python 2.7 support from a few ports
With hat:	portmgr | 
| 0.4.9 24 Dec 2020 13:46:02
 
     | kai  | Relax hardcoded paths to fix build with Python 3.8.7
Since r558913 Python 3.8 incorporates BPO-42604 [1] which changed the
shared libs naming scheme.  This means "EXT_SUFFIX" is now derived from
SOABI and yields with Python 3.8 to ".cpython-38.so" instead of ".so".
The affected ports strip the libraries in the "post-install" target via
hardcoded path(s) and the build fails at the end because the new extension
is not expected at this place.
Remedy the issue by adding wildcards to these paths.  This should also
prepare the ports for future Python releases, which will use the new shared
libs naming scheme.
[1] https://bugs.python.org/issue42604
PR:		252057
Reported by:	John Kennedy
Reviewed by:	fluffy, koobs
Approved by:	koobs (python) | 
| 0.4.9 11 Jan 2020 15:38:11
 
     | madpilot  | Update py-html5-parser to 0.4.9 | 
| 0.4.8 08 Nov 2019 12:53:37
 
     | tobik  | www: Add missing USES=gnome | 
| 0.4.8 01 Aug 2019 18:00:16
 
     | madpilot  | Update py-html5-parser to 0.4.8 | 
| 0.4.7 14 Jun 2019 12:46:50
 
     | madpilot  | Update html5-parser to 0.4.7 | 
| 0.4.6 15 May 2019 09:53:19
 
     | madpilot  | Update py-html5-parser to 0.4.6 | 
| 0.4.5 20 Jun 2018 17:05:44
 
     | mat  | Use PY_FLAVOR for dependencies.
FLAVOR is the current port's flavor, it should not be used outside of
this scope.
Sponsored by:	Absolight | 
| 0.4.5 18 Jun 2018 08:36:59
 
     | madpilot  | - Update py-html5-parser to 0.4.5
- Add missing dependency
- Strip binaries | 
| 0.4.4 30 Nov 2017 15:50:34
 
       | mat  | Convert Python ports to FLAVORS.
  Ports using USE_PYTHON=distutils are now flavored.  They will
  automatically get flavors (py27, py34, py35, py36) depending on what
  versions they support.
  There is also a USE_PYTHON=flavors for ports that do not use distutils
  but need FLAVORS to be set.  A USE_PYTHON=noflavors can be set if
  using distutils but flavors are not wanted.
  A new USE_PYTHON=optsuffix that will add PYTHON_PKGNAMESUFFIX has been
  added to cope with Python ports that did not have the Python
  PKGNAMEPREFIX but are flavored.
  USES=python now also exports a PY_FLAVOR variable that contains the
(Only the first 15 lines of the commit message are shown above  ) | 
| 0.4.4 03 Aug 2017 22:32:53
 
     | madpilot  | Update to 0.4.4. | 
| 0.4.3 31 Jul 2017 16:22:20
 
     | madpilot  | A fast implementation of the HTML 5 parsing spec for Python. Parsing
is done in C using a variant of the gumbo parser. The gumbo parse
tree is then transformed into an lxml tree, also in C, yielding
parse times that can be a thirtieth of the html5lib parse times.
That is a speedup of 30x. This differs, for instance, from the gumbo
python bindings, where the initial parsing is done in C but the
transformation into the final tree is done in python.
WWW: https://html5-parser.readthedocs.io/ |
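The 24 Dec 2020 entry in the history above describes replacing hardcoded `.so` paths with wildcards once `EXT_SUFFIX` began to include the SOABI tag. A minimal sketch of why a wildcard covers both naming schemes follows; the module name `html_parser` and both patterns are illustrative assumptions, not copied from the port's Makefile.

```python
import sysconfig
from fnmatch import fnmatchcase

# Suffix the running interpreter uses for extension modules (platform dependent).
print("EXT_SUFFIX:", sysconfig.get_config_var("EXT_SUFFIX"))

# Hypothetical file names under the old and new naming schemes.
OLD_NAME = "html_parser.so"
NEW_NAME = "html_parser.cpython-38.so"

# A hardcoded path only matches the old name; a wildcard matches both.
HARDCODED = "html_parser.so"
WILDCARD = "html_parser*.so"


def matches(pattern: str, name: str) -> bool:
    """Return True if the shell-style pattern matches the file name."""
    return fnmatchcase(name, pattern)


for pattern in (HARDCODED, WILDCARD):
    for name in (OLD_NAME, NEW_NAME):
        print(f"{pattern!r} matches {name!r}: {matches(pattern, name)}")
```

Only the wildcard pattern matches the `.cpython-38.so` name, which is the situation the commit message describes for the strip step in `post-install`.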