notbugAs an Amazon Associate I earn from qualifying purchases.
Want a good read? Try FreeBSD Mastery: Jails (IT Mastery Book 15)
Want a good monitor light? See my photosAll times are UTC
Ukraine

Newsfeed changes

The news feed page[s] were not laid out well. Two pages, disjointed information, hard to figure out how to use the optional parameters...

Thankfully, someone told me.

The new page is ready for your review. Please compare these two:

You may also be interested in the Github issue.
Port details
py-tokenizers Fast state-of-the-art tokenizers optimized for research and production
0.21.1 textproc on this many watch lists=2 search for ports that depend on this port Find issues related to this port Report an issue related to this port View this port on Repology. pkg-fallout 0.21.1Version of this port present on the latest quarterly branch.
Maintainer: tagattie@FreeBSD.org search for ports maintained by this maintainer
Port Added: 2024-02-12 08:36:07
Last Update: 2025-03-22 08:35:24
Commit Hash: 63ed0ec
People watching this port, also watch:: jdictionary, py311-Automat, py311-python-gdsii, py311-PyOpenGL, p5-Sane
Also Listed In: python
License: APACHE20
WWW:
https://github.com/huggingface/tokenizers
Description:
Provides an implementation of today's most used tokenizers, with a focus on performance and versatility. Main features: - Train new vocabularies and tokenize, using today's most used tokenizers. - Extremely fast (both training and tokenization), thanks to the Rust implementation. Takes less than 20 seconds to tokenize a GB of text on a server's CPU. - Easy to use, but also extremely versatile. - Designed for research and production. - Normalization comes with alignments tracking. It's always possible to get the part of the original sentence that corresponds to a given token. - Does all the pre-processing: Truncate, Pad, add the special tokens your model needs.
Homepage    cgit ¦ Codeberg ¦ GitHub ¦ GitLab ¦ SVNWeb - no subversion history for this port

Manual pages:
FreshPorts has no man page information for this port.
pkg-plist: as obtained via: make generate-plist
There is no configure plist information for this port.
Dependency lines:
  • ${PYTHON_PKGNAMEPREFIX}tokenizers>0:textproc/py-tokenizers@${PY_FLAVOR}
To install the port:
cd /usr/ports/textproc/py-tokenizers/ && make install clean
To add the package, run one of these commands:
  • pkg install textproc/py-tokenizers
  • pkg install py311-tokenizers
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
NOTE: This is a Python port. Instead of py311-tokenizers listed in the above command, you can pick from the names under the Packages section.
PKGNAME: py311-tokenizers
Package flavors (<flavor>: <package>)
  • py311: py311-tokenizers
distinfo:
TIMESTAMP = 1742605562 SHA256 (tokenizers-0.21.1.tar.gz) = a1bb04dc5b448985f86ecd4b05407f5a8d97cb2c0532199b2a302a604a0165ab SIZE (tokenizers-0.21.1.tar.gz) = 343256

Expand this list (260 items)

Collapse this list.

SHA256 (rust/crates/aho-corasick-1.1.3.crate) = 8e60d3430d3a69478ad0993f19238d2df97c507009a52b3c10addcd7f6bcb916 SIZE (rust/crates/aho-corasick-1.1.3.crate) = 183311 SHA256 (rust/crates/anstream-0.6.18.crate) = 8acc5369981196006228e28809f761875c0327210a891e941f4c683b3a99529b SIZE (rust/crates/anstream-0.6.18.crate) = 29681 SHA256 (rust/crates/anstyle-1.0.10.crate) = 55cc3b69f167a1ef2e161439aa98aed94e6028e5f9a59be9a6ffb47aef1651f9 SIZE (rust/crates/anstyle-1.0.10.crate) = 15725 SHA256 (rust/crates/anstyle-parse-0.2.6.crate) = 3b2d16507662817a6a20a9ea92df6652ee4f94f914589377d69f3b21bc5798a9 SIZE (rust/crates/anstyle-parse-0.2.6.crate) = 22343 SHA256 (rust/crates/anstyle-query-1.1.2.crate) = 79947af37f4177cfead1110013d678905c37501914fba0efea834c3fe9a8d60c SIZE (rust/crates/anstyle-query-1.1.2.crate) = 9969 SHA256 (rust/crates/anstyle-wincon-3.0.6.crate) = 2109dbce0e72be3ec00bed26e6a7479ca384ad226efdd66db8fa2e3a38c83125 SIZE (rust/crates/anstyle-wincon-3.0.6.crate) = 12271 SHA256 (rust/crates/autocfg-1.4.0.crate) = ace50bade8e6234aa140d9a2f552bbee1db4d353f69b8217bc503490fc1a9f26 SIZE (rust/crates/autocfg-1.4.0.crate) = 17712 SHA256 (rust/crates/base64-0.13.1.crate) = 9e1b586273c5702936fe7b7d6896644d8be71e6314cfe09d3167c95f712589e8 SIZE (rust/crates/base64-0.13.1.crate) = 61002 SHA256 (rust/crates/bitflags-1.3.2.crate) = bef38d45163c2f1dde094a7dfd33ccf595c92905c8f8f4fdc18d06fb1037718a SIZE (rust/crates/bitflags-1.3.2.crate) = 23021 SHA256 (rust/crates/bitflags-2.6.0.crate) = b048fb63fd8b5923fc5aa7b340d8e156aec7ec02f0c78fa8a6ddc2613f6f71de SIZE (rust/crates/bitflags-2.6.0.crate) = 45357 SHA256 (rust/crates/bumpalo-3.16.0.crate) = 79296716171880943b8470b5f8d03aa55eb2e645a4874bdbb28adb49162e012c SIZE (rust/crates/bumpalo-3.16.0.crate) = 85677 SHA256 (rust/crates/byteorder-1.5.0.crate) = 1fd0f2584146f6f2ef48085050886acf353beff7305ebd1ae69500e27c67f64b SIZE (rust/crates/byteorder-1.5.0.crate) = 23288 SHA256 (rust/crates/cc-1.2.6.crate) = 8d6dbb628b8f8555f86d0323c2eb39e3ec81901f4b83e091db8a6a76d316a333 SIZE (rust/crates/cc-1.2.6.crate) = 99923 SHA256 (rust/crates/cfg-if-1.0.0.crate) = baf1de4339761588bc0619e3cbc0120ee582ebb74b53b4efbf79117bd2da40fd SIZE (rust/crates/cfg-if-1.0.0.crate) = 7934 SHA256 (rust/crates/colorchoice-1.0.3.crate) = 5b63caa9aa9397e2d9480a9b13673856c78d8ac123288526c37d7839f2a86990 SIZE (rust/crates/colorchoice-1.0.3.crate) = 7923 SHA256 (rust/crates/console-0.15.10.crate) = ea3c6ecd8059b57859df5c69830340ed3c41d30e3da0c1cbed90a96ac853041b SIZE (rust/crates/console-0.15.10.crate) = 36449 SHA256 (rust/crates/crossbeam-deque-0.8.6.crate) = 9dd111b7b7f7d55b72c0a6ae361660ee5853c9af73f70c3c2ef6858b950e2e51 SIZE (rust/crates/crossbeam-deque-0.8.6.crate) = 22471 SHA256 (rust/crates/crossbeam-epoch-0.9.18.crate) = 5b82ac4a3c2ca9c3460964f020e1402edd5753411d7737aa39c3714ad1b5420e SIZE (rust/crates/crossbeam-epoch-0.9.18.crate) = 46875 SHA256 (rust/crates/crossbeam-utils-0.8.21.crate) = d0a5c400df2834b80a4c3327b3aad3a4c4cd4de0629063962b03235697506a28 SIZE (rust/crates/crossbeam-utils-0.8.21.crate) = 42691 SHA256 (rust/crates/darling-0.20.10.crate) = 6f63b86c8a8826a49b8c21f08a2d07338eec8d900540f8630dc76284be802989 SIZE (rust/crates/darling-0.20.10.crate) = 32031 SHA256 (rust/crates/darling_core-0.20.10.crate) = 95133861a8032aaea082871032f5815eb9e98cef03fa916ab4500513994df9e5 SIZE (rust/crates/darling_core-0.20.10.crate) = 65015 SHA256 (rust/crates/darling_macro-0.20.10.crate) = d336a2a514f6ccccaa3e09b02d41d35330c07ddf03a62165fcec10bb561c7806 SIZE (rust/crates/darling_macro-0.20.10.crate) = 1874 SHA256 (rust/crates/derive_builder-0.20.2.crate) = 507dfb09ea8b7fa618fcf76e953f4f5e192547945816d5358edffe39f6f94947 SIZE (rust/crates/derive_builder-0.20.2.crate) = 36661 SHA256 (rust/crates/derive_builder_core-0.20.2.crate) = 2d5bcf7b024d6835cfb3d473887cd966994907effbe9227e8c8219824d06c4e8 SIZE (rust/crates/derive_builder_core-0.20.2.crate) = 31397 SHA256 (rust/crates/derive_builder_macro-0.20.2.crate) = ab63b0e2bf4d5928aff72e83a7dace85d7bba5fe12dcc3c5a572d78caffd3f3c SIZE (rust/crates/derive_builder_macro-0.20.2.crate) = 6360 SHA256 (rust/crates/either-1.13.0.crate) = 60b1af1c220855b6ceac025d3f6ecdd2b7c4894bfe9cd9bda4fbb4bc7c0d4cf0 SIZE (rust/crates/either-1.13.0.crate) = 19169 SHA256 (rust/crates/encode_unicode-1.0.0.crate) = 34aa73646ffb006b8f5147f3dc182bd4bcb190227ce861fc4a4844bf8e3cb2c0 SIZE (rust/crates/encode_unicode-1.0.0.crate) = 56986 SHA256 (rust/crates/env_filter-0.1.3.crate) = 186e05a59d4c50738528153b83b0b0194d3a29507dfec16eccd4b342903397d0 SIZE (rust/crates/env_filter-0.1.3.crate) = 15191 SHA256 (rust/crates/env_logger-0.11.6.crate) = dcaee3d8e3cfc3fd92428d477bc97fc29ec8716d180c0d74c643bb26166660e0 SIZE (rust/crates/env_logger-0.11.6.crate) = 31105 SHA256 (rust/crates/errno-0.3.10.crate) = 33d852cb9b869c2a9b3df2f71a3074817f01e1844f839a144f5fcef059a4eb5d SIZE (rust/crates/errno-0.3.10.crate) = 11824 SHA256 (rust/crates/esaxx-rs-0.1.10.crate) = d817e038c30374a4bcb22f94d0a8a0e216958d4c3dcde369b1439fec4bdda6e6 SIZE (rust/crates/esaxx-rs-0.1.10.crate) = 175210 SHA256 (rust/crates/fastrand-2.3.0.crate) = 37909eebbb50d72f9059c3b6d82c0463f2ff062c9e95845c43a6c9c0355411be SIZE (rust/crates/fastrand-2.3.0.crate) = 15076 SHA256 (rust/crates/fnv-1.0.7.crate) = 3f9eec918d3f24069decb9af1554cad7c880e2da24a9afd88aca000531ab82c1 SIZE (rust/crates/fnv-1.0.7.crate) = 11266 SHA256 (rust/crates/getrandom-0.2.15.crate) = c4567c8db10ae91089c99af84c68c38da3ec2f087c3f82960bcdbf3656b6f4d7 SIZE (rust/crates/getrandom-0.2.15.crate) = 37163 SHA256 (rust/crates/heck-0.5.0.crate) = 2304e00983f87ffb38b55b444b5e3b60a884b5d30c0fca7d82fe33449bbe55ea SIZE (rust/crates/heck-0.5.0.crate) = 11517 SHA256 (rust/crates/humantime-2.1.0.crate) = 9a3a5bfb195931eeb336b2a7b4d761daec841b97f947d34394601737a7bba5e4 SIZE (rust/crates/humantime-2.1.0.crate) = 16749 SHA256 (rust/crates/ident_case-1.0.1.crate) = b9e0384b61958566e926dc50660321d12159025e767c18e043daf26b70104c39 SIZE (rust/crates/ident_case-1.0.1.crate) = 3492 SHA256 (rust/crates/indicatif-0.17.9.crate) = cbf675b85ed934d3c67b5c5469701eec7db22689d0a2139d856e0925fa28b281 SIZE (rust/crates/indicatif-0.17.9.crate) = 65246 SHA256 (rust/crates/indoc-2.0.5.crate) = b248f5224d1d606005e02c97f5aa4e88eeb230488bcc03bc9ca4d7991399f2b5 SIZE (rust/crates/indoc-2.0.5.crate) = 14396 SHA256 (rust/crates/is_terminal_polyfill-1.70.1.crate) = 7943c866cc5cd64cbc25b2e01621d07fa8eb2a1a23160ee81ce38704e97b8ecf SIZE (rust/crates/is_terminal_polyfill-1.70.1.crate) = 7492 SHA256 (rust/crates/itertools-0.11.0.crate) = b1c173a5686ce8bfa551b3563d0c2170bf24ca44da99c7ca4bfdab5418c3fe57 SIZE (rust/crates/itertools-0.11.0.crate) = 125074 SHA256 (rust/crates/itertools-0.12.1.crate) = ba291022dbbd398a455acf126c1e341954079855bc60dfdda641363bd6922569 SIZE (rust/crates/itertools-0.12.1.crate) = 137761 SHA256 (rust/crates/itertools-0.13.0.crate) = 413ee7dfc52ee1a4949ceeb7dbc8a33f2d6c088194d9f922fb8318faf1f01186 SIZE (rust/crates/itertools-0.13.0.crate) = 146261 SHA256 (rust/crates/itoa-1.0.14.crate) = d75a2a4b1b190afb6f5425f10f6a8f959d2ea0b9c2b1d79553551850539e4674 SIZE (rust/crates/itoa-1.0.14.crate) = 11210 SHA256 (rust/crates/js-sys-0.3.76.crate) = 6717b6b5b077764fb5966237269cb3c64edddde4b14ce42647430a78ced9e7b7 SIZE (rust/crates/js-sys-0.3.76.crate) = 54420 SHA256 (rust/crates/lazy_static-1.5.0.crate) = bbd2bcb4c963f2ddae06a2efc7e9f3591312473c50c6685e1f298068316e66fe SIZE (rust/crates/lazy_static-1.5.0.crate) = 14025 SHA256 (rust/crates/libc-0.2.169.crate) = b5aba8db14291edd000dfcc4d620c7ebfb122c613afb886ca8803fa4e128a20a SIZE (rust/crates/libc-0.2.169.crate) = 757901 SHA256 (rust/crates/linux-raw-sys-0.4.14.crate) = 78b3ae25bc7c8c38cec158d1f2757ee79e9b3740fbc7ccf0e59e4b08d793fa89 SIZE (rust/crates/linux-raw-sys-0.4.14.crate) = 1826665 SHA256 (rust/crates/log-0.4.22.crate) = a7a70ba024b9dc04c27ea2f0c0548feb474ec5c54bba33a7f72f873a39d07b24 SIZE (rust/crates/log-0.4.22.crate) = 44027 SHA256 (rust/crates/macro_rules_attribute-0.2.0.crate) = 8a82271f7bc033d84bbca59a3ce3e4159938cb08a9c3aebbe54d215131518a13 SIZE (rust/crates/macro_rules_attribute-0.2.0.crate) = 15408 SHA256 (rust/crates/macro_rules_attribute-proc_macro-0.2.0.crate) = b8dd856d451cc0da70e2ef2ce95a18e39a93b7558bedf10201ad28503f918568 SIZE (rust/crates/macro_rules_attribute-proc_macro-0.2.0.crate) = 8264 SHA256 (rust/crates/matrixmultiply-0.3.9.crate) = 9380b911e3e96d10c1f415da0876389aaf1b56759054eeb0de7df940c456ba1a SIZE (rust/crates/matrixmultiply-0.3.9.crate) = 57819 SHA256 (rust/crates/memchr-2.7.4.crate) = 78ca9ab1a0babb1e7d5695e3530886289c18cf2f87ec19a575a0abdce112e3a3 SIZE (rust/crates/memchr-2.7.4.crate) = 96670 SHA256 (rust/crates/memoffset-0.9.1.crate) = 488016bfae457b036d996092f6cb448677611ce4449e970ceaf42695203f218a SIZE (rust/crates/memoffset-0.9.1.crate) = 9032 SHA256 (rust/crates/minimal-lexical-0.2.1.crate) = 68354c5c6bd36d73ff3feceb05efa59b6acb7626617f4962be322a825e61f79a SIZE (rust/crates/minimal-lexical-0.2.1.crate) = 94841 SHA256 (rust/crates/monostate-0.1.13.crate) = 0d208407d7552cd041d8cdb69a1bc3303e029c598738177a3d87082004dc0e1e SIZE (rust/crates/monostate-0.1.13.crate) = 14073 SHA256 (rust/crates/monostate-impl-0.1.13.crate) = a7ce64b975ed4f123575d11afd9491f2e37bbd5813fbfbc0f09ae1fbddea74e0 SIZE (rust/crates/monostate-impl-0.1.13.crate) = 7302 SHA256 (rust/crates/ndarray-0.16.1.crate) = 882ed72dce9365842bf196bdeedf5055305f11fc8c03dee7bb0194a6cad34841 SIZE (rust/crates/ndarray-0.16.1.crate) = 291026 SHA256 (rust/crates/nom-7.1.3.crate) = d273983c5a657a70a3e8f2a01329822f3b8c8172b73826411a55751e404a0a4a SIZE (rust/crates/nom-7.1.3.crate) = 117570 SHA256 (rust/crates/num-complex-0.4.6.crate) = 73f88a1307638156682bada9d7604135552957b7818057dcef22705b4d509495 SIZE (rust/crates/num-complex-0.4.6.crate) = 30352 SHA256 (rust/crates/num-integer-0.1.46.crate) = 7969661fd2958a5cb096e56c8e1ad0444ac2bbcd0061bd28660485a44879858f SIZE (rust/crates/num-integer-0.1.46.crate) = 22331 SHA256 (rust/crates/num-traits-0.2.19.crate) = 071dfc062690e90b734c0b2273ce72ad0ffa95f0c74596bc250dcfd960262841 SIZE (rust/crates/num-traits-0.2.19.crate) = 51631 SHA256 (rust/crates/number_prefix-0.4.0.crate) = 830b246a0e5f20af87141b25c173cd1b609bd7779a4617d6ec582abaf90870f3 SIZE (rust/crates/number_prefix-0.4.0.crate) = 6922 SHA256 (rust/crates/numpy-0.23.0.crate) = b94caae805f998a07d33af06e6a3891e38556051b8045c615470a71590e13e78 SIZE (rust/crates/numpy-0.23.0.crate) = 79034 SHA256 (rust/crates/once_cell-1.20.2.crate) = 1261fe7e33c73b354eab43b1273a57c8f967d0391e80353e51f764ac02cf6775 SIZE (rust/crates/once_cell-1.20.2.crate) = 33394 SHA256 (rust/crates/onig-6.4.0.crate) = 8c4b31c8722ad9171c6d77d3557db078cab2bd50afcc9d09c8b315c59df8ca4f SIZE (rust/crates/onig-6.4.0.crate) = 32616 SHA256 (rust/crates/onig_sys-69.8.1.crate) = 7b829e3d7e9cc74c7e315ee8edb185bf4190da5acde74afd7fc59c35b1f086e7 SIZE (rust/crates/onig_sys-69.8.1.crate) = 638216 SHA256 (rust/crates/paste-1.0.15.crate) = 57c0d7b74b563b49d38dae00a0c37d4d6de9b432382b2892f0574ddcae73fd0a SIZE (rust/crates/paste-1.0.15.crate) = 18374 SHA256 (rust/crates/pkg-config-0.3.31.crate) = 953ec861398dccce10c670dfeaf3ec4911ca479e9c02154b3a215178c5f566f2 SIZE (rust/crates/pkg-config-0.3.31.crate) = 20880 SHA256 (rust/crates/portable-atomic-1.10.0.crate) = 280dc24453071f1b63954171985a0b0d30058d287960968b9b2aca264c8d4ee6 SIZE (rust/crates/portable-atomic-1.10.0.crate) = 174760 SHA256 (rust/crates/portable-atomic-util-0.2.4.crate) = d8a2f0d8d040d7848a709caf78912debcc3f33ee4b3cac47d73d1e1069e83507 SIZE (rust/crates/portable-atomic-util-0.2.4.crate) = 47043 SHA256 (rust/crates/ppv-lite86-0.2.20.crate) = 77957b295656769bb8ad2b6a6b09d897d94f05c41b069aede1fcdaa675eaea04 SIZE (rust/crates/ppv-lite86-0.2.20.crate) = 22478 SHA256 (rust/crates/proc-macro2-1.0.92.crate) = 37d3544b3f2748c54e147655edb5025752e2303145b5aefb3c3ea2c78b973bb0 SIZE (rust/crates/proc-macro2-1.0.92.crate) = 52353 SHA256 (rust/crates/pyo3-0.23.5.crate) = 7778bffd85cf38175ac1f545509665d0b9b92a198ca7941f131f85f7a4f9a872 SIZE (rust/crates/pyo3-0.23.5.crate) = 1088533 SHA256 (rust/crates/pyo3-build-config-0.23.5.crate) = 94f6cbe86ef3bf18998d9df6e0f3fc1050a8c5efa409bf712e661a4366e010fb SIZE (rust/crates/pyo3-build-config-0.23.5.crate) = 33885 SHA256 (rust/crates/pyo3-ffi-0.23.5.crate) = e9f1b4c431c0bb1c8fb0a338709859eed0d030ff6daa34368d3b152a63dfdd8d SIZE (rust/crates/pyo3-ffi-0.23.5.crate) = 74867 SHA256 (rust/crates/pyo3-macros-0.23.5.crate) = fbc2201328f63c4710f68abdf653c89d8dbc2858b88c5d88b0ff38a75288a9da SIZE (rust/crates/pyo3-macros-0.23.5.crate) = 8856 SHA256 (rust/crates/pyo3-macros-backend-0.23.5.crate) = fca6726ad0f3da9c9de093d6f116a93c1a38e417ed73bf138472cf4064f72028 SIZE (rust/crates/pyo3-macros-backend-0.23.5.crate) = 70938 SHA256 (rust/crates/quote-1.0.38.crate) = 0e4dccaaaf89514f546c693ddc140f729f958c247918a13380cccc6078391acc SIZE (rust/crates/quote-1.0.38.crate) = 31252 SHA256 (rust/crates/rand-0.8.5.crate) = 34af8d1a0e25924bc5b7c43c079c942339d8f0a8b57c39049bef581b46327404 SIZE (rust/crates/rand-0.8.5.crate) = 87113 SHA256 (rust/crates/rand_chacha-0.3.1.crate) = e6c10a63a0fa32252be49d21e7709d4d4baf8d231c2dbce1eaa8141b9b127d88 SIZE (rust/crates/rand_chacha-0.3.1.crate) = 15251 SHA256 (rust/crates/rand_core-0.6.4.crate) = ec0be4795e2f6a28069bec0b5ff3e2ac9bafc99e6a9a7dc3547996c5c816922c SIZE (rust/crates/rand_core-0.6.4.crate) = 22666 SHA256 (rust/crates/rawpointer-0.2.1.crate) = 60a357793950651c4ed0f3f52338f53b2f809f32d83a07f72909fa13e4c6c1e3 SIZE (rust/crates/rawpointer-0.2.1.crate) = 7490 SHA256 (rust/crates/rayon-1.10.0.crate) = b418a60154510ca1a002a752ca9714984e21e4241e804d32555251faf8b78ffa SIZE (rust/crates/rayon-1.10.0.crate) = 180155 SHA256 (rust/crates/rayon-cond-0.3.0.crate) = 059f538b55efd2309c9794130bc149c6a553db90e9d99c2030785c82f0bd7df9 SIZE (rust/crates/rayon-cond-0.3.0.crate) = 9913 SHA256 (rust/crates/rayon-core-1.12.1.crate) = 1465873a3dfdaa8ae7cb14b4383657caab0b3e8a0aa9ae8e04b044854c8dfce2 SIZE (rust/crates/rayon-core-1.12.1.crate) = 70701 SHA256 (rust/crates/regex-1.11.1.crate) = b544ef1b4eac5dc2db33ea63606ae9ffcfac26c1416a2806ae0bf5f56b201191 SIZE (rust/crates/regex-1.11.1.crate) = 254170 SHA256 (rust/crates/regex-automata-0.4.9.crate) = 809e8dc61f6de73b46c85f4c96486310fe304c434cfa43669d7b40f711150908 SIZE (rust/crates/regex-automata-0.4.9.crate) = 618525 SHA256 (rust/crates/regex-syntax-0.8.5.crate) = 2b15c43186be67a4fd63bee50d0303afffcef381492ebe2c5d87f324e1b8815c SIZE (rust/crates/regex-syntax-0.8.5.crate) = 357541 SHA256 (rust/crates/rustc-hash-2.1.0.crate) = c7fb8039b3032c191086b10f11f319a6e99e1e82889c5cc6046f515c9db1d497 SIZE (rust/crates/rustc-hash-2.1.0.crate) = 13316 SHA256 (rust/crates/rustix-0.38.42.crate) = f93dc38ecbab2eb790ff964bb77fa94faf256fd3e73285fd7ba0903b76bedb85 SIZE (rust/crates/rustix-0.38.42.crate) = 378683 SHA256 (rust/crates/ryu-1.0.18.crate) = f3cb5ba0dc43242ce17de99c180e96db90b235b8a9fdc9543c96d2209116bd9f SIZE (rust/crates/ryu-1.0.18.crate) = 47713 SHA256 (rust/crates/serde-1.0.217.crate) = 02fc4265df13d6fa1d00ecff087228cc0a2b5f3c0e87e258d8b94a156e984c70 SIZE (rust/crates/serde-1.0.217.crate) = 79019 SHA256 (rust/crates/serde_derive-1.0.217.crate) = 5a9bf7cf98d04a2b28aead066b7496853d4779c9cc183c440dbac457641e19a0 SIZE (rust/crates/serde_derive-1.0.217.crate) = 57749 SHA256 (rust/crates/serde_json-1.0.134.crate) = d00f4175c42ee48b15416f6193a959ba3a0d67fc699a0db9ad12df9f83991c7d SIZE (rust/crates/serde_json-1.0.134.crate) = 154351 SHA256 (rust/crates/shlex-1.3.0.crate) = 0fda2ff0d084019ba4d7c6f371c95d8fd75ce3524c3cb8fb653a3023f6323e64 SIZE (rust/crates/shlex-1.3.0.crate) = 18713 SHA256 (rust/crates/smallvec-1.13.2.crate) = 3c5e1a9a646d36c3599cd173a41282daf47c44583ad367b8e6837255952e5c67 SIZE (rust/crates/smallvec-1.13.2.crate) = 35216 SHA256 (rust/crates/spm_precompiled-0.1.4.crate) = 5851699c4033c63636f7ea4cf7b7c1f1bf06d0cc03cfb42e711de5a5c46cf326 SIZE (rust/crates/spm_precompiled-0.1.4.crate) = 557527 SHA256 (rust/crates/strsim-0.11.1.crate) = 7da8b5736845d9f2fcb837ea5d9e2628564b3b043a70948a3f0b778838c5fb4f SIZE (rust/crates/strsim-0.11.1.crate) = 14266 SHA256 (rust/crates/syn-2.0.93.crate) = 9c786062daee0d6db1132800e623df74274a0a87322d8e183338e01b3d98d058 SIZE (rust/crates/syn-2.0.93.crate) = 293925 SHA256 (rust/crates/target-lexicon-0.12.16.crate) = 61c41af27dd6d1e27b1b16b489db798443478cef1f06a660c96db617ba5de3b1 SIZE (rust/crates/target-lexicon-0.12.16.crate) = 26488 SHA256 (rust/crates/tempfile-3.14.0.crate) = 28cce251fcbc87fac86a866eeb0d6c2d536fc16d06f184bb61aeae11aa4cee0c SIZE (rust/crates/tempfile-3.14.0.crate) = 35065 SHA256 (rust/crates/thiserror-2.0.9.crate) = f072643fd0190df67a8bab670c20ef5d8737177d6ac6b2e9a236cb096206b2cc SIZE (rust/crates/thiserror-2.0.9.crate) = 28531 SHA256 (rust/crates/thiserror-impl-2.0.9.crate) = 7b50fa271071aae2e6ee85f842e2e28ba8cd2c5fb67f11fcb1fd70b276f9e7d4 SIZE (rust/crates/thiserror-impl-2.0.9.crate) = 21023 SHA256 (rust/crates/unicode-ident-1.0.14.crate) = adb9e6ca4f869e1180728b7950e35922a7fc6397f7b641499e8f3ef06e50dc83 SIZE (rust/crates/unicode-ident-1.0.14.crate) = 47547 SHA256 (rust/crates/unicode-normalization-alignments-0.1.12.crate) = 43f613e4fa046e69818dd287fdc4bc78175ff20331479dab6e1b0f98d57062de SIZE (rust/crates/unicode-normalization-alignments-0.1.12.crate) = 91546 SHA256 (rust/crates/unicode-segmentation-1.12.0.crate) = f6ccf251212114b54433ec949fd6a7841275f9ada20dddd2f29e9ceea4501493 SIZE (rust/crates/unicode-segmentation-1.12.0.crate) = 106323 SHA256 (rust/crates/unicode-width-0.2.0.crate) = 1fc81956842c57dac11422a97c3b8195a1ff727f06e85c84ed2e8aa277c9a0fd SIZE (rust/crates/unicode-width-0.2.0.crate) = 271509 SHA256 (rust/crates/unicode_categories-0.1.1.crate) = 39ec24b3121d976906ece63c9daad25b85969647682eee313cb5779fdd69e14e SIZE (rust/crates/unicode_categories-0.1.1.crate) = 87298 SHA256 (rust/crates/unindent-0.2.3.crate) = c7de7d73e1754487cb58364ee906a499937a0dfabd86bcb980fa99ec8c8fa2ce SIZE (rust/crates/unindent-0.2.3.crate) = 7306 SHA256 (rust/crates/utf8parse-0.2.2.crate) = 06abde3611657adf66d383f00b093d7faecc7fa57071cce2578660c9f1010821 SIZE (rust/crates/utf8parse-0.2.2.crate) = 13499 SHA256 (rust/crates/wasi-0.11.0+wasi-snapshot-preview1.crate) = 9c8d87e72b64a3b4db28d11ce29237c246188f4f51057d65a7eab63b7987e423 SIZE (rust/crates/wasi-0.11.0+wasi-snapshot-preview1.crate) = 28131 SHA256 (rust/crates/wasm-bindgen-0.2.99.crate) = a474f6281d1d70c17ae7aa6a613c87fce69a127e2624002df63dcb39d6cf6396 SIZE (rust/crates/wasm-bindgen-0.2.99.crate) = 46136 SHA256 (rust/crates/wasm-bindgen-backend-0.2.99.crate) = 5f89bb38646b4f81674e8f5c3fb81b562be1fd936d84320f3264486418519c79 SIZE (rust/crates/wasm-bindgen-backend-0.2.99.crate) = 30928 SHA256 (rust/crates/wasm-bindgen-macro-0.2.99.crate) = 2cc6181fd9a7492eef6fef1f33961e3695e4579b9872a6f7c83aee556666d4fe SIZE (rust/crates/wasm-bindgen-macro-0.2.99.crate) = 7011 SHA256 (rust/crates/wasm-bindgen-macro-support-0.2.99.crate) = 30d7a95b763d3c45903ed6c81f156801839e5ee968bb07e534c44df0fcd330c2 SIZE (rust/crates/wasm-bindgen-macro-support-0.2.99.crate) = 22800 SHA256 (rust/crates/wasm-bindgen-shared-0.2.99.crate) = 943aab3fdaaa029a6e0271b35ea10b72b943135afe9bffca82384098ad0e06a6 SIZE (rust/crates/wasm-bindgen-shared-0.2.99.crate) = 7773 SHA256 (rust/crates/web-time-1.1.0.crate) = 5a6580f308b1fad9207618087a65c04e7a10bc77e02c8e84e9b00dd4b12fa0bb SIZE (rust/crates/web-time-1.1.0.crate) = 18026 SHA256 (rust/crates/windows-sys-0.59.0.crate) = 1e38bc4d79ed67fd075bcc251a1c39b32a1776bbe92e5bef1f0bf1f8c531853b SIZE (rust/crates/windows-sys-0.59.0.crate) = 2387323 SHA256 (rust/crates/windows-targets-0.52.6.crate) = 9b724f72796e036ab90c1021d4780d4d3d648aca59e491e6b98e725b84e99973 SIZE (rust/crates/windows-targets-0.52.6.crate) = 6403 SHA256 (rust/crates/windows_aarch64_gnullvm-0.52.6.crate) = 32a4622180e7a0ec044bb555404c800bc9fd9ec262ec147edd5989ccd0c02cd3 SIZE (rust/crates/windows_aarch64_gnullvm-0.52.6.crate) = 435718 SHA256 (rust/crates/windows_aarch64_msvc-0.52.6.crate) = 09ec2a7bb152e2252b53fa7803150007879548bc709c039df7627cabbd05d469 SIZE (rust/crates/windows_aarch64_msvc-0.52.6.crate) = 832615 SHA256 (rust/crates/windows_i686_gnu-0.52.6.crate) = 8e9b5ad5ab802e97eb8e295ac6720e509ee4c243f69d781394014ebfe8bbfa0b SIZE (rust/crates/windows_i686_gnu-0.52.6.crate) = 880402 SHA256 (rust/crates/windows_i686_gnullvm-0.52.6.crate) = 0eee52d38c090b3caa76c563b86c3a4bd71ef1a819287c19d586d7334ae8ed66 SIZE (rust/crates/windows_i686_gnullvm-0.52.6.crate) = 475940 SHA256 (rust/crates/windows_i686_msvc-0.52.6.crate) = 240948bc05c5e7c6dabba28bf89d89ffce3e303022809e73deaefe4f6ec56c66 SIZE (rust/crates/windows_i686_msvc-0.52.6.crate) = 901163 SHA256 (rust/crates/windows_x86_64_gnu-0.52.6.crate) = 147a5c80aabfbf0c7d901cb5895d1de30ef2907eb21fbbab29ca94c5b08b1a78 SIZE (rust/crates/windows_x86_64_gnu-0.52.6.crate) = 836363 SHA256 (rust/crates/windows_x86_64_gnullvm-0.52.6.crate) = 24d5b23dc417412679681396f2b49f3de8c1473deb516bd34410872eff51ed0d SIZE (rust/crates/windows_x86_64_gnullvm-0.52.6.crate) = 435707 SHA256 (rust/crates/windows_x86_64_msvc-0.52.6.crate) = 589f6da84c646204747d1270a2a5661ea66ed1cced2631d546fdfb155959f9ec SIZE (rust/crates/windows_x86_64_msvc-0.52.6.crate) = 832564 SHA256 (rust/crates/zerocopy-0.7.35.crate) = 1b9b4fd18abc82b8136838da5d50bae7bdea537c574d8dc1a34ed098d6c166f0 SIZE (rust/crates/zerocopy-0.7.35.crate) = 152645 SHA256 (rust/crates/zerocopy-derive-0.7.35.crate) = fa4f8080344d4671fb4e831a13ad1e68092748387dfc4f55e356242fae12ce3e SIZE (rust/crates/zerocopy-derive-0.7.35.crate) = 37829

Collapse this list.


Packages (timestamps in pop-ups are UTC):
py311-tokenizers
ABIaarch64amd64armv6armv7i386powerpcpowerpc64powerpc64le
FreeBSD:13:latest0.21.0_20.21.1--0.21.1---
FreeBSD:13:quarterly0.21.00.21.0-0.21.00.21.0---
FreeBSD:14:latest0.21.0_20.21.1--0.21.1---
FreeBSD:14:quarterly0.21.00.21.0-0.21.00.21.0---
FreeBSD:15:latest0.21.0_20.21.1n/a0.21.0_2n/a---
Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
  1. py311-maturin>=1.0<2.0 : devel/py-maturin@py311
  2. rust>=1.85.0 : lang/rust
  3. pkgconf>=1.3.0_1 : devel/pkgconf
  4. python3.11 : lang/python311
  5. py311-build>=0 : devel/py-build@py311
  6. py311-installer>=0 : devel/py-installer@py311
Test dependencies:
  1. oniguruma.pc : devel/oniguruma
  2. python3.11 : lang/python311
Runtime dependencies:
  1. py311-huggingface-hub>=0.16.4<1.0 : misc/py-huggingface-hub@py311
  2. python3.11 : lang/python311
This port is required by:
for Run
  1. misc/py-aider-chat
  2. misc/py-anthropic
  3. misc/py-litellm

Configuration Options:
No options to configure
Options name:
textproc_py-tokenizers
USES:
cargo python
FreshPorts was unable to extract/find any pkg message
Master Sites:
Expand this list (2 items)
Collapse this list.
  1. https://files.pythonhosted.org/packages/source/t/tokenizers/
  2. https://pypi.org/packages/source/t/tokenizers/
Collapse this list.

Number of commits found: 19

Commit History - (may be incomplete: for full details, see links to repositories near top of page)
CommitCreditsLog message
0.21.1
22 Mar 2025 08:35:24
commit hash: 63ed0ecfbf1fd3717af96ee8571ebdbc525d8beecommit hash: 63ed0ecfbf1fd3717af96ee8571ebdbc525d8beecommit hash: 63ed0ecfbf1fd3717af96ee8571ebdbc525d8beecommit hash: 63ed0ecfbf1fd3717af96ee8571ebdbc525d8bee files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: update to 0.21.1

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.21.1

Reported by:	portscout
0.21.0_2
24 Feb 2025 07:55:52
commit hash: cf7c4b037d7e9cadfa539f3b9cd45396c1a00dfbcommit hash: cf7c4b037d7e9cadfa539f3b9cd45396c1a00dfbcommit hash: cf7c4b037d7e9cadfa539f3b9cd45396c1a00dfbcommit hash: cf7c4b037d7e9cadfa539f3b9cd45396c1a00dfb files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.85.0

PR:		284884
0.21.0_1
20 Jan 2025 11:06:50
commit hash: fccaa1b326310b7f33556fe133b509260a90fe7fcommit hash: fccaa1b326310b7f33556fe133b509260a90fe7fcommit hash: fccaa1b326310b7f33556fe133b509260a90fe7fcommit hash: fccaa1b326310b7f33556fe133b509260a90fe7f files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.84.0

PR:		283962
0.21.0
06 Dec 2024 02:24:32
commit hash: 2b6d4bc08c9f4a5634b9dca9ecb3b6fb02682033commit hash: 2b6d4bc08c9f4a5634b9dca9ecb3b6fb02682033commit hash: 2b6d4bc08c9f4a5634b9dca9ecb3b6fb02682033commit hash: 2b6d4bc08c9f4a5634b9dca9ecb3b6fb02682033 files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: update to 0.21.0

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.21.0

Reported by:	Repology
0.20.3_2
01 Dec 2024 09:24:18
commit hash: 1d1fd15a3e499eed9e604dac94bab73c2fea4b2fcommit hash: 1d1fd15a3e499eed9e604dac94bab73c2fea4b2fcommit hash: 1d1fd15a3e499eed9e604dac94bab73c2fea4b2fcommit hash: 1d1fd15a3e499eed9e604dac94bab73c2fea4b2f files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.83.0

PR:		283000
0.20.3_1
08 Nov 2024 08:24:20
commit hash: ca048c2884dc9822310a87e2dfebb8b1647095e2commit hash: ca048c2884dc9822310a87e2dfebb8b1647095e2commit hash: ca048c2884dc9822310a87e2dfebb8b1647095e2commit hash: ca048c2884dc9822310a87e2dfebb8b1647095e2 files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.82.0

PR:		282516
0.20.3
07 Nov 2024 12:30:23
commit hash: c857714fe0171b077b877238ca80112148241977commit hash: c857714fe0171b077b877238ca80112148241977commit hash: c857714fe0171b077b877238ca80112148241977commit hash: c857714fe0171b077b877238ca80112148241977 files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: update to 0.20.3

Changelog:
- https://github.com/huggingface/tokenizers/releases/tag/v0.20.2
- https://github.com/huggingface/tokenizers/releases/tag/v0.20.3

Reported by:	portscout
0.20.1
18 Oct 2024 05:12:30
commit hash: 1f65a995d572591f6d84d7a207e0c7cc5a7aefedcommit hash: 1f65a995d572591f6d84d7a207e0c7cc5a7aefedcommit hash: 1f65a995d572591f6d84d7a207e0c7cc5a7aefedcommit hash: 1f65a995d572591f6d84d7a207e0c7cc5a7aefed files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: update to 0.20.1

While here, add LICENSE_FILE.

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.20.1

Reported by:	Repology
0.20.0_2
10 Sep 2024 11:00:34
commit hash: 89ab81d0e2eca6b298b0efd3f4577d468abf18d5commit hash: 89ab81d0e2eca6b298b0efd3f4577d468abf18d5commit hash: 89ab81d0e2eca6b298b0efd3f4577d468abf18d5commit hash: 89ab81d0e2eca6b298b0efd3f4577d468abf18d5 files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.81.0

PR:		281300
0.20.0_1
10 Sep 2024 10:58:07
commit hash: c0d07f1127793ffc97a6c1db78423d0d16bdc5efcommit hash: c0d07f1127793ffc97a6c1db78423d0d16bdc5efcommit hash: c0d07f1127793ffc97a6c1db78423d0d16bdc5efcommit hash: c0d07f1127793ffc97a6c1db78423d0d16bdc5ef files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
Author: Siva Mahadevan
*/*: remove STRIP_CMD calls in rust based ports

This is not needed after bc4fedc1fec0d359365c04d43be9e32bf101a50e

PR:		246993
Differential Revision:	https://reviews.freebsd.org/D46503
0.20.0_1
26 Aug 2024 08:08:01
commit hash: 7a55f9542531c541f36ae5cf3a58d6ce92ec7bffcommit hash: 7a55f9542531c541f36ae5cf3a58d6ce92ec7bffcommit hash: 7a55f9542531c541f36ae5cf3a58d6ce92ec7bffcommit hash: 7a55f9542531c541f36ae5cf3a58d6ce92ec7bff files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.80.1

PR:		280490
0.20.0
10 Aug 2024 22:35:02
commit hash: ece290008e97142c0099b1dce0194d4d999af53ecommit hash: ece290008e97142c0099b1dce0194d4d999af53ecommit hash: ece290008e97142c0099b1dce0194d4d999af53ecommit hash: ece290008e97142c0099b1dce0194d4d999af53e files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: update to 0.20.0

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.20.0

Reported by:	portscout
0.19.1_2
18 Jun 2024 10:59:14
commit hash: 21476ca8a9b5feb3485a3fa29299c0b447c90350commit hash: 21476ca8a9b5feb3485a3fa29299c0b447c90350commit hash: 21476ca8a9b5feb3485a3fa29299c0b447c90350commit hash: 21476ca8a9b5feb3485a3fa29299c0b447c90350 files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.79.0

PR:		279707
0.19.1_1
13 May 2024 11:03:24
commit hash: 2322b661000b69fcc75fe68fec25e5b4e48c2a8ecommit hash: 2322b661000b69fcc75fe68fec25e5b4e48c2a8ecommit hash: 2322b661000b69fcc75fe68fec25e5b4e48c2a8ecommit hash: 2322b661000b69fcc75fe68fec25e5b4e48c2a8e files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.78.0

PR:		278834
0.19.1
21 Apr 2024 08:18:00
commit hash: fbf53c1600b33ffffebd7ad72f96acd3220baed4commit hash: fbf53c1600b33ffffebd7ad72f96acd3220baed4commit hash: fbf53c1600b33ffffebd7ad72f96acd3220baed4commit hash: fbf53c1600b33ffffebd7ad72f96acd3220baed4 files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: update to 0.19.1

Changelog:
- https://github.com/huggingface/tokenizers/releases/tag/v0.19.0
- https://github.com/huggingface/tokenizers/releases/tag/v0.19.1

Reported by:	Repology
0.15.2_2
23 Mar 2024 09:41:46
commit hash: d632fbe55e45559d03a0bef7211913964aeba45acommit hash: d632fbe55e45559d03a0bef7211913964aeba45acommit hash: d632fbe55e45559d03a0bef7211913964aeba45acommit hash: d632fbe55e45559d03a0bef7211913964aeba45a files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.77.0

PR:		277786
0.15.2_1
19 Feb 2024 11:59:23
commit hash: 7f5a7b0e524c8d32727681a4324905e350dc433ccommit hash: 7f5a7b0e524c8d32727681a4324905e350dc433ccommit hash: 7f5a7b0e524c8d32727681a4324905e350dc433ccommit hash: 7f5a7b0e524c8d32727681a4324905e350dc433c files touched by this commit
Mikael Urankar (mikael) search for other commits by this committer
lang/rust: Bump revisions after 1.76.0

PR:		276920
0.15.2
14 Feb 2024 09:17:15
commit hash: f6fa55bd8e81236b4dfc09c6ea2e8931792369b9commit hash: f6fa55bd8e81236b4dfc09c6ea2e8931792369b9commit hash: f6fa55bd8e81236b4dfc09c6ea2e8931792369b9commit hash: f6fa55bd8e81236b4dfc09c6ea2e8931792369b9 files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: update to 0.15.2

While here, enable tests.

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.15.2

Reported by:	portscout
0.15.1
12 Feb 2024 08:34:14
commit hash: e3dfc2fad4565873d807ba820a24094b97890f98commit hash: e3dfc2fad4565873d807ba820a24094b97890f98commit hash: e3dfc2fad4565873d807ba820a24094b97890f98commit hash: e3dfc2fad4565873d807ba820a24094b97890f98 files touched by this commit
Hiroki Tagato (tagattie) search for other commits by this committer
textproc/py-tokenizers: add port: Fast state-of-the-art tokenizers optimized for
research and production

Provides an implementation of today's most used tokenizers, with a
focus on performance and versatility.

Main features:
- Train new vocabularies and tokenize, using today's most used
  tokenizers.
- Extremely fast (both training and tokenization), thanks to the Rust
  implementation. Takes less than 20 seconds to tokenize a GB of text
  on a server's CPU.
- Easy to use, but also extremely versatile.
- Designed for research and production.
- Normalization comes with alignments tracking. It's always possible
  to get the part of the original sentence that corresponds to a given
  token.
- Does all the pre-processing: Truncate, Pad, add the special tokens
  your model needs.

WWW: https://github.com/huggingface/tokenizers

Number of commits found: 19