Port details |
- sentencepiece Unsupervised text tokenizer for Neural Network-based text generation
- 0.2.0 textproc =2 0.2.0Version of this port present on the latest quarterly branch.
- Maintainer: yuri@FreeBSD.org
- Port Added: 2023-01-16 09:42:22
- Last Update: 2024-02-20 07:09:21
- Commit Hash: 2250e98
- People watching this port, also watch:: jdictionary, py311-Automat, py311-python-gdsii, py39-PyOpenGL, p5-Sane
- License: APACHE20
- WWW:
- https://github.com/google/sentencepiece
- Description:
- SentencePiece is an unsupervised text tokenizer and detokenizer mainly for
Neural Network-based text generation systems where the vocabulary size is
predetermined prior to the neural model training. SentencePiece implements
subword units (e.g., byte-pair-encoding (BPE)) and unigram language model
with the extension of direct training from raw sentences. SentencePiece
allows us to make a purely end-to-end system that does not depend on
language-specific pre/postprocessing.
- ¦ ¦ ¦ ¦
- Manual pages:
- FreshPorts has no man page information for this port.
- pkg-plist: as obtained via:
make generate-plist - Dependency lines:
-
- sentencepiece>0:textproc/sentencepiece
- To install the port:
- cd /usr/ports/textproc/sentencepiece/ && make install clean
- To add the package, run one of these commands:
- pkg install textproc/sentencepiece
- pkg install sentencepiece
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.- PKGNAME: sentencepiece
- Flavors: there is no flavor information for this port.
- distinfo:
- TIMESTAMP = 1708411133
SHA256 (google-sentencepiece-v0.2.0_GH0.tar.gz) = 9970f0a0afee1648890293321665e5b2efa04eaec9f1671fcf8048f456f5bb86
SIZE (google-sentencepiece-v0.2.0_GH0.tar.gz) = 11980811
Packages (timestamps in pop-ups are UTC):
- Dependencies
- NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
- Build dependencies:
-
- cmake : devel/cmake-core
- ninja : devel/ninja
- Library dependencies:
-
- libtcmalloc_minimal.so : devel/google-perftools
- This port is required by:
- for Libraries
-
- textproc/py-sentencepiece
Configuration Options:
- No options to configure
- Options name:
- textproc_sentencepiece
- USES:
- cmake:testing compiler:c++17-lang
- FreshPorts was unable to extract/find any pkg message
- Master Sites:
|
Number of commits found: 4
Commit History - (may be incomplete: for full details, see links to repositories near top of page) |
Commit | Credits | Log message |
0.2.0 20 Feb 2024 07:09:21 |
Yuri Victorovich (yuri) |
textproc/{,py-}sentencepiece: update 0.1.99 → 0.2.0
Reported by: portscout |
0.1.99 03 Jun 2023 07:43:33 |
Yuri Victorovich (yuri) |
textproc/sentencepiece: Update 0.1.98 → 0.1.99
Reported by: portscout |
0.1.98 18 Apr 2023 04:38:14 |
Yuri Victorovich (yuri) |
textproc/{,py-}sentencepiece: Update 0.1.97 → 0.1.98
Reported by: portscout |
0.1.97 16 Jan 2023 09:41:00 |
Yuri Victorovich (yuri) |
textproc/sentencepiece: New port: Unsupervised text tokenizer for Neural
Network-based text generation |
Number of commits found: 4
|