GitHub Analytics
OpenNMT/Tokenizer
Public
Fast and customizable text tokenization library with BPE and SentencePiece support
MIT License
Updated Feb 2, 2026
Created Feb 14, 2017
View on GitHub
329 stars
79 forks
18 watchers
8 open issues
Languages
Codebase composition by bytes
Top Contributors
guillaumekln: 550 commits
vince62s: 25 commits
jsenellart: 9 commits
jhnwnd: 8 commits
monsieurzhang: 5 commits
panosk: 5 commits
minhthuc2502: 2 commits
innerNULL: 2 commits
DYCSystran: 1 commit
keichi: 1 commit
Recent Stargazers
boostf: almost 9 years ago
valmat: almost 9 years ago
loretoparisi: over 8 years ago
abduld: over 8 years ago
mohaps: over 8 years ago
stephane88: over 8 years ago
smiranda: over 8 years ago
davidalbertonogueira: over 8 years ago
tokestermw: almost 8 years ago
Spacebody: over 7 years ago
Repository Complexity
Approximate file count and repository size.
Files -
Size (KB) 1,813
Issues & Activity
Open vs closed issues and recent activity peak.
Open Issues 7
Closed Issues 93
Open Issues 100
Activity Peak
Most recent day with highest activity
Feb 2, 2026 • 1 event
Stars & Forks Over Time
Cumulative stars and forks based on recent history.