Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Multilingual UnigramLM
company
https://cimeister.github.io/blog/unigramlm/
Activity Feed
Follow
4
AI & ML interests
Multilingual Tokenization
Recent Activity
suchirsalhan
updated
a model
about 1 hour ago
MultilingualUnigramLM/ft-langmap-qwen2_5-1_5b-fineweb100M-tur-original
suchirsalhan
published
a model
about 1 hour ago
MultilingualUnigramLM/ft-langmap-qwen2_5-1_5b-fineweb100M-tur-original
suchirsalhan
updated
a model
about 1 hour ago
MultilingualUnigramLM/ft-langmap-qwen2_5-1_5b-fineweb100M-hun-original
View all activity
Team members
4
models
484
Sort: Recently updated
MultilingualUnigramLM/ft-langmap-gemma3-1b-fineweb100M-tur-original
Updated
2 minutes ago
MultilingualUnigramLM/ft-langmap-qwen2_5-1_5b-fineweb100M-tur-original
Updated
3 minutes ago
MultilingualUnigramLM/ft-langmap-qwen2_5-1_5b-fineweb100M-hun-original
Updated
4 minutes ago
MultilingualUnigramLM/ft-langmap-qwen2_5-1_5b-fineweb100M-fin-original
Updated
4 minutes ago
MultilingualUnigramLM/las-tokenizers-granite-3.0-8b-base-khm
Updated
12 days ago
MultilingualUnigramLM/las-tokenizers-granite-3.0-8b-base-tam
Updated
12 days ago
MultilingualUnigramLM/las-tokenizers-granite-3.0-8b-base-ben
Updated
12 days ago
MultilingualUnigramLM/las-tokenizers-phi-4-khm
Updated
12 days ago
MultilingualUnigramLM/las-tokenizers-phi-4-ben
Updated
12 days ago
MultilingualUnigramLM/las-tokenizers-granite-3.0-8b-base-tha
Updated
12 days ago
View 484 models
datasets
35
Sort: Recently updated
MultilingualUnigramLM/FineWeb2-500M-tur_Latn
Viewer
•
Updated
about 2 hours ago
•
419k
MultilingualUnigramLM/FineWeb2-500M-hun_Latn
Viewer
•
Updated
about 2 hours ago
•
312k
MultilingualUnigramLM/FineWeb2-500M-fin_Latn
Viewer
•
Updated
about 2 hours ago
•
387k
•
1
MultilingualUnigramLM/FineWeb2-khm_Khmr-100M
Viewer
•
Updated
13 days ago
•
1.01M
•
13
MultilingualUnigramLM/FineWeb2-tha_Thai-100M
Viewer
•
Updated
13 days ago
•
710k
•
13
MultilingualUnigramLM/FineWeb2-tam_Taml-100M
Viewer
•
Updated
13 days ago
•
264k
•
12
MultilingualUnigramLM/FineWeb2-arb_Arab-100M
Viewer
•
Updated
13 days ago
•
188k
•
11
MultilingualUnigramLM/FineWeb2-ben_Beng-100M
Viewer
•
Updated
13 days ago
•
254k
•
11
MultilingualUnigramLM/FineWeb2-amh_Ethi-100M
Viewer
•
Updated
13 days ago
•
197k
•
13
MultilingualUnigramLM/FineWeb2-yor_Latn-100M
Viewer
•
Updated
13 days ago
•
80k
•
11
View 35 datasets