Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4281.6
TFLOPS
526
7
219
Michael
PRO
michaelfeil
Follow
muhtasham's profile picture
Han1127's profile picture
cc66pig's profile picture
45 followers
·
16 following
https://michaelfeil.eu
michaelfeil
AI & ML interests
ML Inference
Recent Activity
new
activity
6 days ago
baseten-admin/bert-base-ner-uncased:
Create modules.json
new
activity
7 days ago
voyageai/voyage-4-nano:
Alt modeling code
new
activity
7 days ago
gradientai/Llama-3-8B-Instruct-Gradient-1048k:
Update Readme link
View all activity
Organizations
michaelfeil
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
baseten-admin/bert-base-ner-uncased
6 days ago
Create modules.json
#1 opened 6 days ago by
michaelfeil
New activity in
voyageai/voyage-4-nano
7 days ago
Alt modeling code
5
#5 opened 23 days ago by
michaelfeil
New activity in
gradientai/Llama-3-8B-Instruct-Gradient-1048k
7 days ago
Update Readme link
#30 opened 7 days ago by
michaelfeil
liked
a model
16 days ago
arcee-ai/Trinity-Large-Preview
Text Generation
•
Updated
9 days ago
•
1.75k
•
142
New activity in
nvidia/llama-embed-nemotron-8b
19 days ago
Upstream transformers support with `use_bidirectional_attention`
4
#13 opened 23 days ago by
michaelfeil
New activity in
voyageai/voyage-4-nano
23 days ago
Add a config hint for use_linear_output_projection
#6 opened 23 days ago by
michaelfeil
NIT Update config.json
1
#4 opened 24 days ago by
michaelfeil
New activity in
voyageai/voyage-4-nano
24 days ago
framework support via use_bidirectional_attention - Cheers from your friends at Baseten
1
#3 opened 24 days ago by
michaelfeil
updated
a model
25 days ago
baseten/embedding-smol_llama-101M-GQA
76.6M
•
Updated
25 days ago
•
110
New activity in
nvidia/llama-nemotron-embed-1b-v2
26 days ago
"use_bidirectional_attention": true flag
1
#13 opened 26 days ago by
michaelfeil
updated
a model
27 days ago
baseten/qwen3-engine-30A3-repro
Updated
27 days ago
•
19
published
a model
27 days ago
baseten/qwen3-engine-30A3-repro
Updated
27 days ago
•
19
New activity in
Qwen/Qwen3-30B-A3B-Instruct-2507
28 days ago
dummy config.json
#26 opened 28 days ago by
michaelfeil
liked
a model
about 2 months ago
zai-org/GLM-4.7
Text Generation
•
Updated
18 days ago
•
124k
•
•
1.92k
liked
2 models
2 months ago
arcee-ai/Trinity-Mini
Text Generation
•
Updated
Dec 11, 2025
•
7.37k
•
•
178
arcee-ai/Trinity-Nano-Preview
Text Generation
•
6B
•
Updated
Dec 1, 2025
•
20k
•
63
New activity in
jinaai/jina-code-embeddings-0.5b
3 months ago
missing tokenizer
#3 opened 3 months ago by
michaelfeil
New activity in
baseten/Llama-3.2-3B-Instruct-pythonic
4 months ago
Update chat_template.jinja
#1 opened 4 months ago by
baseten-admin
New activity in
Snowflake/snowflake-arctic-embed-l-v2.0
5 months ago
Set CTX Length to 2048
2
#19 opened 5 months ago by
michaelfeil
Set CTX Length to 2048
2
#19 opened 5 months ago by
michaelfeil
Load more