Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
10
16
5
Deqing Fu
PRO
deqing
Follow
oliu-io's profile picture
ghazalkhn's profile picture
leonli66's profile picture
12 followers
·
17 following
https://deqingfu.github.io
DeqingFu
DeqingFu
AI & ML interests
None yet
Recent Activity
updated
a model
20 minutes ago
deqing/llama-300M-v5-addition
updated
a model
22 minutes ago
deqing/llama-300M-v5-addition_adamw
published
a model
about 14 hours ago
deqing/llama-300M-v5-addition_adamw
View all activity
Organizations
deqing
's models
87
Sort: Recently updated
deqing/llama-300M-v5-addition
0.3B
•
Updated
20 minutes ago
•
1.66k
deqing/llama-300M-v5-addition_adamw
0.3B
•
Updated
22 minutes ago
deqing/llama-300M-v5-addition_3digit_adamw
0.3B
•
Updated
about 1 hour ago
deqing/llama-300M-v5-addition_3digit
0.3B
•
Updated
about 1 hour ago
•
465
deqing/llama-300M-v5-addition_adamw-old
0.3B
•
Updated
about 21 hours ago
•
336
deqing/llama-300M-v5-isolate
Text Generation
•
0.3B
•
Updated
about 21 hours ago
•
3.26k
deqing/llama-300M-v5-addition_3digit-old
0.3B
•
Updated
about 21 hours ago
deqing/llama-300M-v5-adamw-addition_3digit_adamw-old
0.3B
•
Updated
about 21 hours ago
deqing/llama-300M-v5-original-random_init_sft
Updated
1 day ago
•
1
deqing/llama-300M-v5-isolate_sft
Updated
1 day ago
•
1
deqing/llama-300M-v5-swap_numbers_sft
Updated
1 day ago
deqing/llama-300M-v5-addition-old
0.3B
•
Updated
2 days ago
•
1.58k
deqing/llama-300M-v5-original_sft
Updated
2 days ago
•
5
deqing/llama-300M-v5-unigram
Text Generation
•
0.3B
•
Updated
2 days ago
•
1.61k
deqing/llama-300M-v5-bigram
Text Generation
•
0.3B
•
Updated
3 days ago
•
1.61k
deqing/lstm-window-4-v5
Text Generation
•
0.2B
•
Updated
3 days ago
•
1.66k
deqing/fone-llama-3.2-1B-fineweb-sample-100BT-fone3d-hybrid-tile-v4
1B
•
Updated
3 days ago
•
392
deqing/llama-300M-v5-fivegram
Text Generation
•
0.3B
•
Updated
4 days ago
•
1.75k
deqing/llama-300M-v5-base_7
Text Generation
•
0.3B
•
Updated
5 days ago
•
2k
deqing/llama-300M-v5-swap_numbers
Text Generation
•
0.3B
•
Updated
5 days ago
•
1.47k
deqing/llama-300M-v5-window_4
Text Generation
•
0.3B
•
Updated
5 days ago
•
1.91k
deqing/llama-300M-v5-permute
Text Generation
•
0.3B
•
Updated
5 days ago
•
1.62k
deqing/llama-300M-v5-isolate-old
Text Generation
•
0.3B
•
Updated
6 days ago
•
1.95k
deqing/llama-300M-v5-original
Text Generation
•
0.3B
•
Updated
6 days ago
•
2.11k
deqing/test-fone-hub-upload
Updated
7 days ago
•
2
deqing/llama-600M-v4-isolate
Text Generation
•
0.6B
•
Updated
8 days ago
•
6.81k
deqing/llama-600M-v4-fivegram
0.6B
•
Updated
9 days ago
•
1.81k
deqing/llama-600M-v4-bigram
Text Generation
•
0.6B
•
Updated
10 days ago
•
1.97k
deqing/llama-600M-v4-unigram
0.6B
•
Updated
11 days ago
•
1.92k
deqing/mamba-370m-v4
Updated
11 days ago
Previous
1
2
3
Next