Hugging Face
Nikita Kezins
entfane
10 followers · 28 following
entfane
nikita-kezins
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
entfane's activity
updated a model 3 days ago: entfane/jailbreak-cot-lin-probe • Updated 3 days ago
published a model 3 days ago: entfane/jailbreak-cot-lin-probe • Updated 3 days ago
updated a model 3 days ago: entfane/jailbreak-input-lin-probe • Updated 3 days ago
published a model 3 days ago: entfane/jailbreak-input-lin-probe • Updated 3 days ago
updated a dataset 11 days ago: entfane/jailbreaks-only • Updated 11 days ago • 666 • 68
published a dataset 11 days ago: entfane/jailbreaks-only • Updated 11 days ago • 666 • 68
updated a model 11 days ago: entfane/llama-guard-binary • Text Classification • 0.3B • Updated 11 days ago • 63
published a model 11 days ago: entfane/llama-guard-binary • Text Classification • 0.3B • Updated 11 days ago • 63
updated a dataset 27 days ago: entfane/construction_points • Updated 27 days ago • 10k • 178
published a dataset 27 days ago: entfane/construction_points • Updated 27 days ago • 10k • 178
updated a model about 1 month ago: entfane/Toxic_Llama8B • Text Classification • 8B • Updated about 1 month ago • 101
published a model about 1 month ago: entfane/Toxic_Llama8B • Text Classification • 8B • Updated about 1 month ago • 101
updated a dataset about 1 month ago: entfane/violent_eval • Updated Apr 9 • 22.4k • 12
published a dataset about 1 month ago: entfane/violent_eval • Updated Apr 9 • 22.4k • 12
updated a model about 1 month ago: entfane/gpt2_constitutional_classifier_violence • Text Classification • 0.1B • Updated Apr 7 • 11
published a model about 1 month ago: entfane/gpt2_constitutional_classifier_violence • Text Classification • 0.1B • Updated Apr 7 • 11
updated a dataset about 1 month ago: entfane/harmful_subsets • Updated Apr 7 • 571k • 7
published a dataset about 1 month ago: entfane/harmful_subsets • Updated Apr 7 • 571k • 7
updated a dataset about 2 months ago: entfane/preprocessed_toxigen • Updated Apr 3 • 10.1k • 121
published a dataset about 2 months ago: entfane/preprocessed_toxigen • Updated Apr 3 • 10.1k • 121