Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Steffen Röcker's picture
12 234 747

Steffen Röcker PRO

sroecker
tegridydev's profile picture Molbap's profile picture ltim's profile picture
·
https://x.com/sroecker
  • sroecker
  • sroecker

AI & ML interests

Local models

Recent Activity

upvoted an article about 6 hours ago
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
liked a model about 22 hours ago
nvidia/Kimi-K2.5-Thinking-Eagle3
liked a model 1 day ago
Tesslate/OmniCoder-9B
View all activity

Organizations

Hugging Face Discord Community's profile picture

sroecker 's collections 1

RLHF
  • The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

    Paper • 2406.01462 • Published Jun 3, 2024 • 6
  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28, 2025 • 124
RLHF
  • The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

    Paper • 2406.01462 • Published Jun 3, 2024 • 6
  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28, 2025 • 124
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs