Sumuk Shashidhar PRO

sumuks
https://sumuk.org
  • sumukx
  • sumukshashidhar
  • sumuks

AI & ML interests

Evaluations, Reasoning, Long Term Planning

Recent Activity

updated a dataset 4 days ago
sumuks/openai-coval-dpo
published a dataset 4 days ago
sumuks/openai-coval-dpo
updated a dataset 21 days ago
sumuks/preference-atlas-rewards

Organizations

  • Blog-explorers
  • Verifiers For Code
  • Preference Agents
  • Sumuk's Archived Content
  • UIUC Conversational AI Lab
  • self-planner
  • Nerdy Face
  • Sumuk's Testing Grounds!
  • Spiral Works
  • Your Bench
  • Sumuk's Second Set of Archived Content
  • InfoHunt
  • TextCleanLM
  • Sumuk's First Archival Storage Volume
  • popper
  • Sumuk's Archival Storage 2
  • Sumuk's Archival Storage 3

Articles 1

Getting Started with YourBench

Papers 5

arxiv:2505.01592
arxiv:2504.20090
arxiv:2504.01833
arxiv:2410.03731

models 0

None public yet

datasets 29

sumuks/openai-coval-dpo

Viewer • Updated 4 days ago • 5.58k • 76

sumuks/preference-atlas-rewards

Viewer • Updated 21 days ago • 5.03k • 33

sumuks/preference-atlas

Viewer • Updated 21 days ago • 329k • 103 • 1

sumuks/reward-bench-2

Viewer • Updated 21 days ago • 1.87k • 48

sumuks/helpsteer3

Viewer • Updated 22 days ago • 49.1k • 251

sumuks/helpsteer3-easy

Viewer • Updated 29 days ago • 7.93k • 51

sumuks/helpsteer-pairwise-grading

Viewer • Updated Feb 12 • 51.8k • 22

sumuks/rupo-eval-logs-helpsteer3-1

Viewer • Updated Feb 10 • 1.43k • 48

sumuks/helpsteer3-rupo

Viewer • Updated Feb 10 • 38.2k • 56

sumuks/persuasiveness_detection

Viewer • Updated Feb 6 • 3.94k • 10