evalstate/diffusers-pr-api-data / snapshots /hf-evalstate--diffusers-pr--main
308 MB
38 files
Updated about 1 month ago
Name                            Size
state
snapshots
analysis-state
analysis
reviews.parquet                 2.27 MB
review_comments.parquet         5.58 MB
pull_requests.parquet           4.97 MB
pr_files.parquet                67.7 MB
pr_diffs.parquet                112 MB
pr-scope-clusters.json          168 kB
new_contributors.parquet        29.6 kB
new-contributors-report.md      157 kB
new-contributors-report.json    269 kB
manifest.json                   3.9 kB
links.parquet                   88.7 kB
issues.parquet                  6.27 MB
events.parquet                  2.94 MB
comments.parquet                15.1 MB
README.md                       1.91 kB
README.md

Diffusers PR Dataset

Normalized snapshots of issues, pull requests, comments, reviews, and linkage data from huggingface/diffusers.

Files:

  • issues.parquet
  • pull_requests.parquet
  • comments.parquet
  • issue_comments.parquet (derived view of issue discussion comments)
  • pr_comments.parquet (derived view of pull request discussion comments)
  • reviews.parquet
  • pr_files.parquet
  • pr_diffs.parquet
  • review_comments.parquet
  • links.parquet
  • events.parquet
  • new_contributors.parquet
  • new-contributors-report.json
  • new-contributors-report.md
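
The list above can be checked against a local copy before any analysis starts. The following is a minimal sketch, assuming the snapshot has been downloaded into a single flat directory and that pandas with a Parquet engine (pyarrow or fastparquet) is installed. The helper names `missing_files` and `load_table` are hypothetical and not part of the dataset.

```python
from pathlib import Path

# File names copied from the list above; the flat directory layout is an assumption.
EXPECTED_FILES = [
    "issues.parquet",
    "pull_requests.parquet",
    "comments.parquet",
    "issue_comments.parquet",
    "pr_comments.parquet",
    "reviews.parquet",
    "pr_files.parquet",
    "pr_diffs.parquet",
    "review_comments.parquet",
    "links.parquet",
    "events.parquet",
    "new_contributors.parquet",
    "new-contributors-report.json",
    "new-contributors-report.md",
]


def missing_files(snapshot_dir):
    """Return the expected file names that are absent from snapshot_dir."""
    root = Path(snapshot_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


def load_table(snapshot_dir, name):
    """Read one Parquet table into a pandas DataFrame."""
    import pandas as pd  # imported lazily so missing_files works without pandas

    return pd.read_parquet(Path(snapshot_dir) / name)
```

The README does not document column names, so it is safer to inspect `load_table(path, "pull_requests.parquet").dtypes` before relying on any particular field.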

Use:

  • duplicate PR and issue analysis
  • triage and ranking experiments
  • eval set creation
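
As one illustration of duplicate PR analysis, the sketch below ranks PR pairs by the Jaccard overlap of their changed-file sets. This is an assumed baseline, not a method shipped with the dataset. The input mapping (PR number to changed file paths) is hypothetical and would have to be built from pr_files.parquet once its columns are known.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b| (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def duplicate_candidates(files_by_pr, threshold=0.5):
    """Return (pr_a, pr_b, score) for PR pairs whose changed-file sets overlap."""
    pairs = []
    for (pr_a, files_a), (pr_b, files_b) in combinations(sorted(files_by_pr.items()), 2):
        score = jaccard(set(files_a), set(files_b))
        if score >= threshold:
            pairs.append((pr_a, pr_b, score))
    # Highest-overlap pairs first.
    return sorted(pairs, key=lambda p: p[2], reverse=True)


# Hypothetical toy input: PR number -> changed file paths (not real data).
toy = {
    101: ["src/a.py", "src/b.py"],
    102: ["src/a.py", "src/b.py", "docs/x.md"],
    103: ["tests/test_c.py"],
}
print(duplicate_candidates(toy))
```

The pairwise comparison is quadratic in the number of PRs, so a real run would likely need blocking (for example, comparing only PRs that share at least one file) before scoring.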

Notes:

  • latest snapshot: 20260528T043525Z
  • raw data only; no labels or moderation decisions
  • PR metadata, file-level patch hunks, and full unified diffs are included
  • full file contents for changed files are not included
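
The snapshot ID above looks like a compact UTC timestamp (date, `T`, time, trailing `Z`). Assuming that reading is correct, a small sketch for parsing and comparing snapshot IDs follows. The function names are hypothetical.

```python
from datetime import datetime, timezone


def parse_snapshot_id(snapshot_id):
    """Parse an ID such as '20260528T043525Z' into a timezone-aware UTC datetime."""
    naive = datetime.strptime(snapshot_id, "%Y%m%dT%H%M%SZ")
    return naive.replace(tzinfo=timezone.utc)


def latest_snapshot(snapshot_ids):
    """Return the most recent ID; the fixed-width format also sorts lexicographically."""
    return max(snapshot_ids, key=parse_snapshot_id)
```

Parsing instead of comparing raw strings has the side effect of rejecting malformed IDs with a `ValueError`.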