Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Common Crawl Foundation

Enterprise
non-profit
Verified
https://commoncrawl.org
commoncrawl
commoncrawl
Activity Feed

AI & ML interests

Crawled data and metadata

Recent Activity

pjox  authored a paper 4 days ago
SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing
tvaughan  updated a dataset 5 days ago
commoncrawl/statistics
malteos  updated a Space 20 days ago
commoncrawl/cc-citations
View all activity

Thom Vaughan's profile picture Pedro Ortiz Suarez's profile picture Paul Lazar's profile picture Greg Lindahl's profile picture Ford H's profile picture Jen English's profile picture Sebastian Nagel's profile picture Jason Grey's profile picture Laurie Burchell's profile picture Hande Celikkanat's profile picture malteos's profile picture Thijs Dalhuijsen's profile picture d's profile picture Luca's profile picture

commoncrawl 's Spaces 2

pinned
Running

README

🌍

Explore Common Crawl's metadata and experimental datasets

Nov 20, 2024
Running

cc-citations

📜

Scientific articles using or citing Common Crawl data

20 days ago
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs