I made a code sniping agent to detect when new AI papers with code (and weights) are released, and then automatically create a Gradio demo on Hugging Face π§
I call this agent CheatCode (https://github.com/jbilcke-hf/CheatCode) because it skips so many steps that it kinda feels like breaking the rules of the AI tech release game π
- Currently the demos are all generated in one go and not built or tested by the agent itself. A more robust version should loop over the deployed app to fix build/runtime issues. - There is still a bit of human curation done to avoid making demos for things that canβt really be demonstrated on ZeroGPU (eg. tasks taking several minutes) - Some papers can actually be showcased in a variety of ways, which isnβt really supported (see Demo 2)
Hugging Face dropped SmolLM π€ > Beats MobileLLM, Qwen 0.5B, Phi 1.5B and more! > 135M, 360M, and 1.7B param model checkpoints > Trained on 600B high-quality synthetic + FineWeb Edu tokens > Architecture: Llama + GQA + 2048 ctx length > Ripe for fine-tuning and on-device deployments. > Works out of the box with Transformers!
Mistral released Mathstral 7B β > 56.6% on MATH and 63.47% on MMLU > Same architecture as Mistral 7B > Works out of the box with Transformers & llama.cpp > Released under Apache 2.0 license