Pre-trined models for Matmulfree LM.
Rui-Jie Zhu
ridger
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models upvoted a paper 2 days ago
Large Language Models Explore by Latent Distilling upvoted a collection about 1 month ago
Nemotron-Cascade 2