Pretrained models for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"
Jinrui Zhang
zjr2000
AI & ML interests
None yet
Recent Activity
updated
a model about 15 hours ago
zjr2000/SPES-9B updated
a model about 15 hours ago
zjr2000/SPES-7B updated
a model about 15 hours ago
zjr2000/SPES-2B Organizations
None yet