Thivakar Jeyaseelan
Thiv
ยท
AI & ML interests
None yet
Recent Activity
liked a model about 1 hour ago
openpangu/openPangu-2.0-Flash reacted to Banaxi-Tech's post with ๐ about 1 hour ago
A new model is coming!
Its going to take a long time on my 5070 Ti so expect a release in ~1 month.
We think this model is going to be SOTA For its size.
Our Mini Version will be 25M Parameters and Pro with 140M.
The Pro version has a 3072 Context Window (Extensible to up to 6K with RoPE) And the Mini version has a context window of 4096 (Up to 8K with RoPE)
Meanwhile we are currently working on a Instruct Version of our BananaMind 1.5 Base.
The training will start this weekend
We are very exited to release it when its done! liked a model 2 days ago
Jackrong/Qwopus3.6-35B-A3B-Coder-MTP-GGUFOrganizations
None yet