427B params? This is not intelligence, its brute force.

#6
by Nerdsking - opened

Why not 1T already? 2T? It is DOUBLE the size of the previous model. People from Stepfun delivered a REAL sucessor with Step 3.7. Same size, better model.
Gitgud boys.

I like how he literally swears because FREE model delivered to him for FREE does not satisfy his 10 years old lowvram hardware.

427 BILLIONS parameters? And that's "MY" fault? Well, incompetence allways finds justification (and the typical brainless user simps to support it...). Specially when Qwen 3.6 with mere 27b is able to do almost as good or better in many aspects... What is clear to me is that it is a purpose action to make the "local" model less local as possible, to force users to BUY the online version. The bad news is that there is COMPETITION. So no, I will not be expending more and more to acomodate the incompetence of others, or be forced by marketing strategy, I will simply shift to a better model, able to the same or better costing less. And I already did, now I am using Step3.7, a model that REALLY evolved from the 3.5 last version.

427 BILLIONS parameters? And that's "MY" fault? Well, incompetence allways finds justification (and the typical brainless user simps to support it...). Specially when Qwen 3.6 with mere 27b is able to do almost as good or better in many aspects...

ig for coding it might perform well, but other stuff bruh, no, even gemma much is smarter in terms of knowledge..

What is clear to me is that it is a purpose action to make the "local" model less local as possible, to force users to BUY the online version.

Why not a local server?

The bad news is that there is COMPETITION. So no, I will not be expending more and more to acomodate the incompetence of others, or be forced by marketing strategy, I will simply shift to a better model, able to the same or better costing less. And I already did, now I am using Step3.7, a model that REALLY evolved from the 3.5 last version.

wdym, are you a local service customer or something I don't exactly understand. you expect a model you can indeed run locally to perform on consumer hardware, which barely fit these data outside of benchmarks into that small params count....

Sign up or log in to comment