his kvetching about open source models
GLM 5.2 is BTFO Sonnet in coding, close to Opus (sometimes even beating it, for website making, GLM is just as good in my opinion). Only edge they have is their Fable/Mythos model that is just a computational brute-force model (+2TB parameters probably).
I just started using GLM-5.2 via openrouter. I gave a spec to Fable and ran it on Claudecode, saved it to a branch. Switched back to the previous state and ran the same spec via GLM-5 using Opencode.
The UI that Fable produced was way better. Looked beautiful. The actual code implementation was garbage compared to GLM. GLM separated out all the classes, made a better hierarchy, better database schema; just overall cleaner. I then copied just the UI pieces to a temp directory and told Opencode/GLM to integrate that html/css/assets and now I have something fairly solid.
The nice thing about GLM is that you
can run it locally. You need somewhere between 256GB and 800GB of VRAM depending on the quant size; so a minimum of $8k~$10k in compute depending on if you buy a set of AMD R9700s, RTX 6000s or connect up 3 x Strix devices. It's completely impractical today. No group of friends are going to go in togeher for $2k/each to buy gear that they can only load one model on at a time, and depent on one person to maintain the uptime. So it's not practical, but using an open weight model is hedging against the possibility of being able to purchase local hardware in the future.
Also, Opus/Fable is probably only better at UI because they used all the training data from Figma to create Claude UI, tanking Figma stocks. Truly evil levels of Microsoft shit.
Sakana AI (yes, the Japs that are 20 years behind in computers)
They created Ruby. .. Sony still makes good stuff. Nobody remembers Panasonic because they only make pro-grade stuff an no consumer products. ... Yea it's sad to see how far Japan has fallen behind in software engineering.
To be fair to Anthropic they "tried" to stop it with ID verification etc.,
I think July 7th will be a footgun moment. Sure, over half of the subscribers will just scan their ID and face, but I think a non-trivial amount (hopefully 15% ~ 20%) will just cancel. I will be one of them. Between both home and work on a Pro plan I get reimbured for, I use ~2m tokens a month or ~200k/week. There are a ton of routers (openrouter, portkey, etc.) and a lot of them allow basic shit like using a password instead of a fucking e-mail magic-link, purchase without saving your credit card, details pricing reports, and other features that are not available from Anthropic's 100% dark pattern of a WebUI. I know personally, I don't get anywhere near maxing out my Pro plan, and it just seems cheaper and way less of a hastle to use an API gateway even with their overhead costs.