DeepSeek is cool.
They essentially "load balanced" the model.
In simple terms, instead of one bloated, generalized model, they've broken it up into smaller "experts" (a mixture-of-experts setup) that wake up only when they're needed, so it runs more efficiently.
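To make the "experts that wake up when needed" idea concrete, here is a toy sketch of top-k expert routing in plain Python. It is not DeepSeek's actual code: the names `route`, `make_expert`, and `gate_weights` are made up for illustration, and each "expert" is just a function that scales its input, standing in for a real sub-network.

```python
import math
import random


def softmax(scores):
    # Subtract the max before exponentiating for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(scale):
    # Hypothetical "expert": a tiny function standing in for a sub-network.
    return lambda x: [scale * v for v in x]


def route(x, gate_weights, experts, k=2):
    # Score each expert with a dot product between the input and its gate vector.
    scores = [sum(w * v for w, v in zip(row, x)) for row in gate_weights]
    probs = softmax(scores)
    # Keep only the k highest-scoring experts; the others are never called.
    top = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    out = [0.0] * len(x)
    for i in top:
        y = experts[i](x)
        weight = probs[i] / norm  # renormalize over the selected experts
        out = [o + weight * yi for o, yi in zip(out, y)]
    return out, top


if __name__ == "__main__":
    random.seed(0)
    experts = [make_expert(s) for s in (1.0, 2.0, 3.0, 4.0)]
    # Random gate vectors, purely for demonstration (a real model learns these).
    gate_weights = [[random.uniform(-1, 1) for _ in range(3)] for _ in experts]
    output, active = route([0.5, -1.0, 2.0], gate_weights, experts, k=2)
    print("active experts:", active)
    print("output:", output)
```

The point of the sketch: with four experts and `k=2`, only two of them do any work for a given input, which is where the efficiency comes from.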
We need a 🇺🇸 Manhattan Project for AGI.