Why I Swapped a 122B Model for a 35B One — and Got Faster
Counterintuitive move: trading a dense 122-billion-parameter model for a 35-billion-parameter mixture-of-experts, and having the site feel noticeably snappier afterward. Active parameters matter more than total parameters.
Read more →