I was thinking about this the other day. If we did a plot of 'model ability' vs ...

chasd00 · 2026-01-14T21:51:36 1768427496

i don't think adding more hardware does anything except increase performance scaling. I think most improvement gains are made through specialized training (RL) after the base training is done. I suppose more GPU RAM means a larger model is feasible, so in that case more hardware could mean a better model. I get the feeling all the datacenters being proposed are there to either serve the API or create and train various specialized models from a base general one.

ryoshu · 2026-01-14T21:07:49 1768424869

I think the harnesses are responsible for a lot of recent gains.

NitpickLawyer · 2026-01-14T21:11:27 1768425087

Not really. A 100 loc "harness" that is basically a llm in a loop with just a "bash" tool is way better today than the best agentic harness of last year.

Check out mini-swe-agent.

SOLAR_FIELDS · 2026-01-15T01:04:13 1768439053

Everyone is currently discovering independently that “Ralph Wigguming” is a thing