OpenAI releases o1, its first model with ‘reasoning’ abilities

nave@lemmy.ca · edit-2 5 months ago

OpenAI releases o1, its first model with ‘reasoning’ abilities

khepri@lemmy.world · 5 months ago

So they slapped some reinforcement learning on top of their LLM and are claiming that gives it “reasoning capabilities”? Or am I missing something?

Evotech@lemmy.world · 5 months ago

It’s like 3 lms on top of eachother in a trenchcoat, and appau a calculator so it gets math right

Zos_Kia@lemmynsfw.com · 5 months ago

No the article is badly worded. Earlier models already have reasoning skills with some rudimentary CoT, but they leaned more heavily into it for this model.

My guess is they didn’t train it on the 10 trillion words corpus (which is expensive and has diminishing returns) but rather a heavily curated RLHF dataset.