OpenAI releases o1, its first model with ‘reasoning’ abilities

nave@lemmy.ca · edit-2 5 months ago

OpenAI releases o1, its first model with ‘reasoning’ abilities

Zos_Kia@lemmynsfw.com · 5 months ago

No the article is badly worded. Earlier models already have reasoning skills with some rudimentary CoT, but they leaned more heavily into it for this model.

My guess is they didn’t train it on the 10 trillion words corpus (which is expensive and has diminishing returns) but rather a heavily curated RLHF dataset.