AI researchers develop 'reasoning' model for under $50

The o1 model was trained using reinforcement learning, which rewards the model for performing actions that help in achieving ...
Researchers developed the S1 reasoning AI using less than $50 in compute cost to achieve a reasoning model as powerful as ...
The company finally unveiled the new system in September, outing it as OpenAI’s first “reasoning” model and renaming it “o1.” Much like the two-stage release of GPT-2, where a stripped ...
The starting point of the project was Qwen2.5-32B-Instruct, an open-source LLM released by Alibaba Group Holding Ltd. last year. The researchers created s1-32B by customizing Qwen2.5-32B-Instruct ...