The Watch Project X Onlinefloodgates have opened for building AI reasoning models on the cheap.
Researchers at Stanford and the University of Washington have developed a model that performs comparably to OpenAI o1 and DeepSeek R1 models in math and coding — for less than $50 of cloud compute credits.
What's more, the model was trained on only 1,000 questions, and took just 26 minutes and 16 Nvidia H100 GPUs. Stanford researcher Niklas Muennighoff said in a email to Mashable that the cost is an estimate based on the GPU runtime and number of H100 GPUs used.
The AI industry of late is all about how new approaches to the pre and post training process can massively save computing costs, as evidenced by DeepSeek's disruptive impact. On top of that, developers are now able to build on top of existing AI models at little or no cost, through APIs, open-source access, and even closed-source models by distilling their data, bringing the costs down even more.
According to the team's research paper which was published last Friday, s1 was trained on a dataset consisting of "1,000 carefully curated questions paired with reasoning traces and answers distilled from Gemini Thinking Experimental." Google's Gemini Thinking Experimental model is accessible with daily limits through AI Studio. While it's a closed-source model, that clearly hasn't stopped researchers from making use of its responses.
SEE ALSO: OpenAI launches 'deep research' AI agent for ChatGPTNext, the researchers used an "off the shelf" pretrained model from Alibaba-owned lab, Qwen, and performed supervised fine-tuning of its curated dataset. Then, the team created a token budget to control the amount of compute time for testing the model. If s1 went over budget on thinking tokens, it was cut off and forced to generate whatever answer it came up with. If the researchers wanted the model to spend more "test-time compute" on a problem, they would simply tell the model to "wait," which extended its thinking time and led to more accurate results.
By controlling the amount of time and compute spent on a problem, the researchers were able to show how increased thinking team leads to improved performance.
S1 is one example of open-source reasoning models that have been developed for a fraction of the cost of flagship models from Google and OpenAI. In January, UC Berkeley researchers released an open-source reasoning model called Sky-T1 that cost $450, "demonstrating that it is possible to replicate high-level reasoning capabilities affordably and efficiently," per its blog post. There's also the open-source rStar-Math reasoning model from Microsoft Asia researchers, Tulu 3 from non profit research institute Ai2, and HuggingFace has its own initiative to replicate DeepSeek's R1.
As high-quality models become more accessible and cheaper, we're starting to see a power shift from the few AI heavy hitters, to the many.
Topics Artificial Intelligence OpenAI
Peak Halloween meme costume achieved with 'Babadook' clap back'Inferno' brings the 'Da Vinci Code' series to a new box office lowPeak Halloween meme costume achieved with 'Babadook' clap backMartha Stewart's Halloween costume is gory and gloriousOutrage as government says Australia closed for life to boat asylum seekersProof that David Pumpkins belongs in every horror movie17 Halloween costumes that definitely won't get you laidPeople are being asked to help spot real life 'witches' marks' for HalloweenRequiem for the Mac ProYour next dessert obsession is probably coming from AustraliaKylie Jenner's 'Dirrty' Christina Aguilera costume is pop culture perfectionThese connected electric bicycles are set to run on Singapore's roads by 2017Dad who dressed as Princess Peach for his daughter addresses critics in open letterA Cubs fan's World Series diary: On the edge of disasterJustin Bieber addresses his temper tantrum in an open letterJustin Bieber addresses his temper tantrum in an open letterTaipei raises rainbow flag at City Hall as thousands march in pride paradeWho's left in the ESL 'CS:GO' Pro League finals?These 'Harry Potter' Halloween costumes all got the J.K. Rowling seal of approvalTaipei raises rainbow flag at City Hall as thousands march in pride parade Samsung CEO admits Galaxy Fold launch was 'embarrassing' Facebook's crypto plan is already influencing the most powerful banks Sony's new wireless earbuds will kill the noise around you Reporter's calm stiff arm is the stoic mood we need for 2019 The very best apps of 2019 (so far) High school valedictorian comes out at graduation to wild applause 'The L Word' creator on the next generation of LGBTQ love on screen Michelle Obama calls out Barack for his not so funny dad jokes in Christmas address Early morning fire at fireworks store leads to surprise show New report sheds more light on Jony Ive's departure Facebook and YouTube to fight sensational and misleading health claims Mistletoe man rushing to the airport is like 'Love Actually' IRL Taiwan is one step closer to legalizing same 'Stranger Things' star Millie Bobby Brown's post about Hopper will make you well up Kylie Jenner and Tyga made a NSFW video to test your eyeballs It's Jony Ive's world now. We just live in it. OK, calling it: This little boy stars in 2016's most heartwarming Christmas video Anchorage roasts as heat records break across Alaska Of course Mark Zuckerberg finished his year of running Revolut launches new, effortless way to donate to charities
2.5983s , 10133.6875 kb
Copyright © 2025 Powered by 【Watch Project X Online】,Charm Information Network