The The Perfect Taste of Sexfloodgates have opened for building AI reasoning models on the cheap.
Researchers at Stanford and the University of Washington have developed a model that performs comparably to OpenAI o1 and DeepSeek R1 models in math and coding — for less than $50 of cloud compute credits.
What's more, the model was trained on only 1,000 questions, and took just 26 minutes and 16 Nvidia H100 GPUs. Stanford researcher Niklas Muennighoff said in a email to Mashable that the cost is an estimate based on the GPU runtime and number of H100 GPUs used.
The AI industry of late is all about how new approaches to the pre and post training process can massively save computing costs, as evidenced by DeepSeek's disruptive impact. On top of that, developers are now able to build on top of existing AI models at little or no cost, through APIs, open-source access, and even closed-source models by distilling their data, bringing the costs down even more.
According to the team's research paper which was published last Friday, s1 was trained on a dataset consisting of "1,000 carefully curated questions paired with reasoning traces and answers distilled from Gemini Thinking Experimental." Google's Gemini Thinking Experimental model is accessible with daily limits through AI Studio. While it's a closed-source model, that clearly hasn't stopped researchers from making use of its responses.
SEE ALSO: OpenAI launches 'deep research' AI agent for ChatGPTNext, the researchers used an "off the shelf" pretrained model from Alibaba-owned lab, Qwen, and performed supervised fine-tuning of its curated dataset. Then, the team created a token budget to control the amount of compute time for testing the model. If s1 went over budget on thinking tokens, it was cut off and forced to generate whatever answer it came up with. If the researchers wanted the model to spend more "test-time compute" on a problem, they would simply tell the model to "wait," which extended its thinking time and led to more accurate results.
By controlling the amount of time and compute spent on a problem, the researchers were able to show how increased thinking team leads to improved performance.
S1 is one example of open-source reasoning models that have been developed for a fraction of the cost of flagship models from Google and OpenAI. In January, UC Berkeley researchers released an open-source reasoning model called Sky-T1 that cost $450, "demonstrating that it is possible to replicate high-level reasoning capabilities affordably and efficiently," per its blog post. There's also the open-source rStar-Math reasoning model from Microsoft Asia researchers, Tulu 3 from non profit research institute Ai2, and HuggingFace has its own initiative to replicate DeepSeek's R1.
As high-quality models become more accessible and cheaper, we're starting to see a power shift from the few AI heavy hitters, to the many.
Topics Artificial Intelligence OpenAI
Mark Zuckerberg and Travis Kalanick laugh it up at 'Babes and Balls' partyIt's Saturday night and 75,000 people are watching the giraffe livestreamThese IRL 'Handmaids' are creeping people outThe world's most wanted hacker is basically a nerdy supervillainCory Booker refuses to rule out running for president at SXSWJulia LouisFacebook dips its gigantic toe into the U.S. sports sceneJulia Louis'Game of Thrones' showrunners address those spinoff rumorsApple to debut new file system with iOS 10.3Video shows Tesla Model 3 prototype driving down the streetBarack and Michelle Obama enjoy a stylish lunch with Bono in NYCA MashBash made for a 'king'Strong winds blow away 4Top 9 emojis if you're, like, really into graphsMashTalk: Did WikiLeaks just do Apple and Google a huge favor?Nicki Minaj breaks her silence about Remy Ma by dropping 3 surprise songsYou can now manage your bank account with emojisWord of the Day: Cory Booker explains 'freedom' to Bert'Game of Thrones' showrunners address those spinoff rumors The Morning News Roundup for October 8, 2014 The Morning News Roundup of October 17, 2014 Letter from Casa Dracula Reunion by Sadie Stein This Week on the Paris Review Daily An Interview with David Gordon A Conversation About “Our Secret Life in the Movies” Two Stories of Madness and Civilization Tonight: Prurience! by Dan Piepenbring Eric Fischl’s “Disturbing Innocence” Nevermore by Sadie Stein A Sentence by Robert Walser Royal Quiet Deluxe The Morning News Roundup for Thursday, October 23, 2014 Sharing Economy Staff Picks: Thirteen Days, One Hundred Brothers, Five Cars by The Paris Review A Crash Course in Prewar Hilarity The Morning News Roundup for November 3, 2014 John Berryman and Yeats Light Up Italy in the Years of Lead
1.7974s , 10132.6953125 kb
Copyright © 2025 Powered by 【The Perfect Taste of Sex】,Warmth Information Network