Tik Tok parent company Byte Dance releases new open source Seed-OSS-36B model with 512K token context

Tik Tok parent company Byte Dance releases new open source Seed-OSS-36B model with 512K token context

In a significant move set to shake up the AI landscape, ByteDance, the parent company of TikTok, has announced the release of its latest open-source AI model, Seed-OSS-36B, on the popular code-sharing platform, Hugging Face. This innovative model aims to enhance advanced reasoning capabilities and is tailored for developers needing extensive usability. What sets Seed-OSS-36B apart is its remarkable ability to handle a context of up to 512,000 tokens, allowing it to process and generate responses based on a vast amount of information at once. This feature significantly surpasses many of its competitors, including leading models from OpenAI and Anthropic. The Seed Team, which was established in 2023, has introduced three distinct versions of this model: the synthetic and non-synthetic Base models and the Instruct variant. The synthetic model is trained with added instruction data to achieve superior performance on various benchmarks, making it an excellent general-purpose tool. In contrast, the non-synthetic version aims to provide a more unbiased foundation, free from the influences of synthetic data. The Instruct model, on the other hand, is specifically designed for task execution and following instructions, making it ideal for applications that require precise operation. All three models are made available under the Apache-2.0 license, which allows users to modify and redistribute them freely. This approach supports commercial applications without incurring any licensing fees from ByteDance, enhancing accessibility for businesses and developers. In a landscape where Chinese tech companies are increasingly releasing powerful open-source models, Seed-OSS-36B is positioned for global application, emphasizing versatility in reasoning tasks and multilingual capabilities. Its architecture features 36 billion parameters across 64 layers and integrates advanced technologies like causal language modeling and grouped query attention. A standout feature of Seed-OSS-36B is its unique 'thinking budget' capability, which lets developers dictate the amount of reasoning the model undertakes before providing answers. This functionality can optimize performance based on task complexity, allowing users to fine-tune their experience effectively. Early benchmarks indicate that Seed-OSS-36B ranks among the top open-source models available today. The Instruct variant has particularly excelled in various categories, while the non-synthetic Base model also holds its ground competitively. This positions the Seed-OSS series as a strong contender for enterprises focused on math-heavy, coding, and long-context applications. To further ease the integration process, the Seed Team has provided comprehensive support for deployment, including quantization options and configuration examples for scalable serving. This is especially beneficial for technical leaders managing limited resources, making experimentation with high-parameter models more feasible. Overall, the launch of Seed-OSS-36B by ByteDance's Seed Team not only offers high performance but also flexibility in deployment, providing a valuable resource for researchers and developers navigating the evolving open-source AI landscape.

Sources : VentureBeat

Published On : Aug 20, 2025, 23:20

Startups
Anduril Eyes Massive $60 Billion Valuation in Latest Funding Bid

Palmer Luckey’s defense technology firm, Anduril, is currently engaged in a substantial funding round, targeting a valua...

TechCrunch | Mar 03, 2026, 20:10
Anduril Eyes Massive $60 Billion Valuation in Latest Funding Bid
Cybersecurity
Government Hacking Tools Leak: Cybercriminals Exploit iPhone Vulnerabilities

Security experts have uncovered a set of sophisticated hacking tools designed to breach older iPhone software, which hav...

TechCrunch | Mar 04, 2026, 24:00
Government Hacking Tools Leak: Cybercriminals Exploit iPhone Vulnerabilities
AI
Leadership Shakeup at Alibaba's Qwen AI Project Amid Intensifying Competition

In a surprising turn of events, Alibaba's Qwen AI initiative has lost a key technical figure, Junyang Lin, just one day ...

TechCrunch | Mar 03, 2026, 23:35
Leadership Shakeup at Alibaba's Qwen AI Project Amid Intensifying Competition
Startups
Palantir Seeks Reunion with Former Employees, Invoking Epic Fantasy Themes

In a bold move, Palantir has reached out to its former employees with an enticing invitation reminiscent of epic tales. ...

Business Insider | Mar 03, 2026, 21:20
Palantir Seeks Reunion with Former Employees, Invoking Epic Fantasy Themes
AI
AI Industry Targets Congressional Candidate with Heavy Ad Spending

Recent advertisements attacking New York Assembly member Alex Bores highlight his previous affiliation with Palantir, a ...

TechCrunch | Mar 03, 2026, 22:05
AI Industry Targets Congressional Candidate with Heavy Ad Spending
View All News