Boson AI Releases Open-Source 4B Audio Model Higgs Audio v3 with Streaming Emotion Control

According to monitoring by Beating, AI startup Boson AI has released the weights for Higgs Audio v3 TTS, its autoregressive text-to-speech (TTS) model. The model is built on Qwen3-4B, has approximately 4 billion parameters, and is optimized for streaming use in real-time voice agents. It can begin synthesizing speech before the input text has been fully generated, which reduces latency in live voice conversations.

Higgs Audio v3 TTS supports more than 100 languages and dialects and achieves single-digit word error rates on test sets such as Seed-TTS, CV3, and MiniMax-Multilingual. It supports zero-shot voice cloning and accepts inline control tags embedded directly in the input text, covering more than 20 emotions as well as tone, speech rate, pitch, pauses, and effects such as coughing, sighing, and laughter.

Boson AI worked with the LMSYS team to optimize end-to-end serving performance on the SGLang-Omni inference framework. In testing on an H100 GPU, the model reached a real-time factor (RTF) of 0.147 at a concurrency of one. The weights are available on Hugging Face under a non-commercial research license.
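To put the reported RTF in concrete terms, the short sketch below assumes the usual definition of real-time factor (synthesis time divided by the duration of the audio produced). The function names and the 10-second clip length are illustrative choices, not part of Boson AI's release; only the 0.147 figure comes from the report.

```python
# Minimal sketch: interpreting a real-time factor (RTF).
# Assumption: RTF = synthesis time / duration of generated audio.
# Only the 0.147 value comes from the article; everything else is illustrative.

REPORTED_RTF = 0.147  # single-stream figure reported for one H100 GPU


def synthesis_seconds(audio_seconds: float, rtf: float) -> float:
    """Return the compute time needed to generate `audio_seconds` of speech."""
    return audio_seconds * rtf


def speedup_over_real_time(rtf: float) -> float:
    """Return how many seconds of audio are produced per second of compute."""
    return 1.0 / rtf


if __name__ == "__main__":
    clip = 10.0  # hypothetical clip length in seconds
    print(f"{clip:.0f} s of audio takes about "
          f"{synthesis_seconds(clip, REPORTED_RTF):.2f} s to synthesize")
    print(f"Roughly {speedup_over_real_time(REPORTED_RTF):.1f}x faster than real time")
```

Under that assumption, any RTF below 1.0 means audio is generated faster than it plays back, which is the headroom a streaming voice agent needs to stay ahead of the listener.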
