OpenAI Discovers New Method to Halve Inference Costs

According to a person familiar with the discussions, in a development not previously reported, OpenAI engineers told some colleagues earlier this month that they had found a way to cut model inference costs by more than half, relying on several newly developed optimization techniques. After the new techniques were applied to serving ChatGPT for free and paid account users, the number of Nvidia graphics processing units (GPUs) required fell to just a few hundred, a remarkably low figure. It is not yet clear which specific techniques OpenAI used to achieve this gain in computational efficiency. Common optimization methods in the industry include quantization, key-value (KV) caching, batching user queries instead of computing them one at a time, and routing some requests to lighter, lower-power models or model shards.
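For readers unfamiliar with the first of these methods, the following is a minimal sketch of symmetric int8 quantization. It is an illustration only, not OpenAI's method, which has not been disclosed. The function names and the sample weights are hypothetical, and a production system would typically apply this per channel or per block using a tensor library.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127].

    Returns the integer values and the scale needed to map them back.
    (Hypothetical helper for illustration only.)
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from the quantized integers."""
    return [v * scale for v in q]


# Made-up sample weights, not taken from any real model.
weights = [0.12, -0.5, 0.33, 1.27, -1.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# Each value is stored in 1 byte instead of 4 (float32), at the cost of
# a rounding error of at most half a quantization step per value.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, max_error)
```

Storing weights in 8 bits rather than 32 cuts memory use and memory traffic roughly fourfold, which is one reason quantization is commonly cited as a way to lower serving costs.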

Recommended for you

Cryptocurrency Industry Spends $189 Million in 2026 U.S. Midterm Elections

As of June 30, the cryptocurrency industry has become the largest political donor among U.S. businesses. Data shows that political spending by crypto companies for the 2026 U.S. midterm elections has reached $189 million, surpassing the total expenditure for the 2024 election cycle. Reports indicate that following progress in stablecoin regulatory legislation, the crypto industry is further increasing its political investments to promote more legislation related to digital assets. Additionally, political donations from industries such as artificial intelligence, technology, and online gambling have also seen significant growth compared to previous periods.

Micron Technology Invests $250 Million in 'Trump Account'

On June 30, Micron Technology (MU.O) announced a $250 million investment in the 'Trump Account', which will cover 1 million people. The 'Trump Account' program aims to provide eligible children with a one-time seed funding of $250. As part of this initiative, the company will introduce an employee matching benefit, offering up to $1,000 in matching funds for contributions to accounts for each child under 18.

Multiple Financial Giants Plan to Launch Stablecoin OUSD

According to reports on June 30, dozens of financial institutions, including Visa, Stripe, Mastercard, BlackRock, and Coinbase, are preparing to launch a new stablecoin called OUSD, aimed at building an on-chain dollar infrastructure for institutional payments and settlements. OUSD will operate under a consortium model, with participating institutions sharing the reserve earnings and related revenue generated by the stablecoin. This indicates a shift in the stablecoin business model from one dominated by a single issuer to a revenue-sharing system involving payments, asset management, and crypto platforms, potentially accelerating the integration of traditional finance with on-chain payments.

Bank of America: Data Center Demand Still Underestimated

On June 30, analysts at Bank of America stated in a research report that the outlook for the capital goods sector appears increasingly optimistic, with demand from data centers still underestimated among major industrial companies. These companies include Schneider Electric, ABB, Siemens, and Siemens Energy. Analysts noted that structural growth in infrastructure related to artificial intelligence will significantly expand the potential market size in the coming years. Stronger investments in power generation are leading indicators of future orders for electrical equipment, which should support continued growth in the grid and electrification businesses. The most attractive opportunities are expected to come from high-value areas such as power conversion, grid equipment, and cooling systems.

Bessent Urges Gas Retailers to Lower Prices for Independence Day

On June 30, U.S. Treasury Secretary Bessent urged gasoline retailers to lower prices in alignment with the celebrations for the 250th anniversary of the founding of the United States this month, warning that the Trump administration is closely monitoring the situation. "I call on all gas retailers, whether they are large oil company affiliates, independently operated, or part of international convenience store chains, to demonstrate good corporate behavior," Bessent stated, "especially at this significant moment of the 250th anniversary, as we are closely watching."

U.S. Stock Index Futures Turn Lower

On June 30, Dow Jones futures fell by 0.11%, S&P 500 futures declined by 0.07%, and Nasdaq 100 futures decreased by 0.05%.

S&P 500 Set to Achieve Best Quarterly Close in Six Years

On June 30, U.S. stock index futures rose slightly, with the S&P 500 index poised to record its best quarterly close in six years.

BTC Falls Below $59,000

Market data shows that BTC has fallen below $59,000, currently priced at $58,981.23, with a 24-hour decline of 2.77%. The market is experiencing significant volatility, so please ensure proper risk management.

U.S. and Brent Crude Oil Prices Rise Over 1%

On June 30, Brent crude oil rose over 1% during the day, currently priced at $74.42 per barrel. WTI crude oil reached $71 per barrel, increasing by 1.07% during the day.

Iran to Receive $3 Billion in Frozen Assets This Week

According to sources cited by Saudi Arabia's Al Arabiya television, Iran and a U.S. delegation will hold indirect negotiations in Qatar tomorrow with the involvement of mediators. These indirect talks will focus on issues concerning the Strait of Hormuz and overall regional stability. Sources indicate that Iran will receive $3 billion in frozen assets by the end of this week. Delegations from both sides are expected to meet separately today with the Prime Minister of Qatar and Pakistani mediators in Doha.
