Cointime

Download App
iOS & Android

Chinese Tech Giant Alibaba Unveils New AI Video Tool

The company announced the release of the model’s weights today after publishing the model’s research paper last month.

I2VGen-XL is engineered using cascaded diffusion models, the paper explains, a sophisticated AI technique that ensures the generated videos are not only visually impressive but also contextually coherent and semantically accurate. It operates on a two-stage process: the base stage focuses on maintaining coherence with the input text and images, and the refinement stage enhances the details and resolution of the video, achieving up to 1280x720 pixels.

This technique may sound similar to those used to generate images with SDXL. Unlike SD 1.5 and SD 2.1 which relied on a single model, Stability AI developed two different models, a base and a refiner, which should be combined to generate the best quality images possible.

Alibaba Cloud says the model's training utilized an extensive dataset of around 35 million text-to-video pairs and a staggering 6 billion text-to-image pairs. Such a vast dataset ensures the model's versatility and accuracy across various scenarios and subjects.

This release comes as the global tech landscape is witnessing heightened tensions and competition, particularly between the US and China. Amidst a backdrop of trade restrictions and a push for technological self-reliance, Alibaba's move is both timely and strategically significant for the country.

Alibaba's latest innovation is not an isolated development but part of a longer narrative of technological rivalry. With the US imposing restrictions on chip exports and China responding with its countermeasures, the race for AI supremacy has accelerated. This environment has spurred advancements in indigenous technologies, with both nations vying for a leading position in AI, semiconductor technology, and 5G innovation.

When contrasted with other notable advancements in the field, such as Pika Labs' model and Stable Video Diffusion, I2VGen-XL distinguishes itself through its unique approach and high semantic accuracy. A demo with several examples of using HiGen (a diffusion model) with I2VGen-XL shows a major improvement in temporal and frame consistency when compared to the use of HiGen alone.

Alibaba's I2VGen-XL model represents a significant milestone in the AI landscape because it provides an alternative to models that are either banned for Chinese users or could be restricted in the future by the US or the Chinese government.

Alibaba goes beyond just e-commerce. It has been a significant player in emerging technologies for a while, consistently pushing new developments in the realms of AI, the metaverse, software, and even digital currencies.

In AI-driven animation, besides sI2VGen-XL, Alibaba's "Animate Anyone" model stands out. This tool transforms static images into dynamic animations, employing a novel framework called ReferenceNet. Integrating sophisticated diffusion models achieves temporally stable and visually consistent videos.

Alibaba Cloud also partnered with Avalanche to launch its Cloudverse platform. This technology offers businesses a seamless pathway to create and maintain their digital universes. The strategic alliance with Avalanche and Metaverse Universal Assets DAO's involvement in middleware solutions highlights Alibaba's collaborative approach and its dedication to harnessing Web3 technologies.

Moreover, Jack Ma's insights on digital currencies point to Alibaba's keen interest in the future of global finance. Ma's advocacy for the transformative role of digital currencies in establishing a new financial system aligns with the growing global trend toward digitalization in finance. The Alibaba CEO portrayed himself as a crypto skeptic, but such a position is far from being a crypto hater, with Alibaba launching a Blockchain as a Service business in the middle of 2018’s infamous crypto winter.

https://decrypt.co/210018/alibaba-ai-text-to-video-generative-cloud

By Jose Antonio Lanz

Edited by Ryan Ozawa.

Comments

All Comments

Recommended for you

  • US Spot Ethereum ETF Sees $5.6 Million Net Outflow

    On May 15, according to monitoring data from Farside Investors, the US spot Ethereum ETF experienced a net outflow of $5.6 million yesterday.

  • Xi Jinping Holds Restricted Meeting with Trump in Zhongnanhai

    May 15 — Chinese President Xi Jinping held a restricted meeting with US President Donald Trump at Zhongnanhai. (CCTV News)

  • US Spot Bitcoin ETF Sees Net Inflow of $131.32 Million Yesterday

    On May 15, according to monitoring by Trader T, the US spot Bitcoin ETF experienced a net inflow of $131.32 million yesterday.

  • Kechuang 50 Index Declines by 2%

    On May 15, the Kechuang 50 Index experienced a decline of 2.36% during the day. Among the constituent stocks, JinkoSolar fell by 7.60%, Tianyue Advanced dropped by 7.11%, Canadian Solar decreased by 5.54%, and Zhongke Feiyun fell by 5.64%. (Dongxin News Agency)

  • Nikkei 225 Index Falls Below 62,000 Points for the First Time Since May 7

    On May 15, the Nikkei 225 index fell below 62,000 points during trading hours, marking the first time it has done so since May 7. (Tokyo News Agency)

  • U.S. 30-Year Treasury Yield Rises to 5.056%, Reaching 10-Month High

    On May 15, the yield on U.S. 30-year Treasury bonds rose to 5.056%, marking a 10-month high, while the yield on 10-year Treasury bonds reached 4.512%. (Dongxin News Agency)

  • Japan's 10-Year Government Bond Yield Reaches Highest Level in Nearly 29 Years

    On May 15, according to CCTV, the yield on newly issued 10-year government bonds, which serves as a long-term interest rate indicator in Japan's domestic bond market, rose to 2.665%, reaching its highest level in nearly 29 years. This increase is attributed to inflationary pressures from rising oil prices and market concerns about the deterioration of fiscal policy due to Japan's domestic economic measures, leading to selling pressure on bonds. (Dongxin News Agency)

  • ETH Surpasses $2300

    Market data shows that ETH has surpassed $2300, currently priced at $2300.06, with a 24-hour increase of 1.42%. The market is experiencing significant volatility, so please ensure proper risk management.

  • ETH Surpasses $2300

    Market data shows that ETH has surpassed $2300, currently priced at $2300.02, with a 24-hour increase of 1.97%. The market is highly volatile, so please ensure proper risk management.

  • Trump's Securities Trading Records Exposed, Invests in Nvidia and Apple

    On May 15, the U.S. Office of Government Ethics released two new financial disclosure documents on Thursday, revealing that Trump disclosed large-scale financial transactions worth at least $220 million earlier this year, involving securities from several major U.S. companies. The newly disclosed documents cover the first three months of 2026, with transaction values ranging broadly from $220 million to approximately $750 million. Significant purchases valued between $1 million and $5 million include S&P 500 index funds, Nvidia, and Apple. Large sales valued between $5 million and $25 million include Microsoft, Amazon, and Meta. The documents do not consistently specify the exact types of securities involved, such as whether they are stocks or corporate bonds, nor do they indicate which accounts the transactions occurred in or who authorized the trades. Such disclosure documents are mandatory but only partially reflect officials' financial activities, as they only list transactions exceeding $1,000 and present them in broad value ranges without disclosing specific transaction prices, profit situations, or whether assets were directly purchased or held through managed accounts. Trump's assets are held in a trust controlled by his children, and some transactions in the new documents indicate the involvement of brokers as agents. (NBC)