position: EnglishChannel  > News > Text-to-Video Generator Sora a Mixed Blessing

Text-to-Video Generator Sora a Mixed Blessing

Source:Science and Technology | 2024-02-21 15:55:30 | Author:Tang Zhexiao

OpenAI recently announced Sora artificial intelligence, which can transforms text into video of up to 1 minute. (PHOTO: VCG)

OpenAI, the creator of ChatGPT and image generator DALL-E, launched a new artificial intelligence (AI) tool that enables users to create short videos from text prompts on February 15.

Named "Sora," this AI-video tool can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions, OpenAI said.

However, the San Francisco-based startup admitted that the new tool still has some limitations, such as possibly "mixing up left and right", according to AFP.

The technology that supports Sora is an adaptation of DALL-E. It generates a video by starting off with noise and "gradually transforms it by removing the noise over many steps," the company explained. It recognizes objects and concepts listed in the written prompt and pulls them out of the noise, so to speak, until a coherent series of video frames emerge.

The impact of Sora in shaping video generation and its implications for various industries has been seen through factors like enhanced text-to-video capabilities and exploration of novel applications.

According to AFP, the French video game giant Ubisoft hailed the tool as a "quantum leap forward" with the potential to let players and development teams express their imaginations.

"For professions like marketing or creative, multimodal models could be a game changer and could create significant cost savings for film and television makers, and may contribute to the proliferation of AI-generated content rather than using actors," Reece Hayden, senior analyst at a tech intelligence company ABI Research, told CBS MoneyWatch.

Besides the praise by some AI researchers, concerns about security were also raised.

"The video generation model is spurring excitement about advancing AI technology, along with growing concerns over how artificial deepfake videos worsen misinformation and disinformation during a pivotal election year worldwide," said New Scientist.

Hany Farid, professor at the University of California, Berkeley, specializing in image analysis and digital forensics, said "text-to-video will continue to rapidly improve — moving us closer and closer to a time when it will be difficult to distinguish the fake from the real."

The new video tool is not yet publicly available. OpenAI has restricted its use to "red teamers" and some visual artists, designers and filmmakers to test the product and deliver feedback before it is released more widely.

Editor: 汤哲枭

Top News

  • ​The Mid-Autumn Festival, one of China's most cherished traditional holidays, is deeply rooted in the country's cultural heritage. Known for the rich poetry, it has inspired and customs, the stories of the festival center around the moon, which symbolizes reunion, harmony, and togetherness.

How an American Scholar Fell for China

​William N. Brown has called China home for over 30 years. "I'm fortunate to live in a country as beautiful as China, in the vibrant city of Xiamen, and at a university as remarkable as Xiamen University," the 68-year-old American professor at Xiamen University said.

'My Wish for You is Long LifeAnd a Share in This Loveliness Far Away'

The Mid-Autumn Festival, also known as the Moon Festival or Mooncake Festival, is a harvest festival celebrated in Chinese culture. Held on the 15th day of the eighth month of the Chinese lunisolar calendar, it falls on September 17 this year according to the Gregorian calendar. On this day, the Chinese believe that the moon is at its brightest and fullest, coinciding with harvest time in the middle of autumn.

抱歉,您使用的浏览器版本过低或开启了浏览器兼容模式,这会影响您正常浏览本网页

您可以进行以下操作:

1.将浏览器切换回极速模式

2.点击下面图标升级或更换您的浏览器

3.暂不升级,继续浏览

继续浏览