Videos can also be “Photoshopped”! Google DeepMind releases a world-defying AI model with movie-level special effects that’s easy to get! - AI articles

Author：Eve Cole Update Time：2025-01-24 17:32:01

Google's DeepMind team released an AI model called "Generative Omnimatte", which can break down videos into multiple layers like a skilled editor, accurately separate people, objects and backgrounds, and even "brain" "Fill in" the blocked parts to achieve various cool special effects. This technology breaks through the limitations of traditional video matting technology and can easily complete complex video editing tasks without the need for a green screen or depth information. Say goodbye to tedious operations, make video editing simple and easy to use, and everyone can become a video editing master!

Do you still remember those cool special effects in movies? Are objects disappearing out of thin air and scenes changing instantly? Are you hooked? Now, the Google DeepMind team has developed an AI model called "Generative Omnimatte" to make these special effects possible. It’s no longer just for movies! This AI is like a skilled editor, which can break down the video into multiple layers, each layer containing a complete object and the shadows, reflections and other effects it produces.

Traditional video matting technology usually relies on green screen shooting or precise depth information, which is very complex to operate. This AI model is completely free from these limitations. It does not require any additional information and can perfectly separate the characters, objects, and backgrounds in the video, and can even "brain-fill" the occluded parts. The effect is amazing. !

The core of this AI model is a video removal model called "Casper". It is like a magical eraser that can accurately erase any object you specify in the video, and its shadows and reflections will disappear, while the background will remain intact.

More importantly, it can also recombine objects and backgrounds according to user needs to achieve various creative effects, such as "teleporting" characters from one scene to another, or changing the movement speed of objects, or even making them Turn back time!

With this artifact, it will be so easy to edit videos in the future. You can add whatever special effects you want. You don’t have to worry about technical problems at all. Everyone can become an editing master! For example, you want to "teleport" a friend from home to the beach. , you only need to use Casper to cut out your friends and put them on the seaside background. Isn’t it very simple? You can even let your friends walk backwards in the video, or copy them into several friends and dance together, as you like. It’s interesting to think about it!

Of course, Generative Omnimatte is still in the development stage, and there are still some minor bugs that need to be resolved. For example, if there are multiple very similar objects in the video, the AI may not be able to tell who is who and confuse them. In addition, if the object deforms, such as a bent pole, the AI will not know how to deal with it. However, I believe that the Google DeepMind team will soon solve these problems and make Generative Omnimatte even more perfect!

Project address: https://gen-omnimatte.github.io/

Paper address: https://arxiv.org/pdf/2411.16683

Generative Omnimatte has brought revolutionary changes to video editing, and it will bring us more surprising applications and special effects in the future, let's wait and see!