Alibaba launches video repainting tool Diffutoon to transform live-action videos into anime style in seconds

Author：Eve Cole Update Time：2025-02-24 22:00:02

Diffutoon, an AI tool jointly launched by Alibaba and East China Normal University, allows videos to be transformed into animation style with one click, completely changing the video production process. It uses advanced AI image generation technology - diffusion model to convert real videos into various animation styles, keeping the picture smooth and natural, rich in details, and supporting high-resolution output and intelligent editing. Whether you are a short video creator or a professional, Diffutoon can provide powerful creative assistance, making video production easier and more fun than ever before.

Imagine holding a mobile phone in your hand, recording a video at will, and then with one click, the video magically turns into an anime-style picture. This is not a dream, this is a miracle brought by Diffutoon.

Diffutoon, an AI tool jointly developed by Alibaba and East China Normal University, is making video production easier and more fun than ever with its unique charm.

Video from official project page

Main feature highlights:

Animation style conversion: Convert realistic videos into various animation styles, allowing seamless connection between reality and the two-dimensional world.

Content consistency: Maintain the consistency of video content, avoid flickering and distortion, and make every frame smooth and natural.

High-resolution output: Supports the generation of high-resolution, long-term videos to meet professional production needs.

Smart editing: Edit video content based on user text prompts, such as changing colors or adding special effects.

Detail preservation: Finely preserve lighting, hair, posture and other details without losing the animation-style visual experience.

Low-resolution optimization: Even if the input video resolution is low, high-quality anime-style videos can be output.

The core of Diffutoon is the advanced AI image generation technology - diffusion model. This model generates new images and videos by learning from large amounts of image data. Diffutoon is optimized on this basis, not only processing single pictures, but also cartoonizing videos.

Diffutoon first analyzes each frame in the video to capture key information such as character movements and postures, and then converts the picture style into cartoon animation while retaining this information. The whole process is like magic, making the video look like animation while maintaining the original movement and structure.

Using efficient technologies such as "Flash Attention", Diffutoon greatly improves the processing speed, and even high-definition videos shot by mobile phones can be quickly converted into animation effects.

Diffutoon has a wide range of application scenarios, ranging from animation production, game development to advertising material production. For short video creators, it is a powerful creative tool that makes content creation more diverse.

Although it still needs to be perfected when dealing with complex scenes or fast-moving images, Diffutoon's potential cannot be underestimated. With the continuous advancement of technology, we have reason to believe that Diffutoon will bring revolutionary changes to video creation.

Project page: https://top.aibase.com/tool/diffutoon

Code: https://github.com/modelscope/DiffSynth-Studio

Paper: https://arxiv.org/abs/2401.16224

Diffutoon brings new possibilities to the field of video production with its convenient operation and powerful functions. Although there are still some areas for improvement, its future development potential is huge, and it is worth looking forward to more breakthroughs in the field of video creation in the future. Visit the project page and repository to learn more.