Chinese AI company DeepSeek has released Janus-Pro, a multimodal model that marks its entry into text-to-image generation. Built as an upgrade to the JanusFlow model, it outperforms well-known models such as OpenAI's DALL-E 3 on several benchmarks. Its strong performance and MIT open-source license give it considerable potential for commercial use. The release marks a significant step forward in multimodal AI and adds momentum to China's AI development.
DeepSeek, the Chinese large-model developer, has released its new Janus-Pro multimodal large model, formally entering the text-to-image field. The release represents a major advance for DeepSeek in multimodal AI technology.
On the GenEval and DPG-Bench benchmarks, Janus-Pro-7B outperforms not only OpenAI's DALL-E 3 but also popular models such as Stable Diffusion and Emu3-Gen. Janus-Pro is released under the MIT open-source license, which permits unrestricted commercial use. According to DeepSeek, Janus-Pro is an advanced version of the JanusFlow model released on November 13, 2024.

Compared with the previous generation, Janus-Pro uses an optimized training strategy, expanded training data, and a larger model size. These changes bring significant gains in multimodal understanding and text-to-image instruction following, and make text-to-image generation more stable.

Although Janus-Pro can only process images at 384x384 resolution, its results are impressive for such a compact model.
As a multimodal model, Janus-Pro can not only generate images but also describe them, recognize landmarks, read text within images, and explain the knowledge an image contains.
Key points:
DeepSeek has released the Janus-Pro multimodal model, entering the text-to-image field.
On benchmarks, Janus-Pro-7B outperforms popular models such as OpenAI's DALL-E 3.
Janus-Pro is released under the MIT open-source license, which permits unrestricted commercial use.
The arrival of Janus-Pro shows that Chinese large models are advancing rapidly in text-to-image generation. Its open-source release also gives Chinese developers and companies valuable resources and opportunities, and its future development is worth watching.