Kling AI, a cutting-edge tool for generating imaginative images and videos using state-of-the-art generative AI techniques, held its “From Vision to Screen” Model 2.0 Launch Event in Beijing on April 15, announcing an upgrade to its foundational model and officially unveiling the KLING 2.0 Video Generation Model and KOLORS 2.0 Image Generation Model globally.
Redefining human-AI interaction
Since its launch last June, Kling AI has undergone more than 20 iterations. Its global user base has now surpassed 22 million, and over 15,000 developers from around the world have integrated Kling’s Application Programming Interface (API) into a wide range of industry applications.
“Kling AI has consistently focused on improving the foundational capabilities of its models, enhancing image quality, and introducing innovative features to meet the diverse needs of its users,” Mr. Gai Kun, Senior Vice President of Kuaishou and Head of the Community Science Department, said at the event, adding that the tool aims to empower everyone to tell compelling stories with AI, enabling more precise and complex creative expression.
With the latest upgrade to its foundational models, the KLING 2.0 model continues to lead globally in areas such as dynamics, prompt adherence, and visual aesthetics, while the KOLORS 2.0 model has seen improvements in prompt adherence, cinematic visual quality, and representation of artistic styles.
Gai said that both models have ranked first in the industry based on the team’s internal multi-metric comparative evaluation.
According to the latest ranking of video generation models validated by the global AI benchmark organization Artificial Analysis on March 27, Kuaishou’s Kling 1.6 Pro (High-Quality Mode) claimed the top spot in the Image-to-Video category with a benchmark Arena ELO score over 1000, followed by Google’s Veo 2 and Pika Art, which ranked second and third, respectively.
(Caption: Mr. Gai Kun, Senior Vice President of Kuaishou Technology and Head of the Community Science Department)
Gai said that AI holds tremendous potential for supporting creative expression; however, the industry still fails to meet user demands. To truly realize the vision of “telling great stories with AI,” it is essential to comprehensively enhance foundational model capabilities and define a new language for human-AI interaction.
In this 2.0 model iteration, Kling AI officially introduces a new interactive concept for AI video generation: Multi-modal Visual Language (MVL). This concept enables users to efficiently convey complex, multi-dimensional creative ideas (such as identity, appearance, style, scenes, actions, expressions, and camera movements) directly to AI by integrating multi-modal information like image references and video clips.
Multi-modal features
Based on the new MVL concept, the brand-new Multi-Elements Editor and Image Editing features were unveiled by Mr. Zhang Di, Vice President of Kuaishou and Head of Kling AI, at the launch event.
For instance, working from an existing video, the Multi-Elements Editor allows users to swap, add, or delete elements with text or image inputs, giving creators more freedom and flexibility in editing.
(Caption: User interface of Kling AI 2.0)
According to Zhang, image-to-video generation currently accounts for about 85 percent of Kling AI’s video creation, with image quality playing a crucial role in the video generation process. In the field of image generation models, Kuaishou’s KOLORS 2.0 also leads the industry.
He said that KOLORS 2.0’s text-to-image capabilities have undergone a comprehensive upgrade, including a significant improvement in prompt adherence, a substantial enhancement in cinematic aesthetic expression, and support for over 60 stylizations for image transformation. As a result, the model’s creativity and imagination in generating images have greatly improved.
Vitality of AI content generation
In addition to subscription services for end users (to C), Kling AI also offers API integration and other services to business clients (to B). Currently, Kling AI has established partnerships with thousands of domestic and international enterprise clients, including companies such as Xiaomi, Amazon Web Services, Alibaba Cloud, Freepik, and BlueFocus.
(Caption: Kling AI’s Business Partners)
Gai said that over 15,000 developers from around the world have applied Kling’s API to various industry scenarios, generating approximately 12 million images and more than 40 million videos in total.
Today, Kling AI is becoming new infrastructure for video creation in the AI era. The rapid development of generative AI technology is also reshaping multiple industries, including advertising and marketing, professional creation, film, and entertainment.
To further inspire the creative passion of AI enthusiasts, Zhang also launched the “Kling AI NextGen Initiative” at the launch event. This program aims to increase support for AI filmmakers by providing millions in funding, global promotion, personal branding, and access to the latest top features.
KLING AI
Jack Huang
Beijing, China