Home
News
Tech Grid
Interviews
Anecdotes
Think Stack
Press Releases
Articles
  • Generative AI

Vidu Q3 & Q2 Pro Upgrades: Native Audio-Video AI Leadership


Vidu Q3 & Q2 Pro Upgrades: Native Audio-Video AI Leadership
  • by: Source Logo
  • |
  • February 2, 2026

ShengShu Technology has unveiled significant upgrades to its AI video platform Vidu during its Global Creativity Week, advancing the technology from experimental use toward large-scale, production-ready applications. The releases center on Vidu Q3 - the industry's first long-form AI video model with native synchronized audio and video generation - and Vidu Q2 Reference-to-Video Pro, which introduces flexible reference-driven workflows for greater control and consistency.

Quick Intel

  • Vidu Q3 ranks No.1 in China and No.2 globally per Artificial Analysis benchmarks, delivering up to 16 seconds of native HD audio-video output with multilingual voice, precise lip sync, cinematic controls, and seamless transitions.
  • Vidu Q2 Reference-to-Video Pro enables the world's first "anything-as-reference" model, supporting multiple video and image references for iterative editing without full regeneration.
  • Reference Hub 2.0 launches as the first AI video subject-based creation community, offering over 200 curated subjects across categories like camera movement, narrative structure, and character performance.
  • Upgraded Vidu Agent 1.0 provides one-click video creation with intelligent planning, custom music uploads, narration removal, and storyboard editing.
  • Since April 2024 launch, Vidu has reached users in over 200 countries, generated more than 500 million videos, and serves 40 million creators plus 10,000+ developers and enterprises.
  • Commercial projects now represent over 70% of output, with integrations in platforms like PhotoGrid, Pollo AI, and brands such as Odin for virtual try-on experiences.

The upgrades address a key industry challenge: translating frontier AI capabilities into measurable productivity for professional creators and enterprises. Vidu Q3 integrates audio and visuals at the model level, eliminating post-production synchronization issues and supporting narrative formats like animation, short dramas, and films with native 1080p rendering and text-in-visual composition.

Vidu Q2 Reference-to-Video Pro shifts from one-shot generation to structured, reference-based creation. Users can combine inputs across people, scenes, actions, expressions, effects, and textures—then modify, add, or remove elements iteratively. This approach enhances consistency, speeds revisions, and suits collaborative production environments where precision and efficiency are essential.

Reference Hub introduces a community-driven ecosystem for reusable, high-quality visual elements. Creators select from standardized subjects to build professional-grade videos more reliably, lowering barriers while enabling scalable asset reuse across projects.

The enhanced Vidu Agent streamlines workflows with automated planning and real-time generation, incorporating features like custom background music and storyboard-based controls to reduce manual coordination.

"Advanced AI only matters when it creates measurable productivity," said Yihang Luo, CEO of ShengShu Technology. "Our focus is not just model capability, but how quickly that capability becomes usable in daily production."

These advancements build on ShengShu Technology's engineering foundation, including the pioneering U-ViT architecture and the TurboDiffusion collaboration with Tsinghua University's TSAIL Lab, which accelerated inference significantly. The rapid iteration demonstrates "China Speed" in converting research into deployable tools embedded in real workflows.

Global adoption continues to grow, with Vidu powering creative platforms, enterprise projects, and commercial applications across industries. The platform's shift toward production-scale infrastructure positions it as a leading solution in the evolving multimodal generative AI landscape.

  • Generative AIAI Video GenerationCreative AI
News Disclaimer
  • Share