ShengShu Technology has unveiled significant upgrades to its AI video platform Vidu during its Global Creativity Week, advancing the technology from experimental use toward large-scale, production-ready applications. The releases center on Vidu Q3 - the industry's first long-form AI video model with native synchronized audio and video generation - and Vidu Q2 Reference-to-Video Pro, which introduces flexible reference-driven workflows for greater control and consistency.
Quick Intel
The upgrades address a key industry challenge: translating frontier AI capabilities into measurable productivity for professional creators and enterprises. Vidu Q3 integrates audio and visuals at the model level, eliminating post-production synchronization issues and supporting narrative formats like animation, short dramas, and films with native 1080p rendering and text-in-visual composition.
Vidu Q2 Reference-to-Video Pro shifts from one-shot generation to structured, reference-based creation. Users can combine inputs across people, scenes, actions, expressions, effects, and textures—then modify, add, or remove elements iteratively. This approach enhances consistency, speeds revisions, and suits collaborative production environments where precision and efficiency are essential.
Reference Hub introduces a community-driven ecosystem for reusable, high-quality visual elements. Creators select from standardized subjects to build professional-grade videos more reliably, lowering barriers while enabling scalable asset reuse across projects.
The enhanced Vidu Agent streamlines workflows with automated planning and real-time generation, incorporating features like custom background music and storyboard-based controls to reduce manual coordination.
"Advanced AI only matters when it creates measurable productivity," said Yihang Luo, CEO of ShengShu Technology. "Our focus is not just model capability, but how quickly that capability becomes usable in daily production."
These advancements build on ShengShu Technology's engineering foundation, including the pioneering U-ViT architecture and the TurboDiffusion collaboration with Tsinghua University's TSAIL Lab, which accelerated inference significantly. The rapid iteration demonstrates "China Speed" in converting research into deployable tools embedded in real workflows.
Global adoption continues to grow, with Vidu powering creative platforms, enterprise projects, and commercial applications across industries. The platform's shift toward production-scale infrastructure positions it as a leading solution in the evolving multimodal generative AI landscape.