Dahua Technology, a leading provider of video-centric AIoT solutions, has launched its Xinghan Large-scale AI Models, marking a significant advancement in industry-grade AI systems. This next-generation platform combines large-scale visual intelligence with multimodal and language capabilities, designed to tackle real-world complexities and drive intelligent transformation in sectors reliant on advanced IT and SaaS technologies.
At the heart of Xinghan lies a visual analysis foundation, augmented by multimodal processing and industry-specific knowledge to deliver tailored AI for diverse scenarios. This edge-cloud synergy powers scalable intelligence, evolving from cutting-edge research to practical deployments in AIoT ecosystems. The upgraded structure includes L-Series for natural language interaction, V-Series for video-centric tasks, and M-Series for handling heterogeneous data like text, images, audio, and video. By embedding deep expertise, Xinghan bridges the gap between theoretical AI and commercial viability, enabling enterprises to deploy adaptive solutions in SaaS-driven environments without extensive overhauls.
The V-Series, focused on Xinghan Vision Models, prioritizes key targets such as humans, motor vehicles, and non-motor vehicles to simplify complexity while upholding precision in video analytics. Perimeter Protection extends detection to smaller objects, minimizing false positives and broadening camera ranges in security applications. WizTracking introduces superior algorithms for managing occlusions and posture variations, yielding a 50% accuracy boost essential for dynamic AIoT monitoring. Crowd Map excels in long-range small-target identification, incorporating umbrella compensation for 80% improved rainy-weather performance, alongside a 2.5x expanded analysis scope and capacity for dense, low-light crowds up to 5,000 individuals. Scene Adaptive AI WDR intelligently configures cameras via spatial and contextual analysis, while AI Rule Assist automates intrusion rule setup with one-click precision, streamlining IT-integrated surveillance workflows.
M-Series Xinghan Multimodal Models process and fuse diverse data types for enhanced efficiency and natural interactions, expanding AIoT applications beyond traditional boundaries. WizSeek transforms video investigations by allowing natural language queries to retrieve relevant footage from archives, accelerating response times in enterprise security operations. Text-Defined Alarms empower users to configure alerts via descriptive language, drastically lowering development barriers and supporting rapid, customizable setups for varied real-world needs. This integration fosters seamless human-AI collaboration, optimizing SaaS platforms for proactive decision-making across industries.
Xinghan's launch underscores Dahua's commitment to evolving AIoT technologies, providing scalable tools that enhance operational intelligence and sustainability. As organizations navigate increasing data complexity, these models equip businesses with the foundation for innovative, efficient transformations in visual and multimodal AI applications.