UiPath, a global leader in agentic automation, announced that its UiPath Screen Agent powered by Claude Opus 4.5 has achieved the top ranking on the OSWorld-Verified benchmark. This independent evaluation by the OSWorld research group confirms the agent's superior effectiveness in handling real-world computer-use tasks, providing enterprises with strong validation for deploying agentic AI across diverse workflows.
The OSWorld benchmark stands out as a comprehensive, unified platform designed to assess open-ended computer tasks in realistic environments. It supports multimodal agents by simulating interactions across multiple operating systems and applications, making it a reliable standard for evaluating agentic AI in enterprise contexts. By achieving the highest ranking, UiPath Screen Agent powered by Claude Opus 4.5 demonstrates exceptional performance in handling arbitrary, real-world scenarios that go beyond scripted automation.
As a foundational element of UiPath ScreenPlay, UiPath Screen Agent leverages leading large language models to interpret natural language instructions and automate user interfaces effectively. This approach simplifies the creation and execution of sophisticated workflows, allowing users to automate complex processes without extensive coding. The top OSWorld ranking highlights its ability to outperform competing models in accuracy, adaptability, and reliability when navigating diverse applications and tasks.
“Having had an early look at UiPath ScreenPlay, we’re excited about its potential to meaningfully improve how we scale automation. Its adaptive intelligence could support our growing partner ecosystem while helping reduce ongoing maintenance so our teams can stay focused on growth,” said Noble Keyser, manager of Enterprise AI and Automation, SimpleTire.
The benchmark success reinforces UiPath's advancements in UI automation powered by agentic AI. It builds directly on prior achievements, such as the No. 2 ranking with OpenAI GPT-5, and underscores the platform's readiness for large-scale enterprise deployments.
“Organizations need the confidence that their large-scale commitments to AI will pay off, which is where benchmarks can be incredibly helpful in validating specific use cases and critical workflows,” said Mircea Neagovici-Negoescu, Senior Vice President of AI and Research, UiPath. “Investing in AI and agents at enterprise speed and scale can be daunting. This ranking underscores UiPath ongoing investment in this area and its commitment to empowering customers with enterprise-grade computer-use capabilities.”
This top ranking positions UiPath as a frontrunner in delivering secure, governed, and high-performing agentic automation solutions that help organizations realize the full value of AI across industries.
About UiPath
UiPath is a global leader in agentic automation, empowering enterprises to harness the full potential of AI agents to autonomously execute and optimize complex business processes. The UiPath Platform™ uniquely combines controlled agency, developer flexibility, and seamless integration to help organizations scale agentic automation safely and confidently. Committed to security, governance, and interoperability, UiPath supports enterprises as they transition into a future where automation delivers on the full potential of AI to transform industries.