UiPath Screen Agent Tops OSWorld Benchmark with Claude Opus 4.5

by:
|
January 15, 2026

UiPath, a global leader in agentic automation, announced that its UiPath Screen Agent powered by Claude Opus 4.5 has achieved the top ranking on the OSWorld-Verified benchmark. This independent evaluation by the OSWorld research group confirms the agent's superior effectiveness in handling real-world computer-use tasks, providing enterprises with strong validation for deploying agentic AI across diverse workflows.

Quick Intel

UiPath Screen Agent powered by Claude Opus 4.5 ranks No. 1 on the OSWorld-Verified benchmark for agentic automation.
OSWorld evaluates multimodal agents in a scalable, real computer environment across 369 open-domain tasks involving web, desktop apps, OS file I/O, and multi-app workflows.
The benchmark tests performance against general-purpose and specialized computer-using models as well as other agentic frameworks.
UiPath Screen Agent enables natural language-driven UI automation for complex, end-to-end tasks within UiPath ScreenPlay.
This milestone follows UiPath Screen Agent powered by OpenAI GPT-5 ranking No. 2 on the same benchmark in September 2025.
Enterprises gain confidence in scaling agentic AI with validated, production-ready capabilities for secure and efficient automation.

OSWorld Benchmark: A Rigorous Test for Agentic AI

The OSWorld benchmark stands out as a comprehensive, unified platform designed to assess open-ended computer tasks in realistic environments. It supports multimodal agents by simulating interactions across multiple operating systems and applications, making it a reliable standard for evaluating agentic AI in enterprise contexts. By achieving the highest ranking, UiPath Screen Agent powered by Claude Opus 4.5 demonstrates exceptional performance in handling arbitrary, real-world scenarios that go beyond scripted automation.

Core Capabilities of UiPath Screen Agent

As a foundational element of UiPath ScreenPlay, UiPath Screen Agent leverages leading large language models to interpret natural language instructions and automate user interfaces effectively. This approach simplifies the creation and execution of sophisticated workflows, allowing users to automate complex processes without extensive coding. The top OSWorld ranking highlights its ability to outperform competing models in accuracy, adaptability, and reliability when navigating diverse applications and tasks.

Enterprise Validation and Real-World Impact

“Having had an early look at UiPath ScreenPlay, we’re excited about its potential to meaningfully improve how we scale automation. Its adaptive intelligence could support our growing partner ecosystem while helping reduce ongoing maintenance so our teams can stay focused on growth,” said Noble Keyser, manager of Enterprise AI and Automation, SimpleTire.

The benchmark success reinforces UiPath's advancements in UI automation powered by agentic AI. It builds directly on prior achievements, such as the No. 2 ranking with OpenAI GPT-5, and underscores the platform's readiness for large-scale enterprise deployments.

“Organizations need the confidence that their large-scale commitments to AI will pay off, which is where benchmarks can be incredibly helpful in validating specific use cases and critical workflows,” said Mircea Neagovici-Negoescu, Senior Vice President of AI and Research, UiPath. “Investing in AI and agents at enterprise speed and scale can be daunting. This ranking underscores UiPath ongoing investment in this area and its commitment to empowering customers with enterprise-grade computer-use capabilities.”

This top ranking positions UiPath as a frontrunner in delivering secure, governed, and high-performing agentic automation solutions that help organizations realize the full value of AI across industries.

About UiPath

UiPath is a global leader in agentic automation, empowering enterprises to harness the full potential of AI agents to autonomously execute and optimize complex business processes. The UiPath Platform™ uniquely combines controlled agency, developer flexibility, and seamless integration to help organizations scale agentic automation safely and confidently. Committed to security, governance, and interoperability, UiPath supports enterprises as they transition into a future where automation delivers on the full potential of AI to transform industries.

Agentic AIAgentic AutomationUi PathEnterprise AIAI Automation

Share

Join 110k+ Avid Tech Readers!

Trending tech news, interviews & insights straight to your inbox.

UiPath Screen Agent Tops OSWorld Benchmark with Claude Opus 4.5

Quick Intel

OSWorld Benchmark: A Rigorous Test for Agentic AI

Core Capabilities of UiPath Screen Agent

Enterprise Validation and Real-World Impact

Join 110k+ Avid Tech Readers!

About Us

Quick Links

Connect With Us

Search TechIntelPro

Subscribe to Our Newsletter

UiPath Screen Agent Tops OSWorld Benchmark with Claude Opus 4.5

Quick Intel

OSWorld Benchmark: A Rigorous Test for Agentic AI

Core Capabilities of UiPath Screen Agent

Enterprise Validation and Real-World Impact

Join 110k+ Avid Tech Readers!

About Us

Quick Links

Connect With Us