• About
  • Disclaimer
  • Privacy Policy
  • Contact
Sunday, June 15, 2025
Cyber Defense GO
  • Login
  • Home
  • Cyber Security
  • Artificial Intelligence
  • Machine Learning
  • Data Analysis
  • Computer Networking
  • Disaster Restoration
No Result
View All Result
  • Home
  • Cyber Security
  • Artificial Intelligence
  • Machine Learning
  • Data Analysis
  • Computer Networking
  • Disaster Restoration
No Result
View All Result
Cyber Defense Go
No Result
View All Result
Home Data Analysis

UFO2 turns your desktop into an agent playground

Md Sazzad Hossain by Md Sazzad Hossain
0
UFO2 turns your desktop into an agent playground
585
SHARES
3.2k
VIEWS
Share on FacebookShare on Twitter


What if automating a desktop wasn’t about scripting click on patterns, however about giving your working system an clever staff of brokers? That’s the core thought behind UFO2, Microsoft’s latest open-source system that pushes past present Laptop-Utilizing Brokers (CUAs) and reinvents automation as a first-class OS abstraction. It turns your desktop into an clever management panel the place language-driven duties are executed natively, reliably, and with minimal disruption to your workflow.

Conventional desktop automation instruments like RPA techniques have all the time struggled with robustness. A minor change in a UI can wreck a whole script. CUAs tried to handle this with giant language fashions and screenshot evaluation, however they remained restricted by shallow system integration and clunky consumer experiences. UFO2 flips this mannequin by constructing from the OS upward. It introduces a multiagent structure the place a central HostAgent coordinates specialised AppAgents for various functions. Every agent speaks the native language of the app through APIs and UI metadata, not simply pixels.

You might also like

What Is Hashing? – Dataconomy

“Scientific poetic license?” What do you name it when somebody is mendacity however they’re doing it in such a socially-acceptable manner that no person ever calls them on it?

How knowledge high quality eliminates friction factors within the CX

UFO2 turns your desktop into an agent playground
A comparability of (a) present CUAs and (b) desktop AgentOS UFO2 (Picture)

One in all UFO2’s key technical improvements is its hybrid motion mannequin. As a substitute of simply clicking buttons like a human, every AppAgent can name actual APIs when out there. This implies duties like exporting a spreadsheet or formatting textual content are decreased from multi-step GUI dances to a single, atomic perform name. The system additionally speculates forward—utilizing a single LLM name to plan a number of steps and validating each stay with Home windows UI knowledge. This speculative multi-action execution dramatically cuts down on latency with out risking correctness.

Keep Forward of the Curve!

Do not miss out on the newest insights, traits, and evaluation on the earth of knowledge, expertise, and startups. Subscribe to our publication and get unique content material delivered straight to your inbox.

Isolation with out interruption

CUAs sometimes hijack your desktop, locking the mouse and keyboard throughout execution. UFO2’s Image-in-Image (PiP) mode solves this with a digital desktop window that runs automation duties in parallel. The agent does its factor in a sandboxed setting, whilst you proceed working in the principle session. It’s seamless, safe, and makes use of native Home windows RDP loopback to keep up session integrity.

UFO2 turns your desktop into an agent playground_02
An outline of the structure of UFO2 (Picture)

UFO2 integrates assist documentation and execution logs right into a retrieval-augmented reminiscence, enriching its prompts with procedural information. Over time, this creates a self-improving agent that will get higher at new duties with out retraining. Every AppAgent pulls from documentation, patch notes, and prior runs to make smarter choices. It’s an automation system with reminiscence, not simply response technology.

In head-to-head benchmarks in opposition to OpenAI’s Operator and different prime CUAs, UFO2 constantly outperforms. On the OSWorld-W benchmark, UFO2 reaches a 32.7% success charge utilizing the o1 mannequin—greater than doubling Operator’s 14.3%. Its speculative planning reduces motion steps by as much as 50%. Hybrid management detection (combining UIA APIs and imaginative and prescient parsing) recovers over 25% of beforehand failed interactions. Merely put, UFO2 isn’t simply smarter—it’s systemically higher.

All the pieces is an agent now

Extensibility is baked in. UFO2 permits third-party instruments, together with different CUAs like Operator, to be wrapped as AppAgents. This implies you possibly can combine specialised copilots or proprietary automation backends into the UFO2 ecosystem with out retraining or rewriting code. It additionally helps a client-server structure for enterprise deployment, preserving orchestration centralized and consumer units gentle.

The paper outlines future objectives, together with cross-platform compatibility with macOS and Linux through analogous accessibility APIs, sooner response through smaller LLMs, and improved reasoning from devoted GUI-interaction datasets. However even in its present state, UFO2 represents a new baseline for desktop automation. It’s open-source, already outperforming business techniques, and brings a brand new stage of modularity, reliability, and intelligence to human-computer interplay.

For anybody constructing the subsequent technology of clever brokers—or simply uninterested in brittle scripts—UFO2 is accessible on GitHub together with its documentation.


Featured picture credit score

Tags: AgentdesktopplaygroundTurnsUFO2
Previous Post

Robotic system zeroes in on objects most related for serving to people | MIT Information

Next Post

Who’s Accountable for Water Harm in Your Storage Unit?

Md Sazzad Hossain

Md Sazzad Hossain

Related Posts

What’s large information? Huge information
Data Analysis

What Is Hashing? – Dataconomy

by Md Sazzad Hossain
June 14, 2025
“Scientific poetic license?”  What do you name it when somebody is mendacity however they’re doing it in such a socially-acceptable manner that no person ever calls them on it?
Data Analysis

“Scientific poetic license?” What do you name it when somebody is mendacity however they’re doing it in such a socially-acceptable manner that no person ever calls them on it?

by Md Sazzad Hossain
June 14, 2025
How knowledge high quality eliminates friction factors within the CX
Data Analysis

How knowledge high quality eliminates friction factors within the CX

by Md Sazzad Hossain
June 13, 2025
Agentic AI 103: Constructing Multi-Agent Groups
Data Analysis

Agentic AI 103: Constructing Multi-Agent Groups

by Md Sazzad Hossain
June 12, 2025
Monitoring Information With out Turning into Massive Brother
Data Analysis

Monitoring Information With out Turning into Massive Brother

by Md Sazzad Hossain
June 12, 2025
Next Post
Who’s Accountable for Water Harm in Your Storage Unit?

Who's Accountable for Water Harm in Your Storage Unit?

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recommended

Decoding CLIP: Insights on the Robustness to ImageNet Distribution Shifts

dMel: Speech Tokenization Made Easy

March 4, 2025
The vCCAP Evo™ Answer Benefit, Half 1: Scalability and Reliability

The vCCAP Evo™ Answer Benefit, Half 1: Scalability and Reliability

May 12, 2025

Categories

  • Artificial Intelligence
  • Computer Networking
  • Cyber Security
  • Data Analysis
  • Disaster Restoration
  • Machine Learning

CyberDefenseGo

Welcome to CyberDefenseGo. We are a passionate team of technology enthusiasts, cybersecurity experts, and AI innovators dedicated to delivering high-quality, insightful content that helps individuals and organizations stay ahead of the ever-evolving digital landscape.

Recent

Ctrl-Crash: Ny teknik för realistisk simulering av bilolyckor på video

June 15, 2025
Addressing Vulnerabilities in Positioning, Navigation and Timing (PNT) Companies

Addressing Vulnerabilities in Positioning, Navigation and Timing (PNT) Companies

June 14, 2025

Search

No Result
View All Result

© 2025 CyberDefenseGo - All Rights Reserved

No Result
View All Result
  • Home
  • Cyber Security
  • Artificial Intelligence
  • Machine Learning
  • Data Analysis
  • Computer Networking
  • Disaster Restoration

© 2025 CyberDefenseGo - All Rights Reserved

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In