Skip to content
Agent

Creating Custom Third-Party Agents - Complete Tutorial

by microsoft

AI Summary

A comprehensive tutorial for creating, registering, and deploying custom third-party agents that extend UFO² automation capabilities beyond Windows GUI automation. Developers building domain-specific automation agents (hardware control, Linux CLI, web automation, IoT) will find this invaluable for understanding the complete agent architecture and implementation workflow.

Install

Copy this and paste it into Claude Code, Cursor, or any AI assistant:

I want to set up the "Creating Custom Third-Party Agents - Complete Tutorial" agent in my project.

Please run this command in my terminal:
# Add AGENTS.md to your project root
curl --retry 3 --retry-delay 2 --retry-all-errors -o AGENTS.md "https://raw.githubusercontent.com/microsoft/UFO/main/documents/docs/tutorials/creating_third_party_agents.md"

Then explain what the agent does and how to invoke it.

Description

UFO³: Weaving the Digital Agent Galaxy

Architecture Overview

Third-party agents integrate with UFO² through a well-defined architecture: `mermaid graph TB HostAgent["<b>HostAgent</b><br/>- Orchestrates all agents<br/>- Registers third-party agents as selectable targets<br/>- Routes tasks to appropriate agents"] AppAgent["<b>AppAgent</b><br/>(GUI tasks)"] HardwareAgent["<b>HardwareAgent</b><br/>(Hardware)"] YourAgent["<b>YourAgent</b><br/>(Custom)"] Strategies["<b>Processing Strategies</b><br/>- LLM Interaction<br/>- Action Execution<br/>- Memory Updates"] HostAgent --> AppAgent HostAgent --> HardwareAgent HostAgent --> YourAgent AppAgent --> Strategies HardwareAgent --> Strategies YourAgent --> Strategies style HostAgent fill:#e1f5ff,stroke:#0288d1,stroke-width:2px style AppAgent fill:#f3e5f5,stroke:#9c27b0,stroke-width:2px style HardwareAgent fill:#fff3e0,stroke:#ff9800,stroke-width:2px style YourAgent fill:#e8f5e9,stroke:#4caf50,stroke-width:2px style Strategies fill:#fce4ec,stroke:#e91e63,stroke-width:2px `

Creating Custom Third-Party Agents - Complete Tutorial

This tutorial teaches you how to create, register, and deploy custom third-party agents that extend UFO²'s capabilities beyond Windows GUI automation. You'll learn the complete process using HardwareAgent as a reference implementation. Prerequisites: Basic Python knowledge, familiarity with UFO² agent architecture, Agent Configuration, and Third-Party Configuration. ---

Table of Contents

• Overview • Understanding Third-Party Agents • Step-by-Step Implementation • Complete Example: HardwareAgent • Registering with HostAgent • Configuration and Deployment • Best Practices • Troubleshooting ---

What are Third-Party Agents?

Third-party agents are specialized agents that extend UFO²'s capabilities to handle tasks beyond standard Windows GUI automation. They work alongside the core agents (HostAgent and AppAgent) to provide domain-specific functionality. Key Characteristics: • ✅ Independent agent implementation with custom logic • ✅ Registered and managed by HostAgent • ✅ Selectable as execution targets by the LLM • ✅ Can use MCP servers and custom tools • ✅ Configurable via YAML files Common Use Cases: • 🔧 Hardware Control: Physical device manipulation (HardwareAgent) • 🐧 Linux CLI: Server and CLI command execution (LinuxAgent) • 🌐 Web Automation: Browser-based tasks without GUI • 📡 IoT Integration: Smart device control • 🤖 Robotic Process Automation: Custom automation workflows ---

Discussion

0/2000
Loading comments...

Health Signals

MaintenanceCommitted 25d ago
Active
Adoption1K+ stars on GitHub
8.2k ★ · Popular
DocsMissing or thin
Undocumented

GitHub Signals

Stars8.2k
Forks1.0k
Issues50
Updated25d ago
View on GitHub
MIT License

My Fox Den

Community Rating

Sign in to rate this booster

Works With

Claude Code
Claude.ai