Open Agent Studio - Open Agent Studio

Open Agent Studio is the first cross platform desktop application built to enable Agentic Process Automation as an open-source alternative and replacement for UIpath and all other RPA tools today.

Download for Windows and Mac at https://openagent.studio.

Origin

We founded Cheat Layer during the pandemic to help people who had lost their jobs and businesses rebuild online using GPT-3. We personally helped hundreds pro bono before turning to AI for help.

In August 2021, we were the first startup to get approved by openAI to sell GPT-3 for automation. We were the first to publish our framework for agents, Project Atlas, in July 2022.

Long-term Vision

Businesses today are threatening to fire employees and replace them with AI. We believe within 2 years, the AI necessary to generate most of those businesses will be a commodity.

In a future where the AI in your pocket can generate custom, secure, and free versions of all the most expensive business software, we believe there’ll be a level playing field that removes the barrier to build these businesses. More small businesses will build competitive brands through personal relationships, better quality service, and network effects through unique data. We invite you to join us in building this future together faster.

Agentic Process Automation

Semantic Targets

Semantic targets distils the underlying intent of a target to english, so they still work even if services completely change their designs. This enables building robust, future-proof, agents.

Semantic targets can be dynamic or robust based on how strict the language is.

Semantic Triggers

Reasoning Based Targets

Traditional RPA tools rely heavily on code selectors and computer vision to interact with websites. These selectors are brittle and break whenever the target website updates its design, requiring manual code adjustments.

Open Agent Studio eliminates this issue by using Large Language Models (LLMs) to interpret semantic targets. This allows you to describe what you want to automate in simple English. For example, instead of writing complex code to click a specific button, you can tell Open Agent Studio to “click the button that says ‘Submit’“.

This approach enables robust and adaptable automations that withstand website design changes, making your agents significantly more efficient and reliable.

Open Agent Studio offers a comprehensive suite of features to empower anyone, regardless of their technical skills, to build powerful RPAs.

Windows Install Instructions

Download and extract cheatlayer.zip
Double click cheatlayer.exe to open and click “more info” and “run anyway” if it asks.
Enter your Cheat Layer API key you can find at the bottom of the billing page at cheatlayer.com/billing.

Mac Install Instructions

Under “Privacy and Security” in your settings, add “terminal” to “Accessability”, “Screen Recording”, and “Full Disk Access”.
Double click the unzipped app—it will close.
Open Privacy and Security “General” tab and click “run anyway” next to Cheat Layer.

Linux Install Instructions

Download main.zip
Extract main.zip
Set ENV variable CHEAT_KEY with your cheatlayer.com/billing API key .
./main to run

1.1 Semantic Targeting

Define actions and targets using plain English commands.
Eliminate the need for complex code selectors.
Examples:
- “Click the button labeled ‘Login’"
- "Extract the text from the table header"
- "Enter {{ my_variable }} into the search field”

1.2 Node-Based Visual Workflow

Build automations through an intuitive drag-and-drop interface.
Utilize various node types representing distinct actions like clicks, keypresses, data extraction, data processing, and web interactions.
Connect nodes to define the flow and logic of your automation.

1.3 LLMs for Enhanced Understanding

Understand your semantic targets and translate them into specific actions.
Dynamically adapt to website changes.
Automate tasks on websites without building custom selectors for each element.

1.4 Variables and Data Management

Define variables to store data and dynamically use it throughout your automation.
Manage data from various sources, including web scraping, files (PDF, Docx, CSV, Excel), and API calls.

1.5 Scheduling and Triggers

Schedule your agents to run automatically at specific times or intervals using a visual cron scheduler.
Trigger agents based on:
- Scheduled events (specific times, intervals)
- Emails (specific content, sender)
- Webhooks (external triggers from other applications)
- SMS messages (specific content, sender)
- Phone calls (initiate automations based on voice prompts)
- Semantic Triggers (initiate automations based on specific screen states)

Upload your completed agents to the Cheat Layer community to share with others.
Access a library of pre-built agents for various tasks, saving you development time.

Advanced Features

Open Agent Studio provides a number of advanced features for building sophisticated and robust automations.

2.1 Looping

Implement looping mechanisms to iteratively execute actions on multiple targets.
Loop Total Runs: Executes a section of the workflow a predetermined number of times.
Loop Node Runs: Executes a section of the workflow based on the number of times a specific node has executed.
- Useful for processing lists of data, scraping multiple pages on a website, or sending out batch emails.

2.2 Conditional Logic

Use “If Else” nodes to create branching paths in your automation based on specific conditions.
Conditions can be based on:
- Variable values
- Comparisons (equals, greater than, less than)
- Regular expressions
This enables dynamic execution of actions based on data or website state changes.

2.3 Python Integration

Embed custom Python code blocks within your automation using the “Python Code” node.
Access Open Agent Studio’s API through the provided thread_signals object within your code to:
- Send notifications (thread_signals.notifications.emit)
- Trigger internal actions (thread_signals.chat.emit)
- Interact with global variables (graph.global_variables)
This allows you to:
- Implement complex logic not possible with pre-built nodes
- Integrate with external libraries and services
- Perform advanced data manipulations

2.4 Web Scraping with “Magic Scraper”

Efficiently extract data from websites using various “Magic Scraper” configurations.

Options:

Target specific HTML elements (links, images, text)
Filter based on text content and attributes
Handle dynamic content
Use variables to dynamically define targets

Extracted data can be:

Stored in variables
Used for further processing within the automation
Exported to external services like Google Sheets

2.5 Semantic Element Detection

Combine “SemanticDescribe” and “Click” nodes for robust element interaction.

”SemanticDescribe” captures a target based on your description, even if its visual appearance changes.
”Click” then uses the captured target for precise interaction.
This method improves automation resilience against website design updates.

2.6 Data Processing with GPT-4

Leverage GPT-4’s capabilities to:

Summarize large amounts of text
Analyze and extract insights from data
Translate languages
Generate text content
Build conversational agents

Use variables to pass data between nodes and GPT-4 prompts.

2.7 API Integrations

Connect with external platforms and services through API calls using the “Send Data” and “Python Code” nodes.

Automate processes involving:

Data Retrieval
Workflow Triggering
Real-time Data Exchange

2.8 Agent Management

Easily manage your agents through the intuitive interface.

Features:

Create, edit, and delete agents.
Save and open automation files (.cheat).
Version control for tracking changes.
Duplicate existing agents for quick customization.

2.9 Global Variables

Utilize global variables to store data and share it across different nodes within your automation.
Access variables through the dropdown menus in node properties.
Update variable values dynamically during automation execution.

Get Started

​Origin

​Long-term Vision

​Agentic Process Automation

​Semantic Targets

​Semantic Triggers

​Reasoning Based Targets

​Windows Install Instructions

​Mac Install Instructions

​Linux Install Instructions

​1.1 Semantic Targeting

​1.2 Node-Based Visual Workflow

​1.3 LLMs for Enhanced Understanding

​1.4 Variables and Data Management

​1.5 Scheduling and Triggers

​1.6 Agent Sharing and Community

​Advanced Features

​2.1 Looping

​2.2 Conditional Logic

​2.3 Python Integration

​2.4 Web Scraping with “Magic Scraper”

​2.5 Semantic Element Detection

​2.6 Data Processing with GPT-4

​2.7 API Integrations

​2.8 Agent Management

​2.9 Global Variables