Quick Start Guide for Sentius Copilot+ and Browser Agent

This guide will walk you through the steps to quickly integrate the Sentius Copilot+ Browser Agent API into your application. After completing this tutorial, you'll be able to control a web browser to perform tasks on a specific web page using the Browser Agent API.

Prerequisites

Before starting, ensure you have the following:

Sentius Copilot+ Installed
Download and install the Sentius Copilot+ application. The app interacts with a Chromium-based browser and must be installed on the machine where the tasks will be executed.
Join the Waitlist
Sign up for the Sentius Teach & Repeat Platform in the developer role by clicking Join Waitlist when running the Sentius Copilot+ Application.
API Key
Obtain an API Key from your Sentius Copilot+ application by navigating to Settings API.

Step 1: Start a Dialog Session

A dialog session represents a conversation with the Browser Agent. You must create or retrieve a dialog session before sending tasks.

Create a New Dialog Session

If no session exists, create a new one by using the POST request:

curl -X POST "https://api.sentius.ai/dialog_sessions?api_key=<your_api_key>" \
     -H "Content-Type: application/json"

import requests

url = "https://api.sentius.ai/dialog_sessions"
params = {"api_key": "<your_api_key>"}
response = requests.post(url, params=params)
print(response.json())

After creating or retrieving a session, you'll receive a <dialog_session_id>, which you’ll use for future interactions.

Retrieve an Active Dialog Session

To retrieve an existing session, use the following GET request:

curl -X GET "https://api.sentius.ai/dialog_sessions?api_key=<your_api_key>" \
     -H "Content-Type: application/json"

import requests

url = "https://api.sentius.ai/dialog_sessions"
params = {"api_key": "<your_api_key>"}
response = requests.get(url, params=params)
print(response.json())

Step 2: Send a Task to the Browser Agent

With an active dialog session, you can now send a task for the Browser Agent to perform. Here's an example of sending a task to open a web page and perform an action.

Send a Task to Open a Web Page

Use the following POST request to send a message with the task. You can specify the task in the text field and (optional) choose an instruction for handling the task in the <instruction_id> field.

curl -X POST "https://api.sentius.ai/dialog_sessions/<dialog_session_id>/chat?api_key=<your_api_key>" \
     -H "Content-Type: application/json" \
     -d '{
       "text": "Open https://example.com and click on the sign-up button",
       "instruction_id": "AeOfd32"
     }'

import requests

url = f"https://api.sentius.ai/dialog_sessions/<dialog_session_id>/chat"
params = {"api_key": "<your_api_key>"}
data = {
    "text": "Open https://example.com and click on the sign-up button",
    "instruction_id": "AeOfd32"  # Optional, specify an instruction if needed
}
response = requests.post(url, json=data, params=params)
print(response.json())

In this example, the task is to open a specific web page (https://example.com) and click a button. The <instruction_id> is optional, but if specified, the agent will use a pre-recorded instruction to complete the task.

Step 3: Monitor the Task Progress and Retrieve Session History

You can retrieve the dialog session history to check how the agent performed the task.

Retrieve Session History

To fetch the history of a particular dialog session:

curl -X GET "https://api.sentius.ai/dialog_sessions/<dialog_session_id>/history?api_key=<your_api_key>" \
     -H "Content-Type: application/json"

import requests

url = f"https://api.sentius.ai/dialog_sessions/<dialog_session_id>/history"
params = {"api_key": "<your_api_key>"}
response = requests.get(url, params=params)
print(response.json())

Step 4: Customize the Task with Instructions (Optional)

You can also record custom instructions in the Sentius Copilot+ application to handle specific tasks in a predefined manner.

Record and Use Instructions

To record an instruction:

Open the Sentius Copilot+ app and go to the "Library" tab.
Click on the "Record" button to start recording a custom instruction.

After recording, you can obtain the Instruction ID and use it in your API calls by specifying it in the <instruction_id> field.

Using a Particular Instruction in the Task

Here's how to specify a particular instruction in your API request:

import requests

url = f"https://api.sentius.ai/dialog_sessions/<dialog_session_id>/chat"
params = {"api_key": "<your_api_key>"}
data = {
    "text": "Open https://example.com and perform the custom task",
    "instruction_id": "<Instruction_id>"  # Replace with your custom instruction ID
}
response = requests.post(url, json=data, params=params)
print(response.json())

Conclusion

You have now set up your Browser Agent to perform tasks on a web page. By integrating with the Sentius Copilot+ platform, you can extend this functionality to automate complex workflows, manage sessions, and reuse instructions for repetitive tasks.

For additional details, check out the full documentation on Dialog Sessions and Instructions.

Quick Start for AI Agents