Traditional Robotic Process Automation (RPA) has a well-known secret: it’s fragile. For years, businesses have built automation bots that rely on screen scraping and rigid, rules-based logic. These bots work perfectly—until a button on a website moves, a form field changes, or an invoice arrives in a slightly different format. The result? Broken workflows, manual interventions, and a high-maintenance process that often negates the promise of automation.
The core of this fragility lies in RPA's historical inability to handle unstructured data. Invoices, emails, contracts, and customer support tickets don't follow a neat, predictable structure. This is where modern AI, specifically Large Language Models (LLMs), changes everything.
At rpa.do, we believe the future of automation isn't just about mimicry; it's about comprehension. By integrating the power of LLMs directly into our developer-first, API-driven platform, we empower businesses to build intelligent AI agents that don't just see data—they understand it.
Why do traditional bots crumble when faced with slight variations? Because they are programmed to look for data in specific places.
This is the "unstructured data wall." It represents the massive portion of business information that has, until now, remained stubbornly resistant to automation. Processes like invoice processing, sales order entry from emails, and customer ticket routing have required human cognition to interpret context and variation. Attempting to automate them with old-school RPA led to a web of complex rules and constant break-fixes.
Large Language Models (like the technology behind ChatGPT) are designed to understand context, semantics, and intent within human language and documents. They don't need fixed templates. You can provide an LLM with a document and ask a question in plain English.
The LLM will parse the entire document, identify phrases like "Total Due," "Amount Payable," or "Balance," and intelligently extract the correct figure, regardless of its position.
This is the fundamental shift that rpa.do brings to life. Our agents are AI-native. Instead of relying on brittle UI interactions, you define robust workflows as code that leverage LLMs to handle the messy, unstructured parts of a process. This 'Business-as-Code' approach makes your automations more reliable, scalable, and easy to maintain.
Let's see what this looks like in practice. Traditional RPA would require a clunky visual designer and dozens of steps to record clicks and screen scrapes. With rpa.do, you define the entire intelligent workflow with a simple, clean API call.
Imagine you want to automate processing invoices that arrive as PDF attachments in an email inbox.
import { Agent } from '@do/rpa';
// Create an agent to handle accounts payable tasks
const accountsPayableAgent = new Agent('ap-processor-agent');
// Define and run the intelligent workflow
async function processNewInvoices() {
const result = await accountsPayableAgent.run({
workflow: 'extract_invoice_data_and_create_bill',
// 1. Source: Watch an email inbox for new invoices
source: {
type: 'email_inbox',
id: 'invoices@company.com',
filter: { has_attachment: '.pdf' }
},
// 2. Transform: Use an AI-powered step to understand the PDF
transform: {
type: 'llm_extract',
prompt: `From the attached document, extract the following as a JSON object:
invoiceNumber, vendorName, dueDate, and totalAmount.`
},
// 3. Destination: Use the structured data to create a bill in QuickBooks
destination: {
type: 'quickbooks_api',
action: 'create_bill',
// Map the clean data extracted by the LLM
map_fields: {
vendor: '{{transform.vendorName}}',
dueDate: '{{transform.dueDate}}',
amount: '{{transform.totalAmount}}'
}
}
});
console.log(`Automation run complete. ${result.processed} invoices processed.`);
}
runNewInvoices();
In this example, the transform step is where the magic happens. The rpa.do agent securely passes the PDF to an LLM, which "reads" the document and returns clean, structured JSON. This predictable output can then be reliably used in the next step of the automation—in this case, creating a bill via the QuickBooks API.
The automation is no longer dependent on the invoice's layout. It’s dependent on the LLM's ability to understand the document's content, making it vastly more resilient.
Adopting an AI-native, 'Business-as-Code' approach with rpa.do delivers transformative benefits:
The era of fragile, screen-scraping RPA is over. The future of workflow automation is intelligent, resilient, and defined as code. By embedding the comprehension power of LLMs into a seamless, API-driven framework, rpa.do provides the tools for developers and enterprises to build the next generation of software robots.
Ready to automate anything? Explore the rpa.do platform and start deploying intelligent agents today.