rbtfl.

AI Agents

US-led category of startups building autonomous software systems that execute multi-step enterprise tasks, now the fastest-growing venture segment globally as of mid-2026.

Startups·AI· ·3 takes ·
post

What it is

AI agent startups build software systems that autonomously execute multi-step tasks on behalf of a user or an enterprise. Unlike earlier AI tools that respond to a single prompt and stop, agents take actions: browsing the web, writing and running code, routing customer support tickets, filing legal documents, or coordinating other AI tools in a pipeline. The category divides into two types: vertical specialists targeting a single industry (legal, customer service, software engineering, finance) and infrastructure providers building orchestration frameworks, memory layers, and multi-agent coordination systems. The primary market is United States enterprise software, though sovereign capital from Saudi Arabia, Singapore, and the United Arab Emirates has entered the sector alongside traditional US venture firms. A parallel branch, embodied agents, funds startups that extend agentic reasoning into physical robots.

History

Open-source experiments in 2023, notably AutoGPT (released April 2023), were the first widely circulated attempts to chain a large language model into a loop of self-directed actions. They were unreliable. Reliability improved sharply from 2024 onward as OpenAI's GPT-4 and Anthropic's Claude 3 series enabled agents to handle longer-horizon tasks without losing coherence. Vertical specialists raised their first large rounds in 2024: Harvey (United States legal AI) and Sierra (United States enterprise customer service) both crossed US$100 million in funding that year. By mid-2025, enterprise procurement of AI agents had moved from innovation-budget pilots to recurring IT line items, with deployment spreading across the United States, Europe, and Japan.

Current state

As of mid-2026, AI agents are the fastest-growing category in global enterprise software. Stanford University's 2026 AI Index documents that the best agent models complete roughly 66% of real-world computer tasks, up from 12% eighteen months earlier, approaching human performance on standardised benchmarks. United States private AI investment reached US$285.9 billion in 2025, more than 23 times China's US$12.4 billion. In Q1 2026, global venture funding to AI companies hit US$242 billion, 80% of all venture capital deployed that quarter, though the majority went to foundation-model labs rather than application-layer agent companies. Andreessen Horowitz's April 2026 enterprise analysis found 29% of Fortune 500 companies live, paying customers of leading AI startups, with coding, customer support, and enterprise search the three dominant deployments. Cognition AI (United States, coding agents) grew annual recurring revenue from US$37 million in May 2025 to US$492 million by May 2026. Harvey, the United States legal-AI agent, reached US$200 million ARR by early 2026.

Relationships

AI agent companies depend on a small number of United States foundation model providers for reasoning capability: OpenAI, Anthropic, Google DeepMind, and Meta's open-weight Llama series supply the engines inside most commercial agents. Andreessen Horowitz data show 81% of enterprises now run three or more model families in production, reflecting the multi-vendor reality of the market. Nvidia supplies the underlying compute and holds equity in multiple AI infrastructure layers that agents depend on. Y Combinator's Spring 2026 batch (196 startups, June 2026) was heavily weighted toward AI-agent verticals, with the hottest companies commanding seed valuations above US$175 million. The embodied branch is drawing the largest individual raises: General Intuition (United States) raised US$320 million in June 2026 to train action foundation models on gameplay footage, and Generalist AI (United States) raised US$400 million the same month for general-purpose robot foundation models, where its GEN-1 model reached 99% task success on dexterous-manipulation benchmarks. China is building parallel agentic systems around Alibaba's Qwen and Baidu's ERNIE model families.

What to watch

Reliability is the central constraint: at 66% task success, agents fail roughly one in three structured tasks, limiting autonomous deployment in regulated industries such as finance, healthcare, and law where errors carry legal liability. Inference pricing is compressing as open-weight models close quality gaps with closed ones, squeezing application-layer margins and pressing vertical-agent companies to build proprietary data or workflow moats. Agent-to-agent coordination, where multiple specialised agents hand off tasks across a pipeline, has drawn a new wave of infrastructure companies. United States regulatory attention on autonomous AI systems is rising, with questions about liability when an agent acts on incorrect data or executes an irreversible action. The central investor question for the second half of 2026 is whether vertical-agent companies can hold venture-scale multiples as the category matures from novelty toward commodity workflow infrastructure.

The briefing, by email