Last updated: April 5, 2026 · Core Concepts · by Daniel Ashford

What is AI Agent?

QUICK ANSWER

An LLM that can autonomously plan, use tools, and take actions to complete complex tasks.

Definition

An AI agent is a system built around a language model that can autonomously plan actions, use external tools, make decisions, and iterate to accomplish complex goals. Unlike a simple chatbot, an agent breaks down complex tasks into subtasks.

How It Works

Agents combine LLM reasoning with tool use, memory, and planning. Examples include coding agents that write, test, and debug code autonomously, and research agents that search, synthesize, and produce reports. Agent performance is measured by SWE-bench and Terminal-Bench.

Example

A coding agent given "Fix the login bug": (1) reads error logs, (2) identifies relevant files, (3) writes a fix, (4) runs tests, (5) iterates if tests fail, (6) creates a pull request when passing.

Related Terms

Function Calling / Tool Use

The ability of LLMs to invoke external tools, APIs, and databases.

Chain-of-Thought (CoT)

A prompting technique that asks the model to show its reasoning step by step.

See How Models Compare

Understanding ai agent is important when choosing the right AI model. See how 12 models compare on our leaderboard.

View Leaderboard →Our Methodology

← Browse all 47 glossary terms

Daniel Ashford

Founder & Lead Evaluator · 200+ models evaluated