Overview of agents in Mastra, detailing their capabilities and how they interact with tools, workflows, and external systems.
Source: https://mastra.ai/docs/agents/overview
Agents use LLMs and tools to solve open-ended tasks. They reason about goals, decide which tools to use, retain conversation memory, and iterate internally until the model emits a final answer or an optional stop condition is met. Agents produce structured responses you can render in your UI or process programmatically. Use agents directly or compose them into workflows or agent networks.
Watch an introduction to agents, and how they compare to workflows, on YouTube (7 minutes).
You can set up an agent with either Mastra's model router or the Vercel AI SDK.

Using the model router:
- Add the Mastra core package to your project:

  ```bash
  npm install @mastra/core
  ```
- Mastra's model router auto-detects environment variables for your chosen provider. For OpenAI, set `OPENAI_API_KEY`:

  .env

  ```bash
  OPENAI_API_KEY=
  ```

  Note: Mastra supports more than 600 models. Choose from the full list.
- Create an agent by instantiating the `Agent` class with system instructions and a model:

  src/mastra/agents/test-agent.ts

  ```typescript
  import { Agent } from "@mastra/core/agent";

  export const testAgent = new Agent({
    name: "test-agent",
    instructions: "You are a helpful assistant.",
    model: "openai/gpt-4o-mini",
  });
  ```
Using the Vercel AI SDK:

- Include the Mastra core package alongside the Vercel AI SDK provider you want to use:

  ```bash
  npm install @mastra/core @ai-sdk/openai
  ```
- Set the corresponding environment variable for your provider. For OpenAI via the AI SDK:

  .env

  ```bash
  OPENAI_API_KEY=
  ```

  Note: See the AI SDK Providers in the Vercel AI SDK docs for additional configuration options.
- To create an agent in Mastra, use the `Agent` class. Every agent must include `instructions` to define its behavior and a `model` parameter to specify the LLM provider and model. When using the Vercel AI SDK, provide the client to your agent's `model` field:

  src/mastra/agents/test-agent.ts

  ```typescript
  import { openai } from "@ai-sdk/openai";
  import { Agent } from "@mastra/core/agent";

  export const testAgent = new Agent({
    name: "test-agent",
    instructions: "You are a helpful assistant.",
    model: openai("gpt-4o-mini"),
  });
  ```
Instructions define the agent's behavior, personality, and capabilities. They are system-level prompts that establish the agent's core identity and expertise.
Instructions can be provided in multiple formats for greater flexibility. The examples below illustrate the supported shapes:
```typescript
// String (most common)
instructions: "You are a helpful assistant.";

// Array of strings
instructions: [
  "You are a helpful assistant.",
  "Always be polite.",
  "Provide detailed answers.",
];

// Array of system messages
instructions: [
  { role: "system", content: "You are a helpful assistant." },
  { role: "system", content: "You have expertise in TypeScript." },
];
```
Each model provider also exposes additional options, including prompt caching and reasoning configuration. We provide a `providerOptions` field to manage these. You can set `providerOptions` at the instruction level to apply a different caching strategy per system instruction/prompt.
```typescript
// With provider-specific options (e.g., caching, reasoning)
instructions: {
  role: "system",
  content:
    "You are an expert code reviewer. Analyze code for bugs, performance issues, and best practices.",
  providerOptions: {
    openai: { reasoningEffort: "high" }, // OpenAI's reasoning models
    anthropic: { cacheControl: { type: "ephemeral" } }, // Anthropic's prompt caching
  },
}
```
See the Agent reference doc for more information.
Register your agent in the Mastra instance to make it available throughout your application. Once registered, it can be called from workflows, tools, or other agents, and has access to shared resources such as memory, logging, and observability features:
src/mastra/index.ts
```typescript
import { Mastra } from "@mastra/core/mastra";
import { testAgent } from "./agents/test-agent";

export const mastra = new Mastra({
  // ...
  agents: { testAgent },
});
```
You can call agents from workflow steps, tools, or the Mastra Client. Get a reference by calling `.getAgent()` on your `mastra` or `mastraClient` instance, depending on your setup:
```typescript
const testAgent = mastra.getAgent("testAgent");
```
Note: `mastra.getAgent()` is preferred over importing an agent directly into your file, since it provides access to the Mastra instance configuration (logger, telemetry, storage, registered agents, and vector stores).
The `mastra` instance is passed as an argument to a workflow step's `execute` function. It provides access to registered agents via `getAgent()`. Use this method to retrieve your agent, then call `generate()` with a prompt.
```typescript
const step1 = createStep({
  // ...
  execute: async ({ mastra }) => {
    const agent = mastra.getAgent("testAgent");
    const response = await agent.generate("Help me organize my day");
    console.log(response.text);
  },
});
```
The `mastra` instance is available within a tool's `execute` function. Use `getAgent()` to retrieve a registered agent and call `generate()` with a prompt.
```typescript
export const testTool = createTool({
  // ...
  execute: async ({ mastra }) => {
    const agent = mastra.getAgent("testAgent");
    const response = await agent.generate("Help me organize my day");
    console.log(response.text);
  },
});
```
You can interact with a registered agent by sending a POST request to your Mastra application's `/generate` endpoint. Include a `messages` array of role/content pairs:
```bash
curl -X POST http://localhost:4111/api/agents/testAgent/generate \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      { "role": "user", "content": "Help me organize my day" }
    ]
  }' | jq -r '.text'
```
Example output
```text
1. What time do you plan to start your day?
2. Do you have any specific tasks or appointments scheduled for today?
3. Are there any personal goals or activities you want to include (e.g., exercise, reading, hobbies)?
4. How much time do you want to allocate for work versus personal time?
5. Do you have any deadlines or priorities that need to be addressed?
```
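If you prefer a typed client over raw HTTP, the same request can be made through the Mastra Client SDK. The following is a minimal sketch, assuming `@mastra/client-js` is installed and your Mastra server is running on the default port 4111:

```typescript
import { MastraClient } from "@mastra/client-js";

// Sketch only: baseUrl assumes the default local dev server.
const client = new MastraClient({ baseUrl: "http://localhost:4111" });

// Get a reference to the registered agent and send the same messages as the curl example.
const agent = client.getAgent("testAgent");
const response = await agent.generate({
  messages: [{ role: "user", content: "Help me organize my day" }],
});

console.log(response.text);
```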
Agents can return results in two ways: generating the full output before returning it, or streaming tokens in real time. Choose the approach that fits your use case: generate for short, internal responses or debugging, and stream to start delivering output to end users as quickly as possible.
Using `generate()`:
Pass a single string for simple prompts, an array of strings when providing multiple pieces of context, or an array of message objects with `role` and `content`.
(The `role` defines the speaker for each message. Typical roles are `user` for human input, `assistant` for agent responses, and `system` for instructions.)
```typescript
const response = await testAgent.generate([
  { role: "user", content: "Help me organize my day" },
  { role: "user", content: "My day starts at 9am and finishes at 5.30pm" },
  { role: "user", content: "I take lunch between 12:30 and 13:30" },
  {
    role: "user",
    content: "I have meetings Monday to Friday between 10:30 and 11:30",
  },
]);

console.log(response.text);
```
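For reference, the two simpler input shapes described above look like the following sketch, reusing `testAgent` from earlier (the prompts are illustrative):

```typescript
// A single string for a simple prompt
const simple = await testAgent.generate("Help me organize my day");

// An array of strings when providing multiple pieces of context
const withContext = await testAgent.generate([
  "Help me organize my day",
  "My day starts at 9am and finishes at 5.30pm",
]);

console.log(simple.text);
console.log(withContext.text);
```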
Using `stream()`:

The same prompt formats apply when streaming: a single string, an array of strings, or an array of message objects with `role` and `content`.
```typescript
const stream = await testAgent.stream([
  { role: "user", content: "Help me organize my day" },
  { role: "user", content: "My day starts at 9am and finishes at 5.30pm" },
  { role: "user", content: "I take lunch between 12:30 and 13:30" },
  {
    role: "user",
    content: "I have meetings Monday to Friday between 10:30 and 11:30",
  },
]);

for await (const chunk of stream.textStream) {
  process.stdout.write(chunk);
}
```
When streaming responses, the `onFinish()` callback runs after the LLM finishes generating its response and all tool executions are complete.
It provides the final text, execution steps, `finishReason`, token `usage` statistics, and other metadata useful for monitoring or logging.
const stream = await testAgent.stream("Help me organize my day", { onFinish: ({ steps, text, finishReason, usage }) => { console.log({ steps, text, finishReason, usage }); },});for await (const chunk of stream.textStream) { process.stdout.write(chunk);}
See .generate() or .stream() for more information.
Agents can return structured, type-safe data by defining the expected output using either Zod or JSON Schema. We recommend Zod for better TypeScript support and developer experience. The parsed result is available on `response.object`, allowing you to work directly with validated and typed data.
Define the output shape using Zod:
import { z } from "zod";const response = await testAgent.generate( [ { role: "system", content: "Provide a summary and keywords for the following text:", }, { role: "user", content: "Monkey, Ice Cream, Boat", }, ], { structuredOutput: { schema: z.object({ summary: z.string(), keywords: z.array(z.string()), }), }, },);console.log(response.object);
Use the `model` property of `structuredOutput` to ensure that your agent can still execute multi-step LLM calls with tool calling:
```typescript
import { z } from "zod";

const response = await testAgentWithTools.generate(
  [
    {
      role: "system",
      content: "Provide a summary and keywords for the following text:",
    },
    {
      role: "user",
      content: "Please use your test tool and let me know the results",
    },
  ],
  {
    structuredOutput: {
      schema: z.object({
        summary: z.string(),
        keywords: z.array(z.string()),
      }),
      model: "openai/gpt-4o",
    },
  },
);

console.log(response.object);
console.log(response.toolResults);
```
By default, `structuredOutput` uses `response_format` to pass the schema to the model provider. If the model provider does not natively support `response_format`, this may error or fail to give the desired results. To keep using the same model, use `jsonPromptInjection` to bypass `response_format` and inject a system prompt message that coerces the model to return structured output.
```typescript
import { z } from "zod";

const response = await testAgentThatDoesntSupportStructuredOutput.generate(
  [
    {
      role: "system",
      content: "Provide a summary and keywords for the following text:",
    },
    {
      role: "user",
      content: "Monkey, Ice Cream, Boat",
    },
  ],
  {
    structuredOutput: {
      schema: z.object({
        summary: z.string(),
        keywords: z.array(z.string()),
      }),
      jsonPromptInjection: true,
    },
  },
);

console.log(response.object);
```
Agents can use tools to go beyond language generation, enabling structured interactions with external APIs and services. Tools allow agents to access data and perform clearly defined operations in a reliable, repeatable way.
src/mastra/agents/test-agent.ts
```typescript
export const testAgent = new Agent({
  // ...
  tools: { testTool },
});
```
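If you have not yet defined `testTool`, a minimal tool might look like the sketch below. The id, schemas, and behavior here are illustrative and not part of this guide's example:

```typescript
import { createTool } from "@mastra/core/tools";
import { z } from "zod";

// Hypothetical tool: returns the current date and time so the agent can reason about scheduling.
export const testTool = createTool({
  id: "test-tool",
  description: "Returns the current date and time",
  inputSchema: z.object({}),
  outputSchema: z.object({ now: z.string() }),
  execute: async () => {
    return { now: new Date().toISOString() };
  },
});
```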
See Using Tools for more information.
Use `RuntimeContext` to access request-specific values. This lets you conditionally adjust behavior based on the context of the request.
src/mastra/agents/test-agent.ts
```typescript
export type UserTier = {
  "user-tier": "enterprise" | "pro";
};

export const testAgent = new Agent({
  // ...
  model: ({ runtimeContext }) => {
    const userTier = runtimeContext.get("user-tier") as UserTier["user-tier"];
    return userTier === "enterprise"
      ? openai("gpt-4o-mini")
      : openai("gpt-4.1-nano");
  },
});
```
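To supply the value at request time, the caller typically builds a `RuntimeContext`, sets the key the agent reads, and passes it along with the call. A minimal sketch, assuming `RuntimeContext` is exported from `@mastra/core/runtime-context` and `mastra` is your configured instance:

```typescript
import { RuntimeContext } from "@mastra/core/runtime-context";
import { mastra } from "../index"; // hypothetical path to your Mastra instance

// Mark this request as coming from an enterprise user so the model function selects the enterprise model.
const runtimeContext = new RuntimeContext();
runtimeContext.set("user-tier", "enterprise");

const agent = mastra.getAgent("testAgent");
const response = await agent.generate("Help me organize my day", {
  runtimeContext,
});

console.log(response.text);
```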
See Runtime Context for more information.
The `maxSteps` parameter controls the maximum number of sequential LLM calls an agent can make. Each step includes generating a response, executing any tool calls, and processing the results. Limiting steps helps prevent infinite loops, reduce latency, and control token usage for agents that use tools. The default is 5, but it can be increased:
const response = await testAgent.generate("Help me organize my day", { maxSteps: 10,});console.log(response.text);
You can monitor the progress of multi-step operations using the `onStepFinish` callback. This is useful for debugging or providing progress updates to users.

`onStepFinish` is only available when streaming or generating text without structured output.
const response = await testAgent.generate("Help me organize my day", { onStepFinish: ({ text, toolCalls, toolResults, finishReason, usage }) => { console.log({ text, toolCalls, toolResults, finishReason, usage }); },});
Agents can analyze and describe images by processing both the visual content and any text within them. To enable image analysis, pass an object with `type: "image"` and the image URL in the `content` array. You can combine image content with text prompts to guide the agent's analysis.
```typescript
const response = await testAgent.generate([
  {
    role: "user",
    content: [
      {
        type: "image",
        image: "https://placebear.com/cache/395-205.jpg",
        mimeType: "image/jpeg",
      },
      {
        type: "text",
        text: "Describe the image in detail, and extract all the text in the image.",
      },
    ],
  },
]);

console.log(response.text);
```
Use Studio to test agents with different messages, inspect tool calls and responses, and debug agent behavior.
- Using Tools
- Agent Memory
- Runtime Context