Laravel AI SDK

Introduction

On the 17th of March 2026, Laravel released Laravel 13 alongside the official release of the Laravel AI SDK package. Its stated primary goal is to provide a unified, elegant interface for integrating various AI providers, such as OpenAI, Anthropic, Gemini, and Grok, into web applications; it is important to note that this list of providers is not exhaustive. What I like about the provider system is that it is not limited to the built-in providers: you can also create your own custom ones.

The approach taken in the package lets developers who already build web applications with Laravel add AI agents to those applications. The package encapsulates system instructions and tools into reusable classes and ensures that the output returned is structured, constrained by a specific JSON schema.

What can also be appreciated about the SDK is its multimodal capability. We have talked about multimodal AI before as a driver of superior decision-making, since it can combine data from images, text, video, and audio. When those data modes are related, combining them helps an AI agent produce a more accurate response than it could when limited to understanding text alone. For those who love ElevenLabs, that provider's text-to-speech is integrated as well.

The SDK is a boost for Laravel because it builds on the framework's existing strengths, such as queues, filesystems, and database management. These are precisely the features you need to build a solid knowledge base on top of the SDK, especially if you are working on a RAG (retrieval-augmented generation) setup.

AI Agent

Within the SDK, agents are primarily defined through an Agent class, which can be created with the artisan command php artisan make:agent. This class contains the system instructions, the context of a message, and the agent's tools along with the desired JSON schema. This approach allows Laravel developers to reuse the same agent logic across different parts of the app, just as we normally do with application controllers.
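As a rough sketch of what such a class might look like (the base class name, namespace, method names, and tool class here are all assumptions modelled on the behaviour described above, not confirmed SDK API):

```php
<?php

namespace App\Agents;

// Hypothetical sketch: base class and hook names are assumptions.
use Laravel\Ai\Agent;

class SupportAgent extends Agent
{
    // System instructions the agent runs with.
    public function instructions(): string
    {
        return 'You are a concise, friendly support assistant.';
    }

    // Tools this agent may call.
    public function tools(): array
    {
        return [
            new Tools\LookupOrder(), // hypothetical tool class
        ];
    }
}
```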

It is important to define your schemas because AI is non-deterministic: it does not return the same output for the same prompt input. In simple terms, identical prompts produce different AI responses unless the temperature is adjusted. In the Laravel AI SDK, as in most LLM integrations, you manage this with the temperature setting: high values (0.8 to 1.0) make the model more creative, while low values (0.1 to 0.2) make it more deterministic, so responses are more consistent and predictable because the model will almost always pick the highest-probability token. Defining JSON schemas helps structure the response in a manner that lets the AI agent execute its task accurately.
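To make the schema idea concrete, here is an illustrative JSON Schema expressed as a PHP array; exactly how the SDK accepts a schema is an assumption on our part:

```php
// Illustrative only: a JSON Schema constraining the agent's response shape,
// so a non-deterministic model still returns predictable fields.
$schema = [
    'type' => 'object',
    'properties' => [
        'summary'   => ['type' => 'string'],
        'sentiment' => [
            'type' => 'string',
            'enum' => ['positive', 'neutral', 'negative'],
        ],
    ],
    'required' => ['summary', 'sentiment'],
];
```

With a schema like this in place, the response can be decoded and validated like any other JSON payload instead of being parsed out of free-form text.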

Memory

If you have tried building AI agents before, with an SDK such as the Vercel AI SDK or any other AI tool, you will know that every prompt is processed independently: the model does not remember what you asked in previous chats, so context is lost across consecutive queries. Many AI systems solve this with what is called AI memory, which is simply a history of the previous messages within the chat window. In the Laravel AI SDK, this is managed through the RemembersConversations trait, which handles storing and retrieving message histories. On our previous projects, when building a smaller custom agent, we would store chat histories in a JSON file; Laravel uses a database instead, which gives you a scalable setup even before you need it.
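A minimal sketch of what using the trait might look like; the trait's namespace (and the base class) are guesses based on Laravel conventions, only the trait's purpose is described above:

```php
<?php

namespace App\Agents;

// Hypothetical: the namespace of the trait is an assumption.
use Laravel\Ai\Concerns\RemembersConversations;

class ChatAgent extends Agent
{
    // Persists and reloads the message history from the database,
    // so each new prompt carries the conversation's context.
    use RemembersConversations;
}
```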

Costs

We cannot avoid the fact that there are monetary and time costs involved when using AI models through an API, and those costs can skyrocket quickly. Before the stable release, the SDK had the following issues, each of which directly or indirectly impacts those costs.

Problem A: Number Of Tools

The number of tools that you provide to an AI agent directly increases your costs by increasing token consumption, a phenomenon known as "context bloat." In a live interview featuring Taylor Otwell, the creator of the Laravel framework, he mentioned that when you prompt an LLM in Laravel, you do not just send your message as you would in a traditional API call to the provider; you must also send the definitions of every tool the agent has available and a description of what those tools do. Because these definitions are sent with every single message in a conversation, a large number of tools can cause your token usage to grow rapidly. Having more than 50 tools within the same agent might therefore be too costly.

Solutions for Problem A
1. A straightforward solution is simply to limit the number of tools an agent has access to. Frankly, this comes as a blow to the AI SDK, since tools are what make an agent more capable. Imagine a scenario in which you want to expose another agent as a tool: the number of tool calls per request can grow rapidly, directly increasing the cost involved.
2. Dynamic Registration as a Solution: To mitigate these costs, the SDK allows for dynamic tool registration. You can write a logic to return only the specific tools a user needs at runtime, such as giving paid users access to a specific list of tools while keeping the toolset leaner for free users. Dynamic loading is performed through the tools method within an Agent class, which allows the programmer to determine available tools at runtime. This approach is highly flexible and can be implemented as follows:
Constructor Injection: You pass dependencies, such as a user object, directly into the agent's constructor.
Conditional Logic: Inside the tools method, you can use that injected data to conditionally return a specific array of tools. For example, you can check a user's permissions or subscription level to decide whether to provide access to certain "premium" tools.
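The two points above can be sketched together; aside from the tools method, which the SDK is described as providing, everything here (class names, the subscribed check, the tool classes) is an assumption:

```php
<?php

// Hypothetical sketch of dynamic tool registration via constructor injection.
class BillingAgent extends Agent
{
    public function __construct(private User $user)
    {
    }

    // Decide at runtime which tools to expose, keeping the set lean
    // for free users to reduce token consumption.
    public function tools(): array
    {
        $tools = [new Tools\CheckInvoice()];

        if ($this->user->subscribed('premium')) {
            $tools[] = new Tools\GenerateReport(); // premium-only tool
        }

        return $tools;
    }
}
```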

Essentially, the two solutions above are ways of saying: I want a relevant tool set to be sent only when I am going to need it. We were thinking about the possibility of selecting a tool locally before a query is sent, then remembered that tool selection is performed intelligently by the LLM itself. So in the near future, if low-cost LLMs that run locally become practical, we could let a local LLM perform the tool selection and then send the query to the smartest model with a tool already selected.

Problem B: Model Selection
Costs are also directly influenced by which model you choose for the task. The AI SDK has built-in functionality for model selection, such as a useCheapestModel attribute for simple tasks (like basic text summarization), which avoids the high cost of the "smartest" models when they aren't necessary. You can also shift to the smartest models for more complex tasks.
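As a sketch, the setting might be applied per agent class; whether useCheapestModel is a PHP attribute, a property, or a method is an assumption:

```php
// Hypothetical: routing a simple summarization agent to a low-cost model.
#[UseCheapestModel]
class SummaryAgent extends Agent
{
    public function instructions(): string
    {
        return 'Summarize the given text in two sentences.';
    }
}
```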

Problem C: Testing
Imagine you are still building an agent and have to send prompts through a provider to test it. The provider doesn't care that the agent is still in the development phase; every test prompt costs like any other. The AI SDK, however, has built-in support for faking agents, which makes it easier to write tests for non-deterministic AI interactions.
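A sketch of what such a test could look like, assuming a fake() helper modelled on Laravel's existing fakes (Mail::fake(), Queue::fake()); the method names and response shape are guesses:

```php
// Hypothetical test against a faked agent: no real provider call, no cost,
// and a deterministic response to assert against.
public function test_agent_summarizes_ticket(): void
{
    SupportAgent::fake(['summary' => 'Customer requests a refund.']);

    $response = SupportAgent::prompt('Summarize ticket #42');

    $this->assertSame('Customer requests a refund.', $response['summary']);
}
```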

Historical Technical And Reliability Failures

The following issues were known in the SDK's pre-release versions and have been flagged as resolved in the stable release. Still, watch out for them to ensure your AI agents don't run into them.

Timeouts
It has been noted that for complex tasks like long-form translations or image generation, the SDK frequently hit time limits. For instance, an early user revealed that certain models failed to deliver results within a 60-second window, which they suggested as the benchmark for a model being "really slow."
Unknown Finish Reasons
Even top-tier models can return an "unknown finish reason," failing to complete a prompt without a clear explanation.
Solution: Tools like structured output help here; if you rely on free-form text instead, the output is "arbitrary" and difficult to parse reliably.
Provider Downtime
Any AI provider can experience "bad days" where it is temporarily down or unresponsive, meaning results are never guaranteed. Success can be hit-or-miss even when using the same prompt with the same model; in one test, a model successfully delivered an image only one time out of five. Reliability is a known issue: during a live demonstration of the SDK in the Taylor Otwell interview, the system returned an "AI provider is overloaded" error. The SDK accounts for this by checking for HTTP 429 (rate limiting) status codes.
Solution: The SDK provides an automatic failover to secondary providers.
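A failover setup might be expressed in configuration roughly like this; the file location and config keys are assumptions, only the failover behaviour itself is described above:

```php
// config/ai.php (hypothetical keys): fallback providers tried in order
// when the primary returns 429 or is unresponsive.
return [
    'provider'  => 'openai',
    'fallbacks' => ['anthropic', 'gemini'],
];
```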
Disobeying Rules
Models may fail to follow specific instructions, such as character limits. For example, Gemini Pro struggled to keep a tweet under the requested 280-character limit.
Safety Check Triggers
Prompts that mention celebrities or other restricted topics can trigger safety checks, causing the provider to block the output entirely.
Formatting and Hallucinations
AI-generated content often requires human intervention due to issues like typos (noted specifically in Gemini Flash) or the inclusion of incorrect data, such as wrong dates in generated images.
Questionable Aesthetics
In image generation, models may produce "cringe" results, such as repeating text inappropriately or failing to accurately depict requested subjects.
High Costs
If not monitored, certain operations—especially image generation—can be unexpectedly expensive, costing significantly more than text-based tasks.
Slow Response Times
Unlike the near-instant feel of consumer tools like ChatGPT, API calls can take 20 to 50+ seconds for complex operations, necessitating background queues and websockets to manage user expectations.
Solution: AI operations are not always instant. To manage user expectations and avoid blocking the application, the SDK uses streaming (sending text to the client chunk by chunk) and asynchronous background queues.
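Using Laravel's real queue and broadcasting infrastructure, deferring a slow prompt could look roughly like this; the agent call and the event class are hypothetical:

```php
// Run the slow AI call on a background queue, then push the result to the
// browser over websockets instead of blocking the HTTP request.
dispatch(function () use ($prompt) {
    $result = TranscriptionAgent::prompt($prompt); // may take 20-50+ seconds

    broadcast(new TranscriptionReady($result));
});
```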

Other Failures

Instruction Failures (Infinite Loops): A specific failure point known as the "Ralph Wiggum loop," where an agent gets stuck in an infinite loop of actions without stopping.
Sensitive Actions: Users must not give agents full autonomy over sensitive tasks, such as refunds, without human-in-the-loop approval, a feature that is currently on the roadmap but not yet trivial to implement.

Tooling

Agent tools are what make AI agents capable of performing intelligent tasks. We use tools to augment LLMs with capabilities they don't have, such as access to proprietary data and knowledge documents, searching the web, interacting with local file systems, and additional computation. As mentioned earlier, when an agent is prompted, its list of available tools is sent along with the message, and the LLM determines which tool to call based on the user's prompt. It does this by inferring the user's intent from the query.
The Laravel AI SDK has some built-in tools, such as web search, web fetch, and file search.
One useful feature when building a tool is async processing, in which you can queue prompts to run in the background, allowing the application to handle long-running tasks, like transcribing large audio files, without blocking the user interface.

Basic Setup Recommendation

The AI SDK is installed the same way as any other Laravel package. If you are using an AI-assisted ("vibe-coding") editor, it is recommended to also install Laravel Boost, which gives your editor access to the latest Laravel documentation.

Limitations

The Orchestrator Pattern
Although this functionality is a desired one for building an Agentic System, it is not built into the initial release. This pattern involves a primary "orchestrator" agent that manages a complex task by delegating specific parts of it to other specialized agents. Instead of one agent trying to do everything, the orchestrator acts as a manager that understands the high-level goal and decides which "expert" to call upon to handle specific sub-tasks.
Lack of Native Human-in-the-Loop Controls
Currently, the SDK does not have a built-in "human tool approval" mechanism. This means that if you give an agent a tool to generate payment links, it may execute that action autonomously without oversight. The framework's creator, Taylor Otwell, has noted that for sensitive actions such as a refund tool, human approval is a "meaty" and "not trivial" feature that is still on the roadmap.

Conclusion

With all the capabilities and functionality the AI SDK adds to the Laravel PHP ecosystem, it is important to note that how your agent performs, both financially and in terms of productivity, will largely depend on your setup design and how your agent interacts with the LLM. We cannot wait to see the projects you build with the SDK, and we would be happy if you shared them in the comments section.
