Customize agents¶
Developing an agent involves editing the custom_model source code. A variety of tools and commands are provided to help you test and deploy your agent during the development process.
Generic base template
You can use the generic_base template to build an agent using any framework of your choice; however, you need to implement the agent logic and structure yourself, as this template does not include any pre-defined agent code.
Modify the agent code¶
The first step in developing your agent is to modify the agent code to implement your desired functionality. The main agent code is located in the custom_model directory inside the framework-specific directory you selected when you created the project. For example, if you selected the CrewAI framework, the path would be agent_crewai/custom_model.
custom_model/
├── __init__.py # Package initialization
├── agent.py # Main agent implementation, including prompts
├── custom.py # DataRobot integration hooks
├── config.py # Configuration management
├── mcp_client.py # MCP server integration (optional, for tool use)
└── model-metadata.yaml # Agent metadata configuration
| File | Description |
|---|---|
| `__init__.py` | Identifies the directory as a Python package and enables imports. |
| `model-metadata.yaml` | Defines the agent's configuration, runtime parameters, and deployment settings. |
| `custom.py` | Implements the DataRobot integration hooks (`load_model`, `chat`) for agent execution. |
| `agent.py` | Contains the main `MyAgent` class with the core workflow logic and framework-specific implementation. |
| `config.py` | Manages configuration loading from environment variables, runtime parameters, and DataRobot credentials. |
| `mcp_client.py` | Provides MCP server connection management for tool integration (optional; only needed when using MCP tools). |
The main class you need to modify is in agent.py. Review this class for the implementation details specific to the framework you are developing with.
The agent template provides you with a simple example that contains three agents and three tasks. You can modify this code to add more agents, tasks, and tools as needed. Each agent is connected to an LLM provider, which is specified by the llm property in the agent.py file. You can modify this property to change the LLM provider or its configuration. See the Configuring LLM providers documentation for more details. For more information on the overall structure of agentic workflow templates, see the Agent components documentation.
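For example, in the CrewAI template you could point every agent at a different model by overriding the llm property. The sketch below is illustrative only, not the template's actual implementation; the `base_url` and `api_key` attributes are assumed to come from the template's configuration, so check config.py and agent.py in your project for the real names.

```python
# Hypothetical sketch (CrewAI template): changing the model used by all agents.
from crewai import LLM

@property
def llm(self) -> LLM:
    return LLM(
        model="openai/gpt-4o-mini",  # replace with your provider/model identifier
        base_url=self.base_url,      # assumed config attribute
        api_key=self.api_key,        # assumed config attribute
    )
```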
datarobot_genai package
Agent templates use the datarobot_genai package to streamline development. This package provides helper functions and base classes that simplify agent implementation, including LLM configuration, response formatting, and integration with DataRobot services. The templates automatically include this package, so you don't need to install it separately.
Modify agent prompts¶
Each agent template uses different approaches for defining and customizing prompts. Understanding how to modify prompts in your chosen framework is crucial for tailoring agent behavior to your specific use case.
In CrewAI templates, prompts are defined through several properties in the MyAgent class within the agent.py file:
- Agent prompts: Defined using the `role`, `goal`, and `backstory` properties.
- Task prompts: Defined using the `description` and `expected_output` properties.
@property
def agent_planner(self) -> Agent:
    return Agent(
        role="Content Planner",
        goal="Plan engaging and factually accurate content on {topic}",
        backstory="You're working on planning a blog article about the topic: {topic}. You collect "
        "information that helps the audience learn something and make informed decisions. Your work is "
        "the basis for the Content Writer to write an article on this topic.",
        allow_delegation=False,
        verbose=self.verbose,
        llm=self.llm,
    )
To modify CrewAI agent prompts:
- Update agent behavior: Modify the `role`, `goal`, and `backstory` properties in agent definitions.
- Use variables: Leverage `{topic}` and other variables for dynamic prompt content.
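For example, the planner agent shown above could be repurposed for technical documentation by rewriting only its prompt properties. The snippet below is illustrative; everything other than the prompt strings stays as in the template.

```python
# Illustrative rewrite of the template's planner agent for a different use case.
@property
def agent_planner(self) -> Agent:
    return Agent(
        role="Documentation Planner",
        goal="Plan accurate, well-structured technical documentation on {topic}",
        backstory="You're planning developer documentation about {topic}. You collect the details "
        "a reader needs to complete the task, and your outline is the basis for the "
        "Content Writer to draft the page.",
        allow_delegation=False,
        verbose=self.verbose,
        llm=self.llm,
    )
```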
@property
def task_plan(self) -> Task:
    return Task(
        description=(
            "1. Prioritize the latest trends, key players, and noteworthy news on {topic}.\n"
            "2. Identify the target audience, considering their interests and pain points.\n"
            "3. Develop a detailed content outline including an introduction, key points, and a call to action.\n"
            "4. Include SEO keywords and relevant data or sources."
        ),
        expected_output="A comprehensive content plan document with an outline, audience analysis, SEO keywords, "
        "and resources.",
        agent=self.agent_planner,
    )
To modify CrewAI task prompts:
- Customize task instructions: Update the `description` property in task definitions.
- Change expected outputs: Modify the `expected_output` property to match your requirements.
- Use variables: Leverage `{topic}` and other variables for dynamic prompt content.
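As a minimal illustration, here is the same task with a narrower description and expected output; only the prompt strings differ from the template.

```python
# Illustrative rewrite of the template's planning task.
@property
def task_plan(self) -> Task:
    return Task(
        description=(
            "1. Summarize the current state of {topic} for a technical audience.\n"
            "2. List the key questions a reader is likely to have about {topic}.\n"
            "3. Produce an outline with sections, key points, and suggested sources."
        ),
        expected_output="A structured outline in Markdown with sections, key points, and sources.",
        agent=self.agent_planner,
    )
```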
For more advanced CrewAI prompt engineering techniques, see the CrewAI Agents documentation and CrewAI Tasks documentation.
In LangGraph templates, prompts are defined by passing a system prompt to the prompt parameter of agent definitions, built with the make_system_prompt helper method within the MyAgent class:
@property
def agent_planner(self) -> Any:
    return create_react_agent(
        self.llm,
        tools=[],
        prompt=self.make_system_prompt(
            "You are a content planner. You are working with a content writer and editor colleague.\n"
            "You're working on planning a blog article about the topic. You collect information that helps the "
            "audience learn something and make informed decisions. Your work is the basis for the Content Writer "
            "to write an article on this topic.\n"
            "1. Prioritize the latest trends, key players, and noteworthy news on the topic.\n"
            "2. Identify the target audience, considering their interests and pain points.\n"
            "3. Develop a detailed content outline including an introduction, key points, and a call to action.\n"
            "4. Include SEO keywords and relevant data or sources."
        ),
    )
To modify LangGraph prompts:
- Update system prompts: Modify the string passed to `make_system_prompt()` in agent definitions.
- Customize the helper method: Update the `make_system_prompt` method to change the base prompt structure.
- Add task-specific instructions: Include detailed instructions within the system prompt string.
- Modify agent behavior: Update the prompt content to change how agents interpret and respond to tasks.
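The sketch below shows one possible shape for the helper method. It is an assumption about its structure, not the template's actual implementation; the point is that a shared base prompt is combined with agent-specific instructions passed in as a suffix.

```python
def make_system_prompt(self, suffix: str) -> str:
    # Hypothetical base prompt shared by all agents; agent-specific
    # instructions are appended via `suffix`.
    return (
        "You are a helpful AI assistant collaborating with other assistants. "
        "Work on your part of the task and hand off when you are done.\n"
        f"{suffix}"
    )
```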
For more advanced LangGraph prompt engineering techniques, see the LangGraph documentation.
In LlamaIndex templates, prompts are defined using the system_prompt parameter in FunctionAgent definitions within the MyAgent class:
@property
def research_agent(self) -> FunctionAgent:
    return FunctionAgent(
        name="ResearchAgent",
        description="Useful for finding information on a given topic and recording notes on the topic.",
        system_prompt=(
            "You are the ResearchAgent that can find information on a given topic and record notes on the topic. "
            "Once notes are recorded and you are satisfied, you should hand off control to the "
            "WriteAgent to write a report on the topic. You should have at least some notes on a topic "
            "before handing off control to the WriteAgent."
        ),
        llm=self.llm,
        tools=[self.record_notes],
        can_handoff_to=["WriteAgent"],
    )
To modify LlamaIndex prompts:
- Update system prompts: Modify the `system_prompt` string in `FunctionAgent` definitions.
- Customize agent descriptions: Update the `description` parameter to change how agents are identified.
- Modify handoff behavior: Update the `can_handoff_to` list and system prompt to control agent workflow.
- Add tool-specific instructions: Include instructions about when and how to use specific tools.
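For example, the research agent above could be refocused on fact checking by changing only these parameters. This is an illustrative sketch; the record_notes tool and the WriteAgent handoff target are taken from the template.

```python
# Illustrative rewrite of the template's research agent.
@property
def research_agent(self) -> FunctionAgent:
    return FunctionAgent(
        name="ResearchAgent",
        description="Finds and verifies facts on a given topic and records sourced notes.",
        system_prompt=(
            "You are the ResearchAgent. Find information on the given topic, verify it against "
            "at least one source, and record sourced notes. Once your notes are complete, hand "
            "off control to the WriteAgent to draft the report."
        ),
        llm=self.llm,
        tools=[self.record_notes],
        can_handoff_to=["WriteAgent"],
    )
```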
For more advanced LlamaIndex prompt engineering techniques, see the LlamaIndex prompt engineering documentation.
In NAT (NVIDIA NeMo Agent Toolkit) templates, prompts are defined in the workflow.yaml file using the system_prompt field within function definitions:
functions:
  planner:
    _type: chat_completion
    llm_name: datarobot_llm
    system_prompt: |
      You are a content planner. You are working with a content writer and editor colleague.
      You're working on planning a blog article about the topic.
      You collect information that helps the audience learn something and make informed decisions.
      Your work is the basis for the Content Writer to write an article on this topic.
      1. Prioritize the latest trends, key players, and noteworthy news on the topic.
      2. Identify the target audience, considering their interests and pain points.
      3. Develop a detailed content outline including an introduction, key points, and a call to action.
      4. Include SEO keywords and relevant data or sources.
To modify NAT prompts:
- Update system prompts: Modify the `system_prompt` field in function definitions within `workflow.yaml`.
- Configure LLM per function: Set the `llm_name` field to reference an LLM defined in the `llms` section of `workflow.yaml`.
- Modify workflow structure: Update the `workflow` section to change the execution order and tool list.
- Add new functions: Define additional functions in the `functions` section to extend agent capabilities.
For more advanced NAT usage instructions, see the NVIDIA NeMo Agent Toolkit documentation.
Best practices for prompt modification
When modifying prompts across any framework:
- Be specific: Provide clear, detailed instructions for what you want the agent to accomplish.
- Use consistent formatting: Maintain consistent prompt structure across all agents in your workflow.
- Test incrementally: Make small changes and test them before implementing larger modifications.
- Consider context: Ensure prompts work well together in multi-agent workflows.
- Document changes: Keep track of prompt modifications for future reference and team collaboration.
Enable streaming responses¶
Streaming allows agents to send responses incrementally as they are generated, rather than waiting for the complete response. This provides a better user experience by showing progress in real-time, reducing perceived latency, and enabling users to see agent actions as they happen.
Streaming support varies by agent framework. There are three levels of streaming implementation:
- Chunk streaming: Each chunk from the LLM is streamed as it's generated (such as tokens/partial text).
- Step streaming: Response from each sub-agent is streamed when ready.
- Event streaming: Each individual event (starting new step, calling a tool, reasoning) is streamed.
| Framework | Streaming | Notes |
|---|---|---|
| LangGraph | Enabled | Chunk-level streaming is automatically enabled when stream=True is passed. The base LangGraphAgent class handles streaming responses. |
| Generic Base | Supported | All streaming levels (chunk, step, event) require custom implementation. Example code is provided in agent.py for chunk streaming. |
| CrewAI | Supported | All streaming levels (chunk, step, event) require custom implementation. Event listeners capture agent execution and tool usage events incrementally, which facilitates step and event streaming with custom code. Chunk streaming requires custom implementation to stream from the LLM directly. |
| LlamaIndex | Supported | All streaming levels (chunk, step, event) require custom implementation. The framework executes agents incrementally, which facilitates step streaming with custom code. Chunk and event streaming require custom implementation. |
| NAT | Supported | All streaming levels (chunk, step, event) require custom implementation. |
Infrastructure support
All agent templates include infrastructure in custom.py that can handle streaming responses. For frameworks that require a custom implementation (Generic Base, CrewAI, LlamaIndex, and NAT), modify your agent's invoke() method to return an AsyncGenerator when streaming is requested; the infrastructure then automatically converts it to the appropriate streaming response format. The is_streaming helper is available to all framework templates via the datarobot_genai package (`from datarobot_genai.core.agents import is_streaming`) and checks whether stream=True is present in the chat completion request body parameters.
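A minimal sketch of this pattern is shown below. The invoke() signature, the completion_create_params name, and the self.llm.astream() call are assumptions for illustration; the actual entry point, request shape, and LLM call depend on the framework template you are using.

```python
from typing import Any, AsyncGenerator

from datarobot_genai.core.agents import is_streaming  # helper provided by the templates


class MyAgent:
    async def _stream_chunks(self, prompt: str) -> AsyncGenerator[str, None]:
        # Chunk streaming: yield partial text as the LLM produces it.
        async for chunk in self.llm.astream(prompt):  # assumed streaming call
            yield chunk

    def invoke(self, completion_create_params: dict[str, Any]) -> Any:
        prompt = completion_create_params["messages"][-1]["content"]
        if is_streaming(completion_create_params):
            # Returning an AsyncGenerator lets the custom.py infrastructure
            # convert the output into a streaming chat completion response.
            return self._stream_chunks(prompt)
        return self.run(prompt)  # assumed non-streaming entry point
```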
If streaming is implemented for an agent, enable streaming when testing locally (via CLI) or when making predictions with a deployed agent (via API).
Use the --stream flag when running the agent CLI:
task agent:cli -- execute --user_prompt 'Write a document about the history of AI.' --stream
You can also use streaming with structured queries:
task agent:cli -- execute --user_prompt '{"topic":"Generative AI"}' --stream
Set stream=True in the completion parameters when making API calls:
from openai import OpenAI

client = OpenAI(
    base_url=CHAT_API_URL,
    api_key=API_KEY,
)

completion = client.chat.completions.create(
    model="datarobot-deployed-llm",
    messages=[
        {"role": "user", "content": "What would it take to colonize Mars?"},
    ],
    stream=True,  # Enable streaming
)

# Process the streaming response
for chunk in completion:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
Testing the agent during local development¶
You can test your agent locally using the development server provided in the template. This allows you to run and debug your agent code without deploying it to DataRobot.
To submit a test query to your agent, the development server must be running. Start it manually or use the auto-start option:
Start the development server manually when running multiple tests. The development server runs continuously and blocks the terminal, so start it in one terminal:
task agent:dev
Keep this terminal running. Then, in a different terminal, run your test commands:
task agent:cli -- execute --user_prompt 'Write a document about the history of AI.'
You can also send a structured query as a prompt if your agentic workflow requires it:
task agent:cli -- execute --user_prompt '{"topic":"Generative AI"}'
Auto-start the development server for single tests. Use START_DEV=1 to automatically start and stop the development server:
task agent:cli START_DEV=1 -- execute --user_prompt 'Write a document about the history of AI.'
You can also send a structured query as a prompt if your agentic workflow requires it:
task agent:cli START_DEV=1 -- execute --user_prompt '{"topic":"Generative AI"}'
This command will run the agent locally and print the output to the console. You can modify the query to test different inputs and scenarios. For additional local CLI commands that are available, see the Using the agent CLI documentation.
Fast iteration with runtime dependencies
For rapid development, you can add Python dependencies without rebuilding the Docker image using runtime dependencies. This process improves iteration speed. See the Add Python packages documentation for details on adding runtime dependencies.
Build an agent for testing in the DataRobot LLM Playground¶
To create a custom model that can be refined using the DataRobot LLM Playground:
task build
This command runs the Pulumi infrastructure to create a custom model in DataRobot but does not create a full production deployment. This is significantly faster for iterative cloud development and testing.
For more examples on working with agents in the DataRobot LLM Playground, see the Agentic playground documentation.
Build command considerations
The task build command will remove any existing deployments. These can be recreated using task deploy if they are removed, but the new deployments will have different deployment IDs.
Deploy an agent for production use¶
To create a full production-grade deployment:
task deploy
This command builds the custom model and creates a production deployment with the necessary infrastructure, which takes longer but provides a complete production environment. The deployment is a standard DataRobot deployment that includes full monitoring, logging, and scaling capabilities. For more information about DataRobot deployments, see the Deployment documentation.
View logs and traces for a deployed agent¶
Once your agent is deployed, you can view OpenTelemetry (OTel) logs and traces in the DataRobot UI.
To view logs, on the Deployments tab, locate and click your deployment, click the Activity log tab, and then click Logs. The logs display in OpenTelemetry format and include log levels (INFO, DEBUG, WARN, and ERROR), time-period filtering (Last 15 min, Last hour, Last day, or Custom range), and export capabilities via the OTel logs API for integration with third-party observability tools like Datadog.
Access and retention
Logs are available for all deployment types. Only users with "Owner" and "User" roles on a deployment can view these logs. Logs are stored for 30 days.
To view traces, which follow the end-to-end path of requests to your agent, on the Service health tab of your deployment, click Show tracing in the upper-right corner of the Total predictions chart. The tracing table displays trace information including timestamp, status, trace ID, duration, spans count, cost, prompt, and completion. Click a trace row to view detailed spans in Chart or List format, showing individual steps in the agent's execution, including LLM API calls, tool invocations, and agent actions.
For more information, see the logs documentation and tracing documentation.
Manually deploy an agent using Pulumi¶
If needed, you can manually run Pulumi commands to debug or refine Pulumi code.
# Load environment variables
set -o allexport && source .env
# For build mode only (custom model without deployment)
export AGENT_DEPLOY=0
# Or for full deployment mode (default)
# export AGENT_DEPLOY=1
# Navigate to the infrastructure directory
cd ./infra
# Run Pulumi deployment
pulumi up
The AGENT_DEPLOY environment variable controls whether Pulumi creates only the custom model (AGENT_DEPLOY=0) or both the custom model and a production deployment (AGENT_DEPLOY=1). If not set, Pulumi defaults to full deployment mode.
Pulumi will prompt you to confirm the resources to be created or updated.
Make predictions with a deployed agentic workflow¶
After the agentic workflow is deployed, access real-time prediction snippets from the deployment's Predictions > Prediction API tab. For more information on deployment predictions, see the Prediction API snippets documentation.
Alternatively, you can modify the script below to make predictions with the deployed agentic workflow, replacing the placeholders for the API_KEY, DEPLOYMENT_ID, and CHAT_API_URL variables.
import sys
import logging
import time
import os

from openai import OpenAI

API_KEY = '<API_KEY>'  # Your API Key
DEPLOYMENT_ID = '<DEPLOYMENT_ID>'  # The agentic workflow deployment ID
CHAT_API_URL = '<CHAT_API_URL>'  # The chat API URL for the agentic workflow deployment
# For example, 'https://app.datarobot.com/api/v2/deployments/68824e9aa1946013exfc3415/'

logging.basicConfig(
    level=logging.INFO,
    stream=sys.stdout,
    format='%(asctime)s %(filename)s:%(lineno)d %(levelname)s %(message)s',
)
logger = logging.getLogger(__name__)


def main():
    openai_client = OpenAI(
        base_url=CHAT_API_URL,
        api_key=API_KEY,
        _strict_response_validation=False,
    )

    prompt = "What would it take to colonize Mars?"
    logger.info(f"Trying simple prompt first: \"{prompt}\"")

    completion = openai_client.chat.completions.create(
        model="datarobot-deployed-llm",
        messages=[
            {"role": "system", "content": "Explain your thoughts using at least 100 words."},
            {"role": "user", "content": prompt},
        ],
        max_tokens=512,  # omit if you want to use the model's default max
    )
    print(completion.choices[0].message.content)
    return 0


if __name__ == '__main__':
    sys.exit(main())
Next steps¶
After deployment, your agent will be available in your DataRobot environment. You can:
- Test your deployed agent using `task agent:cli -- execute-deployment`.
- Integrate your agent with other DataRobot services.
- Monitor usage and performance in the DataRobot dashboard.
For agentic platform-specific assistance beyond the scope of the examples provided in this repository, see the official documentation for each framework.
You can also find more examples and documentation in the public repositories for specific frameworks to help you build more complex agents, add tools, and define workflows and tasks.
