Skip to content

Agent Configuration

Agent configuration controls everything about how an agent behaves — the model it uses, how it searches memory, what tools it can invoke, and how it handles cost limits.

Configuration is set when you create an agent and can be updated at any time from the agent’s Settings tab.


A complete agent configuration has these top-level sections:

{
"agent": { ... }, // Identity and display settings
"model": { ... }, // LLM selection and fallback chain
"memory": { ... }, // Memory search and storage behavior
"tools": { ... }, // Tool access control
"session": { ... }, // Session and context window settings
"limits": { ... } // Cost and rate limits
}

Basic identity and display settings.

"agent": {
"name": "Aria",
"role": "Client Concierge",
"description": "First point of contact for client inquiries"
}
FieldTypeDescription
namestringDisplay name shown in the dashboard and chat UI
rolestringShort role label (e.g., “Research Analyst”, “Ops Monitor”)
descriptionstringOne-sentence description of what this agent does

Controls which LLM the agent uses and what happens if it’s unavailable.

"model": {
"default": "claude-haiku-3.5",
"fallbacks": [
"claude-sonnet-4-20250514"
]
}
FieldTypeDescription
defaultstringPrimary model ID to use for all requests
fallbacksstring[]Ordered list of fallback models if the primary is unavailable

Available models (subset — see Models & Fallbacks for full list):

Model IDProviderCost tierBest for
claude-haiku-3.5anthropicEconomyRoutine tasks, high-volume cron jobs
claude-sonnet-4-20250514anthropicFlagshipMost tasks — good balance of quality and cost
claude-opus-4-20250514anthropicFlagshipComplex reasoning, high-stakes output
gpt-4oopenaiFlagshipBroad capability, strong reasoning
gpt-4o-miniopenaiEconomyLowest-cost option for simple tasks
gemini-2.0-flashgoogleEconomyUltra-low cost classification and routing
mistral-large-latestmistralFlagshipEU data residency, strong instruction following

See Models & Fallbacks for a full guide on model routing strategies.


Controls how the agent searches and uses its long-term memory store.

"memory": {
"search": {
"enabled": true,
"query": {
"hybrid": {
"vectorWeight": 0.7,
"bm25Weight": 0.3,
"mmr": true,
"temporalDecay": true
}
},
"topK": 5
}
}
FieldTypeDefaultDescription
search.enabledbooleantrueWhether the agent queries memory at the start of each turn
search.query.hybrid.vectorWeightnumber0.7Weight given to semantic (vector) similarity
search.query.hybrid.bm25Weightnumber0.3Weight given to keyword (BM25) matching
search.query.hybrid.mmrbooleantrueEnables Maximal Marginal Relevance — reduces redundant results
search.query.hybrid.temporalDecaybooleantrueWeights recent memories higher than old ones
search.topKnumber5Number of memory results injected into context per turn

Controls which tools the agent is allowed to invoke. Only list tools you want enabled — unlisted tools are unavailable by default.

"tools": {
"allowed": [
"web_search",
"memory_search",
"memory_store",
"send_message"
]
}

Available tools:

ToolDescription
web_searchSearch the web for current information
memory_searchQuery the agent’s long-term memory
memory_storeSave information to long-term memory
send_messageSend a message via Telegram or Slack
read_fileRead files from the agent’s workspace
write_fileWrite or update files in the agent’s workspace
run_scriptExecute scripts in the agent’s container
http_requestMake HTTP requests to external APIs

See Tools & Integrations for tool-specific configuration options.


Controls context window behavior and session management.

"session": {
"maxContextTokens": 50000,
"systemPromptPath": "SOUL.md"
}
FieldTypeDefaultDescription
maxContextTokensnumber50000Maximum tokens to include from session history per turn
systemPromptPathstring"SOUL.md"Path to the SOUL file, relative to the agent’s workspace

Cost and rate guardrails.

"limits": {
"dailyTokenBudget": 500000,
"monthlyTokenBudget": 10000000,
"alertThreshold": 0.8
}
FieldTypeDescription
dailyTokenBudgetnumberMaximum tokens per day. Agent pauses if exceeded.
monthlyTokenBudgetnumberMaximum tokens per calendar month
alertThresholdnumberFraction of budget (0–1) that triggers a cost alert notification

{
"agent": {
"name": "Pulse",
"role": "Analytics Analyst",
"description": "Monitors business metrics and surfaces weekly trends"
},
"model": {
"default": "claude-haiku-3.5",
"fallbacks": ["claude-sonnet-4-20250514"]
},
"memory": {
"search": {
"enabled": true,
"query": {
"hybrid": {
"vectorWeight": 0.7,
"bm25Weight": 0.3,
"mmr": true,
"temporalDecay": true
}
},
"topK": 5
}
},
"tools": {
"allowed": [
"memory_search",
"memory_store",
"http_request",
"send_message"
]
},
"session": {
"maxContextTokens": 40000,
"systemPromptPath": "SOUL.md"
},
"limits": {
"dailyTokenBudget": 200000,
"monthlyTokenBudget": 4000000,
"alertThreshold": 0.8
}
}