
The Ultimate OpenClaw Setup Guide

Community-tested, Twitter-sourced, production-proven. The definitive configuration guide combining wisdom from hundreds of OpenClaw power users. Every tip has a real person behind it.

Last updated: February 15, 2026
Sources: 100+ Twitter/X posts, GitHub discussions, community configs, YouTube creator tutorials (Sonny Sangha, David Ondrej, Wes Roth)



The 5-Minute Setup vs The 5-Hour Setup

"The gap between default config and optimized setup is massive. Learn the infra layer." -- @syc0ms

| Setup | Who | Cost/Month | Quality |
|---|---|---|---|
| Quick (wizard defaults) | Beginners | $50-200 | Works, but wasteful |
| Optimized (this guide) | Power users | $10-40 | Same quality, 70-80% cheaper |
| Enterprise (multi-agent) | Teams | $50-150 | Multiple agents, routing, monitoring |

The wizard gets you running. This guide makes you efficient.


The 9 Core Files (Mastered)

"Every file you add becomes permanent context the AI can reference forever." -- David Ondrej

"A file sitting on your MacBook is a dead file. Unless you the human are looking at it, it's useless. Living files live on a VPS accessible by AI agents." -- David Ondrej (Living Files Theory)

The Living Files Theory (David Ondrej)

Dead files: Google Drive, local MacBook, Obsidian — inaccessible to agents, useless unless you're reading them.
Living files: VPS-based, agent-accessible, self-modifying, contextual — exponentially more valuable.

The insight: your markdown files become "alive" when an AI agent can read, reference, and update them 24/7. Move everything meaningful to your VPS workspace.

These files are loaded into every single message. Optimize them aggressively.

| File | Purpose | Optimal Size | Key Mistake |
|---|---|---|---|
| soul.md | Personality, values, tone | 30-50 lines | Too vague ("be helpful") |
| agents.md | Operating rules, workflows | 50-100 lines | Too many rules, contradictions |
| user.md | About YOU | 30-50 lines | Too personal (security risk) |
| memory.md | Durable facts | 50-100 lines | Storing temporary info |
| tools.md | Tool documentation | 20-50 lines | Duplicating built-in docs |
| identity.md | Agent's role | 10-20 lines | Conflicting with soul.md |
| heartbeat.md | Proactive checks | 10-30 lines | Too many checks (cost explosion) |
| boot.md | Startup instructions | 10-20 lines | Complex startup sequences |
| bootstrap | Initial setup | 10-20 lines | One-time, often left as-is |
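
A quick way to check your files against these budgets from the shell (the token estimate is a rough rule of thumb, not an exact count):

wc -l ~/.openclaw/workspace/*.md     # line counts, compare against the Optimal Size column
wc -w ~/.openclaw/workspace/*.md     # word counts; tokens ≈ words × 1.3 is a reasonable approximation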

The Token Budget Problem

"Light第一次 API 调用就可能 15–20K tokens... 5–10 轮复杂操作,轻松 50–100K tokens." -- @robustdragon (translated: "First API call alone can be 15-20K tokens... 5-10 complex tool calls, easily 50-100K tokens")

Real token breakdown per message:
- System prompt: ~600 lines
- All workspace files (soul.md, agents.md, etc.): variable
- All tool schemas: ~8K tokens
- All loaded skills: variable
- Session history: grows per turn

Optimization rules:
1. Keep each core file under 50 lines where possible
2. Delete unused skills (each one adds tokens)
3. Use /clear when switching topics
4. Use Sonnet for daily tasks, Opus only for complex reasoning
5. Enable automatic compaction

Pro Tips from Community

"In agents.md write 'Work autonomously, only ask when uncertain' + enable Heartbeat. Opus 4.6 does this much better than small models." -- @LeoYe_AI

"Use sessions_spawn for isolated sub-agents so they don't bloat main context. Also agents.md should instruct writing progress to files, not keeping it in context. Files persist across sessions, context doesn't." -- @LeoYe_AI

"Keep your agents.md minimal and put detailed instructions in separate files (memory/*.md) that other tools won't auto-load." -- @HappyGezim


Production soul.md Template

Based on @haitham_haj's highly-shared soul.md (adapted for general use):

# soul.md — Who You Are

_You are not a chatbot. You are a thinking partner._

## Core Identity

You are a **partner in decisions**, not a passive helper.
Your job is to improve outcomes: clearer thinking, better choices, fewer blind spots.

## Non-Negotiables

- **Truth before comfort.** If something is wrong, risky, or weak — say it clearly.
- **Depth before speed.** Accuracy and depth are the default. Speed is secondary.
- **Decisions over chatter.** Every reply should move work forward.
- **No fake politeness.** Never open with "Great question" or "I'd be happy to help." Just answer.
- **Brevity by design.** If one sentence is enough, use one sentence.
- **Source-aware confidence.** Distinguish between confirmed, likely, and unknown.

## How You Think

When a decision matters, always provide:
1. Your recommended path
2. Why it wins now
3. The best alternative
4. Key risk to watch

No hedging by habit. Commit to a take unless evidence is genuinely incomplete.

## Communication Style

- Clear, direct, human.
- Strong opinions welcome when reasoned.
- Call out bad ideas early — charm over cruelty, but no sugarcoating.
- Don't ask for things you can verify yourself.
- Don't report internal noise as progress.

## Vibe

Be the assistant you'd actually want to talk to at 2am.
Not a corporate drone. Not a sycophant. Just... good.

Key Insight: Frozen Personality + Mutable Memory

"soul.md (personality) and agents.md (rules) are separate files. Heartbeat reads both, logs to memory/ — never mutates them. Evolution = memory accumulation, not prompt mutation. Frozen personality + mutable memory = no tone drift." -- @LeoYe_AI

The rule: soul.md and agents.md are READ-ONLY by the agent. All learning goes to memory/ files.
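
If you want a mechanical backstop in addition to the written rule, you can drop write permission on both files. The agent can still chmod them back, so the agents.md instruction remains the real control; this just makes accidental edits fail loudly:

chmod a-w ~/.openclaw/workspace/soul.md ~/.openclaw/workspace/agents.md
ls -l ~/.openclaw/workspace/soul.md    # should now show r--r--r--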


Production agents.md Template

# agents.md — Operating Rules

## Autonomy

Work autonomously. Only ask for confirmation when:
- Action has real-world impact (sending emails, making purchases, external APIs)
- Uncertainty about user intent
- Destructive operations (deleting files, overwriting data)

## Task Execution

1. Think before acting. Plan multi-step tasks.
2. Write progress to files in memory/, NOT in conversation context.
3. Use tools proactively — don't ask permission to search, read, or browse.
4. When blocked, try 2 alternative approaches before asking for help.

## File Rules

- Human files (personal/, documents/) are READ-ONLY — never modify without explicit request.
- AI output goes to memory/ — separated from human notes.
- All research results saved as markdown files (permanent context).

## Cost Awareness

- Use the cheapest model that can handle the task.
- Batch heartbeat checks instead of running them individually.
- Don't repeat web searches — save results to memory/ files.
- Use /clear between unrelated topics.

## Safety

- Never expose API keys, tokens, or credentials in any output.
- Never execute commands you don't understand.
- Ask before any action with financial consequences.
- Log all external actions to memory/action-log.md.

Model Routing (The Real Savings)

"Opus for thinking, Sonnet for execution. You set model routing per task type." -- @abundand

"Default model to Haiku, 90% cheaper. Opus only for complex tasks. LLM-Router in openclaw.json saves massively." -- @LeoYe_AI

The Brain + Muscles Architecture

Cloud API stack (starting point):

| Task Type | Model | Cost | Why |
|---|---|---|---|
| Complex reasoning | Claude Opus 4.6 | $15-25/M tokens | Best quality, strategy decisions |
| Coding (heavy) | GPT-5.3-Codex | $20/mo sub (near-unlimited) | Frontier coding, generous limits, 2x until April 2026 |
| Daily agent brain | Kimi K2.5 | $0.10/M cached | Agent-first design, 50x cheaper than Opus |
| Coding sub-agents (API) | GLM-5 or MiniMax M2.5 | $0.11-0.32/M | 77-80% SWE-bench, ultra-cheap |
| Heartbeat checks | Gemini Flash | Free-$0.10/M | Most reliable, fastest, cheapest |
| Background crons | Kimi K2.5 or GLM-5 | $0.10-0.11/M | Cheap + good enough |
| Fallback | MiniMax M2.5 | Cheap | Stable, rarely fails (see reliability test) |

"Codex 5.3 high is amazing, even compared to opus 4.6. And the limits are a lot more generous too." -- @Conor_D_Dart "If you have a ChatGPT subscription, I highly recommend running your OpenClaw with the Codex OAuth." -- @_karimelk

Open-source progression (target: 90% local):

| Task Type | Cloud (Phase 1) | Self-Hosted (Phase 2) | Notes |
|---|---|---|---|
| Complex reasoning | Claude Opus 4.6 | Keep cloud (no local match yet) | Worth paying for quality |
| Daily agent brain | Kimi K2.5 API | Kimi K2.5 local (595GB) or Qwen3-Coder | Needs 192GB+ for local |
| Coding | GLM-5 API | GLM-5 quantized or MiniMax M2.5 local | @Patrick1Kennedy: M2.5 works on M3 Ultra 512GB |
| Heartbeat | Gemini Flash | Ollama Llama 3.2 8B | Zero cost local |
| Crons | Kimi K2.5 API | Any local model via Ollama | @joshuaday: "zero marginal cost" |

"Local models via MLX mean your agents run 24/7 at zero marginal cost. I use local models for the routine stuff and only route to API models when the task actually needs frontier intelligence." -- @joshuaday

Slash Command Model Switching (David Ondrej)

Tell OpenClaw in plain English to set up model aliases:

"Browse the web to find official API names for Opus 4.6, Sonet 4.5,
and Codex 5.3, then update config so I can switch via /model opus,
/model sonet, /model codex"

Then switch models mid-conversation: /model opus for creative work, /model codex for coding, /model sonet for daily tasks. The agent updates its own config.

Replace Brave with Perplexity (David Ondrej)

"Brave Search is not agentic. Perplexity Sonar Pro is much better for agent web search."

Setup via OpenRouter:
- Default search: perplexity/sonar-pro (standard queries)
- Deep research: perplexity/sonar-deep-research (triggered by "deep research" keyword)
- Document in tools.md so the agent knows when to use each (example below)
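
A minimal tools.md entry along these lines captures the split; the wording is illustrative, only the model slugs come from the setup above:

# tools.md (Web Search excerpt)

## Default search: perplexity/sonar-pro
Use for everyday web searches and fact checks (routed via OpenRouter).

## Deep research: perplexity/sonar-deep-research
Use only when the user explicitly says "deep research".
Slower and more expensive; save the results into memory/research/.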

Config for Model Routing

Cloud-first config:

{
  "agents": {
    "defaults": {
      "model": {
        "primary": "openrouter/moonshotai/kimi-k2.5",
        "fallbacks": ["openrouter/minimax/m2.5", "google/gemini-3-flash"]
      },
      "heartbeat": {
        "model": "google/gemini-3-flash"
      }
    }
  }
}

Hybrid config (local + cloud fallback):

{
  "agents": {
    "defaults": {
      "model": {
        "primary": "ollama/qwen3-coder:32b",
        "fallbacks": ["openrouter/moonshotai/kimi-k2.5", "google/gemini-3-flash"]
      },
      "heartbeat": {
        "model": "ollama/llama3.2:8b"
      }
    }
  }
}

Ollama (Local Models) Integration

"Set OLLAMA_HOST=0.0.0.0, point OpenClaw to http://localhost:11434/v1. Per-task routing: use model overrides in agent config. LAN is more stable due to fewer port conflicts with the gateway." -- @LeoYe_AI

"Local models via MLX mean your agents run 24/7 at zero marginal cost. I use local models for the routine stuff (memory writes, status checks, simple Q&A) and only route to API models when the task actually needs frontier intelligence." -- @joshuaday

{
  "agents": {
    "defaults": {
      "model": {
        "primary": "anthropic/claude-sonnet-4-5",
        "fallbacks": ["ollama/qwen2.5-coder:32b"]
      },
      "heartbeat": {
        "model": "ollama/llama3.2:8b"
      }
    }
  }
}
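
Before wiring either config up, it's worth confirming the local endpoint actually works. A minimal check sequence, assuming Ollama is already installed (standard Ollama commands; pull whatever tags your config references):

export OLLAMA_HOST=0.0.0.0              # per @LeoYe_AI: let the gateway reach Ollama beyond localhost
ollama serve &                          # skip if Ollama already runs as a system service
ollama pull qwen2.5-coder:32b           # pull the exact model tags used in openclaw.json
curl http://localhost:11434/v1/models   # the OpenAI-compatible endpoint OpenClaw points at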

ClawRouter (Save 70%)

"ClawRouter v0.8.30 -- 2.4K stars in 11 days. Updated model routing: Auto, Eco, and Premium profiles." -- @bc1beat

"Saved 70% of my $4,660.87 Anthropic bills." -- @bc1beat

| Profile | Cost | Quality | Best For |
|---|---|---|---|
| Auto | Balanced | Balanced | Most users |
| Eco | Lowest | Good enough | Background tasks, crons |
| Premium | Highest | Best | Complex reasoning, coding |

Install

# Install ClawRouter
curl -fsSL https://raw.githubusercontent.com/bc1beat/clawrouter/main/install.sh | bash

# Or via npm
npm install -g clawrouter

How It Works

ClawRouter sits between OpenClaw and the LLM APIs, automatically routing requests to the cheapest model that can handle the task. It analyzes:
- Task complexity (simple query vs multi-step reasoning)
- Required capabilities (coding, math, creative writing)
- Token budget remaining
- Model availability and latency

Alternative: calculator.vlvt.sh

"calculator.vlvt.sh → optimize openclaw config for power & money spent (stop using the best model for a heartbeat, etc)" -- @lilpeenieweenie

Visit https://calculator.vlvt.sh to analyze your token usage and get optimization recommendations.


Token Optimization (Stop Burning Money)

The Real Cost Sources

"The real token drain isn't your questions. It's: large agents.md, verbose soul.md, too many skills, frequent tool calls, long sessions without clearing." -- @robustdragon (translated)

| Source | Tokens/Message | Fix |
|---|---|---|
| System prompt | ~5K | Can't reduce (core) |
| Workspace files | 2-15K | Keep each file under 50 lines |
| Tool schemas | ~8K | Disable unused tools |
| Loaded skills | 1-10K | Uninstall unused skills |
| Session history | 0-100K+ | Use /clear often |
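
For a sense of scale, here is some rough arithmetic using the mid-range figures from the table above and the per-token prices quoted elsewhere in this guide (Opus ~$15/M, Haiku ~$1/M). The 50-messages-a-day volume is an assumption for illustration, and output tokens plus growing session history are ignored:

  5K system + 8K tools + ~5K workspace + ~3K skills ≈ 21K fixed input tokens per message
  Opus:  21K × $15/M ≈ $0.32 per message
  Haiku: 21K × $1/M  ≈ $0.02 per message
  50 messages/day × 30 days = 1,500 messages/month
  → roughly $470/month on Opus vs ~$32/month on Haiku for the same fixed overhead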

Community Cost Tips

"Big chats = big cost. Use 1 group per topic. When done, start a new session. Write 3 lines of summary so you don't forget." -- @thecodeholic

"Build a custom skill to remember what you've gone through and reuse it when similar requests arise." -- @0xHeavybag

"Use OpenRouter so you can switch models without reconfiguring." -- @HappyGezim

"Route the cheap work locally and save the credits for things that actually need them." -- @meetahsen

Monthly Cost Benchmarks

| Usage | Model Strategy | Cost |
|---|---|---|
| Light (casual chat) | Sonnet only | $10-20/mo |
| Moderate (daily use) | Sonnet + Haiku heartbeat | $20-40/mo |
| Heavy (always-on + crons) | ClawRouter Auto | $30-60/mo |
| Power (multi-agent) | ClawRouter + local models | $40-80/mo |
| Unoptimized (Opus for everything) | Opus 4.6 default | $100-500+/mo |

Memory Architecture

"Overwhelmed with the memory options? Beads, Mem0, Supermemory, Mastra, MemOS, roll your own..." -- @aekae8888

The Simple Answer (For Most Users)

File-based memory in markdown. No vector DB needed until 1,000+ entries.

~/.openclaw/workspace/
├── soul.md              # READ-ONLY by agent (personality)
├── agents.md            # READ-ONLY by agent (rules)
├── user.md              # READ-ONLY by agent (about you)
├── memory/
│   ├── MEMORY.md        # Key facts, lessons, preferences
│   ├── action-log.md    # What the agent did (audit trail)
│   ├── research/        # Research results by topic
│   │   ├── 2026-02-14-market-analysis.md
│   │   └── latest-findings.md
│   └── daily/           # Daily notes
│       ├── 2026-02-14.md
│       └── 2026-02-13.md
├── personal/            # Your personal files (READ-ONLY)
└── business/            # Business context files

Two-Layer Memory System (Wes Roth)

  1. Daily notes — Raw logs of what happened today (memory/daily/)
  2. Long-term memory — Compressed/curated from daily logs into memory.md

The agent writes daily notes, then periodically compresses the best insights into long-term memory. Old daily notes get archived.
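
One way to phrase that compression step as a plain instruction (wording and the archive location are up to you):

"Go through memory/daily/ for the past week, pull out the 5-10 durable facts,
lessons, and preferences worth keeping, append them to memory/MEMORY.md, and
move the processed daily notes into an archive folder."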

Self-Improvement Protocol (David Ondrej)

"Every hour you put into improving your OpenClaw will pay dividends forever."

  1. Ask OpenClaw to research how to do X
  2. Tell it to save results as markdown in /docs
  3. Tell it to update relevant core files (tools.md, memory.md)
  4. Tell it to implement the changes itself
  5. Tell it to restart the gateway
  6. Improvement persists forever — compounds across all future sessions
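
Collapsed into a single message, the loop might read like this (phrasing is illustrative; swap in your own topic):

"Research current best practices for [topic], save the findings as markdown in
/docs, update tools.md and memory.md with anything that changes how you work,
implement the changes yourself, then restart the gateway."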

Key Rules

"memory/ is the solution. Tips: (1) your notes are read-only (2) AI output isolated to memory/ (3) write 'human files are edit-prohibited' in agents.md." -- @LeoYe_AI

"When arguing with your AI, write 'previous disagreement' to memory/ — next session picks up the context." -- @LeoYe_AI

  1. Human files = read-only — agent never modifies your personal notes
  2. AI output = memory/ — all agent learning goes here
  3. Research = dated files — memory/research/YYYY-MM-DD-topic.md
  4. Progress = files, not context — survives session resets

Hidden Memory Settings (Enable These!)

Two settings are OFF by default:

  1. Memory Flush — Saves key context before compaction (so it survives)
  2. Session Memory Search — Lets you search past conversations

Enable via:

"Enable memory flush so important context survives compaction"
"Enable session memory search so you can search past conversations"

When to Upgrade

| Scale | Memory System | Setup |
|---|---|---|
| <100 entries | File-based markdown | Just folders |
| 100-1,000 | File-based + good organization | Topic folders |
| 1,000+ | Mem0 (25K+ stars) | npm install mem0 |
| Complex relationships | Neo4j graph DB | Separate service |
| Semantic search | Chroma or Qdrant | Vector DB service |

Heartbeat Configuration

"Heartbeat + memory/ + soul.md = an AI that grows." -- @LeoYe_AI

"Add a heartbeat cron to ping WhatsApp every 30min so it doesn't go stale." -- @HappyGezim

What Heartbeat Does

"That's really what makes this kind of agentic. It pings every 30 minutes: 'Hey, aren't you supposed to be doing something?'" -- Wes Roth

The heartbeat file (heartbeat.md) tells OpenClaw what to check periodically. It runs on a configurable interval (default: 30 minutes, David Ondrej recommends 15 minutes for power users). If there's nothing to report, it silently returns "heartbeat ok." If action is needed, it messages you or takes action autonomously.

Production Heartbeat Template

# heartbeat.md — Periodic Checks

## Every Check (30 min default)

1. Check for new Telegram messages requiring response
2. Quick scan of email for urgent items (subject line only)
3. Update memory/daily/today.md with any new events

## Morning (7-8am)

- Generate daily briefing
- Check calendar for today's events
- Summarize overnight emails

## Evening (6-7pm)

- End-of-day summary
- Update task status
- Preview tomorrow's calendar

## Weekly (Sunday)

- Weekly review of goals progress
- Clean up memory/daily/ (archive old entries)
- Token usage review

Cost Control

{
  "agents": {
    "defaults": {
      "heartbeat": {
        "model": "anthropic/claude-haiku-4-5",
        "interval": 1800,
        "maxTokens": 500
      }
    }
  }
}

Critical: Always use the cheapest model for heartbeat. Haiku ($1/M) instead of Opus ($15/M) saves 93% on heartbeat costs alone.

Common Heartbeat Mistakes

| Mistake | Cost Impact | Fix |
|---|---|---|
| Using Opus for heartbeat | 15x overspend | Set heartbeat model to Haiku |
| Checking every 5 min | 6x overspend | 30 min is plenty for most |
| Too many checks per beat | Token bloat | Batch checks, limit to 3-5 |
| Not restricting hours | Overnight burn | Add time-of-day conditions |
| No token cap | Runaway costs | Set maxTokens: 500 |

Security Checklist (Non-Negotiable)

"Run openclaw security audit after EVERY config change -- checks 51 security items." -- Community consensus

The 3-Layer Security Model (Sonny Sangha)

"This is like creating a bunker that only you can see on the internet. OpenClaw lives inside this bunker, invisible to the entire internet."

| Layer | Defense | Tool | What It Does |
|---|---|---|---|
| 1 | Private VPN | Tailscale | Server invisible to internet — only your devices can reach it |
| 2 | Loopback-only | OpenClaw config | Gateway only accepts localhost connections |
| 3 | Brute force prevention | Fail2Ban | 3 failed SSH attempts = 24-hour IP ban |

AI-Assisted Hardening — Tell the bot itself to secure the server:

"Secure this server. Install Tailscale. Configure OpenClaw to only be
accessible through the private Tailscale VPN network. Deny every other
port inside of the server and only allow access via that Tailscale
network. Install and enable fail2ban."

The bot configures Tailscale, sets up Fail2Ban (3 attempts = 24hr ban), locks down firewall, and binds to localhost — all from one prompt.
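
If you would rather apply the Fail2Ban layer by hand instead of through the prompt above, here is a minimal sketch for a Debian/Ubuntu VPS (values mirror the 3-attempts / 24-hour policy described here):

sudo apt install -y fail2ban
sudo tee /etc/fail2ban/jail.local > /dev/null <<'EOF'
[sshd]
enabled  = true
maxretry = 3        # three failed SSH logins...
findtime = 600      # ...within 10 minutes...
bantime  = 86400    # ...earn a 24-hour IP ban
EOF
sudo systemctl enable --now fail2ban
sudo fail2ban-client status sshd    # confirm the jail is active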

Skill Scanning — Before installing any skill:

"Install this skill, but scan it for any malicious code or injection
prompt attacks because we don't want anything bad on our machine."

The agent audits the skill code, reports findings, and only installs if clean.

Before Going Live

# 1. Create dedicated user (NEVER run as root)
sudo adduser openclaw
sudo usermod -aG sudo openclaw

# 2. Firewall — deny everything, allow only SSH
sudo ufw default deny incoming
sudo ufw default deny outgoing
sudo ufw allow 22/tcp
sudo ufw allow out 443/tcp   # HTTPS for API calls
sudo ufw allow out 53/udp    # DNS (UDP is the primary transport)
sudo ufw allow out 53/tcp    # DNS over TCP (large responses / fallback)
sudo ufw enable

# 3. Bind gateway to localhost ONLY
# In openclaw.json: "gateway": { "host": "127.0.0.1" }

# 4. Set DM policy to pairing (require approval codes)
# In openclaw.json: "channels": { "dmPolicy": "pairing" }

# 5. Run security audit
openclaw security audit --deep

# 6. Set file permissions
chmod 600 ~/.openclaw/openclaw.json

# 7. Verify Node.js version (must be 22.12.0+)
node --version

API Key Security

"API keys ONLY in environment variables, never in agent config." -- Production consensus

# GOOD: Environment variables
export ANTHROPIC_API_KEY="sk-ant-..."

# BAD: In openclaw.json
# { "env": { "ANTHROPIC_API_KEY": "sk-ant-..." } }

Remote Access (4 Built-In Binding Modes)

OpenClaw has native Tailscale integration — no reverse proxy needed.

| Mode | Config | Who Can Access | Security |
|---|---|---|---|
| loopback | Default | Only localhost | Most secure |
| lan | gateway.bind: "lan" | Local network | Medium |
| tailnet | gateway.bind: "tailnet" | Your Tailscale VPN only | Secure + remote |
| public | gateway.bind: "public" | Anyone on internet | DANGEROUS |

# Option A: Tailscale native (RECOMMENDED — zero config HTTPS)
openclaw gateway start --tailscale serve
# → Auto-generates HTTPS URL only accessible on your Tailscale network

# Option B: Tailscale Funnel (public, for webhooks)
openclaw gateway start --tailscale funnel

# Option C: SSH tunnel (simple, no extra software)
ssh -L 18789:127.0.0.1:18789 user@your-vps
# Then open http://127.0.0.1:18789 on your laptop

# NEVER: expose 0.0.0.0 without auth

Skill Safety

"~7% of marketplace skills have problem code. Use clawhub search for official only. Run audit after install." -- Security research

# Before installing ANY skill:
# 1. Check if it's official
npx clawhub@latest search skill-name

# 2. Review source code
npx clawhub@latest inspect skill-name

# 3. After installing, re-run audit
openclaw security audit

The Calculator

"calculator.vlvt.sh → optimize openclaw config for power & money spent" -- @lilpeenieweenie

Use https://calculator.vlvt.sh to:
- Analyze your current token usage
- Find which files/skills consume the most tokens
- Get personalized optimization recommendations
- Compare model costs for your usage pattern


Community Tools Worth Installing

| Tool | Stars | What It Does | Install |
|---|---|---|---|
| ClawRouter | 2.4K | Auto model routing, saves 70% | npm install -g clawrouter |
| QMD | Popular | Search OpenClaw docs & skills from chat | Via ClawHub |
| CrabWalk | 768 | OpenClaw-native monitoring | Via ClawHub |
| Clawtick | New | Task/time tracking | Via ClawHub |
| PicoClaw | New | Lightweight OpenClaw (<10MB, 1s startup) | Separate install |
| Moltis | 150K lines | Rust rewrite, runs fully offline | Build from source |

Actions (Custom Commands)

"I can define in a config file actions e.g. 'build' and 'deploy'. Via Telegram or Slack I can call /ralph-action and my sandboxed agent executes them." -- @choas


Production openclaw.json

The production-ready config combining all community best practices:

{
  "gateway": {
    "host": "127.0.0.1",
    "port": 18789
  },
  "channels": {
    "dmPolicy": "pairing"
  },
  "agents": {
    "defaults": {
      "model": {
        "primary": "anthropic/claude-sonnet-4-5",
        "fallbacks": [
          "openrouter/moonshotai/kimi-k2.5",
          "openrouter/google/gemini-2.5-flash"
        ]
      },
      "heartbeat": {
        "model": "anthropic/claude-haiku-4-5",
        "interval": 1800,
        "maxTokens": 500
      },
      "compaction": {
        "reserveTokens": 40000,
        "memoryFlush": true
      }
    }
  },
  "security": {
    "sandbox": { "mode": "non-main" },
    "skillScanner": true
  }
}

Explanation

| Setting | Why |
|---|---|
| host: 127.0.0.1 | Never expose to public internet |
| dmPolicy: pairing | Require approval for new conversations |
| primary: sonnet | Best cost/quality for daily use |
| fallbacks: kimi, gemini | Free/cheap fallbacks if API down |
| heartbeat: haiku | 93% cheaper than Opus for checks |
| interval: 1800 | 30 minutes is sufficient |
| maxTokens: 500 | Prevent runaway heartbeat costs |
| reserveTokens: 40000 | Prevent context overflow |
| memoryFlush: true | Save context before compaction |
| sandbox: non-main | Sandbox tool execution in Docker |
| skillScanner: true | Auto-scan skills for malware |

Common Mistakes (From Real Users)

| Mistake | Who Said It | Fix |
|---|---|---|
| Let OpenClaw edit its own config files | @slinkyring | Never. Add rule: "Never execute changes to config files without my confirmation." |
| Using Opus for everything | @LeoYe_AI | Set default to Sonnet, Opus only on demand |
| Storing temp info in memory.md | @robustdragon | Only durable facts in memory.md |
| Too many skills loaded | @robustdragon | Uninstall unused skills (each burns tokens) |
| Not using /clear | @thecodeholic | New session per topic |
| Over-engineering before using | @VibeTradr | "Stop tweaking config files and start actually using your agent" |
| Using Codex to set up OpenClaw | @IAmTaylorWest | Actually works great: "Give Codex the IP + OpenClaw docs" |
| WhatsApp as primary channel | @HappyGezim | "Flakiest connector. Telegram way more reliable." |
| Not enabling memory settings | Alex Finn | Enable memory flush + session search immediately |
| Confusing learning with personality | @LeoYe_AI | soul.md is frozen, memory/ evolves |
| Skipping initial personality setup | Wes Roth | "I didn't do it in the beginning... then when I started doing it again, massive difference." Agent asks "Who am I?" — answer thoroughly. |
| Adding bot to group chats | Sonny Sangha | Everyone in the group gets full access to your server |
| Running on local machine | Sonny Sangha | Blast radius = entire system. VPS = only VPS contents at risk. |
| Using Brave for web search | David Ondrej | Perplexity Sonar Pro is "much better for agent web search" |
| Not saving web search results | David Ondrej | Every search → save as markdown. Never repeat research. |
| Using Opus for heartbeat | David Ondrej | "Wasteful." Use Haiku 4.5 or Gemini Flash — 15x cheaper |

Multi-Agent Architecture (25-Agent Example)

For production multi-agent setups, the key principles from clearmudai's 25-agent deployment:

Gateway Architecture:

Executive agents (COO, CTO, CMO, CRO) → Shared gateway (port 18789)
Public-facing agents (Community bot)   → Separate gateway (port 18790)

Why separate gateways:
- Public-facing agents get unpredictable traffic bursts
- Isolates community bot crashes from executive agents
- Different heartbeat intervals (community = frequent, executives = on-demand)
- Separate API key tracking for cost attribution

Multi-Agent openclaw.json pattern:

{
  "agents": {
    "defaults": {
      "model": { "primary": "anthropic/claude-sonnet-4-5" }
    },
    "overrides": {
      "coo": { "model": { "primary": "anthropic/claude-opus-4-6" }, "alwaysDelegates": true },
      "cto": { "model": { "primary": "anthropic/claude-opus-4-6" } },
      "community": {
        "model": { "primary": "google/gemini-3-flash" },
        "gateway": { "port": 18790 },
        "heartbeat": { "interval": 900 }
      }
    }
  }
}

Essential rules for multi-agent:
1. One gateway per traffic tier — Don't mix public and internal agents
2. Stagger heartbeats — 25 agents hitting the API simultaneously = rate limits (see the sketch below)
3. Cheapest model that works — Gemini Flash for community, Opus only for strategy
4. Each agent = own workspace — Own soul, tools, memory. Full isolation.
5. Voice standups optional but powerful — Microsoft open-source TTS, each agent gets personality-matched voice
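
A sketch of rule 2, reusing only the override keys already shown in the config above; the specific intervals are arbitrary, the point is that internal agents get different periods so their checks rarely fire at the same moment:

{
  "agents": {
    "overrides": {
      "coo":       { "heartbeat": { "interval": 3600 } },
      "cto":       { "heartbeat": { "interval": 4500 } },
      "community": { "heartbeat": { "interval": 900 } }
    }
  }
}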

Full 25-agent case study with org chart, model assignments, and architecture: Power User Guide — Multi-Agent


Community Setup Census (Feb 2026)

Real setups from Twitter/X — what people are actually running, how many bots, and what it costs.

By Scale

| Who | Agents | Hardware | Models | Cost/mo | Setup |
|---|---|---|---|---|---|
| @clearmudai (Marcelo) | 25 | Custom server | Opus 4.6, Codex 5.3, Sonnet 4.5, Gemini 3 Pro/Flash, Nano Banana Pro | Est. $200-500 | "Muddy OS" dashboard, 3 AI chiefs, voice standups |
| @learntouseai (Ramón) | 4-5+ | 4x Mac Mini M4 16GB + MacBook M4 24GB | Local open-source + $20/mo Codex | ~$20 | English learning web app, fully local 24/365 |
| @theLewisLu | 4 | Unknown | Unknown | Unknown | Self-reported poll |
| @Italianclownz (Carlo) | 5 | Unknown | Gemini 2.5 Flash (main), Perplexity (research), Gemini 3 Pro (coding), Llama 3.2 Vision, Mistral-Nemo | ~$20/mo (gifted sub) | Content + manga translation |
| @yellowsockzzz | Multiple | Unknown | Codex (coding), Sonnet (fast), mix | Unknown | Persona-based delegation |
| @david3443ai | 4 tasks | Mac Mini | Unknown | $0/mo (after hardware) | Email, calendar, tasks, code — 3 months untouched |
| @Patrick1Kennedy | Multi | M3 Ultra 512GB + planning 8x GB10 cluster | MiniMax M2.5 (local 8-bit), adding Kimi K2.5 | Hardware only | Frontier local inference |
| @davidschk | 2+ | 2x Hetzner VPS (agent + Supabase) | Unknown | ~$15-30 | Docker + Cloudflare Tunnel, zero exposed ports |
| @EXM7777 (Machina) | Multi | Cloud APIs | Opus 4.6 + Kimi K2.5 (writing), Codex + Opus (coding), MiniMax M2.5 (general) | Unknown | Business + content |
| @attaboiaj (AJ) | Multi | Cloud APIs | Opus 4.5 (main), Kimi K2 (writing), Codex 5.3 (coding) | Unknown | Standard power user |
| @SwarmDigitalUK | Multi | Existing PC | DeepSeek (daily), Claude (complex) | £5-15/mo | Model failover routing |

By Cost Tier

| Monthly Cost | What You Get | Models | Source |
|---|---|---|---|
| $0/mo | Mac Mini + local models, 24/7 | Ollama + open-source | @david3443ai |
| $5-15/mo | VPS + cheap APIs, basic automation | DeepSeek, Gemini Flash, model failover | @SwarmDigitalUK, @HappyGezim |
| $20-40/mo | VPS + Kimi K2.5/GLM-5, solid agent | Kimi K2.5 primary + cheap fallbacks | @HappyGezim (typical range) |
| $50-100/mo | Multi-agent, Claude API access | Opus for quality + Kimi/GLM for volume | @abhip_me (common question range) |
| $100-200/mo | Serious multi-agent fleet | Opus + Codex + routing | @ThePumponomics |
| $200-500/mo | Enterprise/25-agent scale | Full model fleet with routing | @clearmudai (estimated) |

VPS Provider Recommendations (Community-Sourced)

| Provider | Cost | Community Notes | Source |
|---|---|---|---|
| Hetzner | €4-7/mo | Most recommended. Supports persistent processes. EU-based. | @LeoYe_AI, @HappyGezim, @davidschk |
| Contabo | $5-15/mo | "OpenClaw Power User" tier available. SSH-first. | @DeRonin_ |
| DigitalOcean | $12/mo | Simple setup, good docs. | @JorgeCastilloPr |
| Hostinger | ~$4-8/mo | Tricky — some plans don't support persistent processes. Test first. | @LeoYe_AI warning |
| LobsterFarm | $?/mo | Managed OpenClaw hosting, all ports blocked by default, SSH-first. Can transfer Hetzner instance to your account. | @HappyGezim |
| Ampere.sh | Free + $500 credits | One-click OpenClaw deploy, browser automation built-in. | @Sheldon056 |
| Contabo (dedicated) | $15-30/mo | "OpenClaw Power User" plan with more RAM/CPU. | @DeRonin_ |

Security must-haves for VPS (from @ao2363's verified setup):
1. Dedicated non-root user for OpenClaw
2. Docker sandbox mode enabled
3. Tailscale VPN (all ports blocked to public internet)
4. openclaw security audit --fix (51 automated checks)
5. UFW firewall — only allow Tailscale subnet (see the commands below)
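
One way to express rule 5 with UFW, assuming Tailscale's default interface name (tailscale0) and its 100.64.0.0/10 address range; confirm both on your own host before applying:

sudo ufw default deny incoming
sudo ufw allow in on tailscale0                              # accept anything arriving over the Tailscale interface
sudo ufw allow from 100.64.0.0/10 to any port 22 proto tcp   # or scope SSH to the Tailscale range instead
sudo ufw enable
sudo ufw status verbose                                      # confirm nothing else is exposed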

What Hardware People Self-Host On

| Hardware | Cost | Use Case | Who |
|---|---|---|---|
| Mac Mini M4 16GB | $600 | Basic OpenClaw, API-only | Many users |
| Mac Mini M4 Pro 64GB | ~$2,000 | Local models + OpenClaw | Community standard |
| 4x Mac Mini M4 cluster | ~$2,400 | Full local AI fleet | @learntouseai |
| Mac Studio M3 Ultra 512GB | $10,000+ | Frontier local models (MiniMax M2.5 8-bit) | @Patrick1Kennedy |
| Beelink SER7 | $450 | Budget always-on server, 25W | Community tested |
| AMD EPYC server | $2-5K+ | Enterprise local, massive VRAM with GPUs | For serious self-hosting |
| Jetson Orin Nano | ~$250 | Ultra-low power (sub-15W), edge AI | @superactro |
| ESP32 | ~$10 | Microcontroller running OpenClaw (!) | @0xjoggie (experimental) |

YouTube Creator Setups (Video-Sourced)

| Creator | Video | Setup | Security | Key Technique |
|---|---|---|---|---|
| Sonny Sangha | Secure 24/7 Setup | $6/mo VPS (Hostinger KVM2) + Telegram | Tailscale + Loopback + Fail2Ban (3-layer) | AI-assisted hardening via single prompt |
| David Ondrej | 10x More Powerful | VPS (Hostinger) + WhatsApp | Standard | Living Files Theory, Perplexity over Brave, slash command model switching, self-improvement protocol |
| Wes Roth | Install in 30 Seconds | Local ($130 mini PC) or VPS (Hostinger KVM2) | Basic | 30-second install (curl \| bash), old hardware works, memory architecture walkthrough |

Common across all 3:
- All use/recommend Hostinger VPS ($6-7/mo KVM2)
- All use Telegram as primary channel
- All emphasize personality setup (soul.md, identity.md) matters more than people think
- All agree: VPS > local for 24/7 operation
- None use auto-update — all manual

David Ondrej's Agentic Company Vision:

"We're going to see people who as a single person can build multi-million dollar companies. By end of 2026, companies become 50% human, 50% agent."

Structure: Personal OpenClaw (knows your life) + Team OpenClaw (company context: docs, financials, goals). Personal OpenClaws query Team OpenClaw. OpenClaws message each other.

The One Rule from the Community

"Stop overthinking it. Hetzner VPS ($4-7/mo), onboard wizard, Telegram as interface. Done. Ignore the 'best setup' noise — best one is the one you actually use. Pick one task and build from there." -- @HappyGezim (LobsterFarm.ai)