ClawdBot Field Guide

ClawdBot Features: What Your AI Assistant Can Actually Do

A feature-by-feature breakdown of ClawdBot’s core capabilities: messaging channels, browser automation, memory, scheduling, and extensibility.

ClawdBot Field Guide is an independent, third‑party site that curates practical explanations from the included article set. This page is a topic hub built from one or more focused write-ups, so you can read end-to-end or jump directly to the subsection you need.

If you’re new, skim the table of contents first. If you’re evaluating an implementation or making a purchase decision, pay attention to the tradeoffs and check the references at the end of each subsection.

Below: 1 subsection that makes up “ClawdBot Features: What Your AI Assistant Can Actually Do”.

ClawdBot: Features and Capabilities

Executive Summary

ClawdBot is an open-source, self-hosted personal AI assistant that transforms how users interact with artificial intelligence. Unlike traditional cloud-based chatbots, ClawdBot runs entirely on your own hardware—whether a Mac, Windows machine, or Linux server—maintaining complete data privacy while providing unprecedented automation capabilities. Created by developer Peter Steinberger and released under the MIT license in late 2025, ClawdBot has garnered significant adoption in the developer and AI enthusiast communities, with over 19,000 GitHub stars and growing daily interest.

At its core, ClawdBot bridges powerful large language models (Claude, GPT-4, or local alternatives) with your existing messaging applications and personal infrastructure, enabling genuine automation rather than mere conversation. It can manage your email, control your browser, execute code, monitor systems, and proactively reach out to you—all while keeping your data completely private and under your control.

Architecture Overview

ClawdBot's power derives from its elegantly simple yet sophisticated four-component architecture:

The Gateway serves as the control plane and front door of the system. This WebSocket-based component manages all connections to messaging platforms (WhatsApp, Telegram, Discord, Slack, Signal, iMessage, Microsoft Teams, and others), handles scheduling for automated tasks, coordinates multi-agent routing, and serves the web-based dashboard for configuration and monitoring.

The Agent functions as the brain of the operation. This is where your chosen AI model lives—whether that's Anthropic's Claude Opus 4.5 (the recommended option for long-context tasks), OpenAI's GPT-4, Google Gemini, or even locally-hosted open models via Ollama or LM Studio. Each agent maintains its own workspace, session history, and persistent memory, allowing for complete isolation when running multiple agents.

Skills are the extensibility mechanism that transforms ClawdBot from a chatbot into an automation engine. These are specialized modules that teach your agent how to interact with external services and perform specific tasks. Built-in skills include web browsing, email integration, calendar management, file operations, and shell command execution. The ClawdHub marketplace hosts dozens of community-built skills, while users can create custom skills tailored to their unique workflows.

Memory distinguishes ClawdBot from ephemeral chatbots by implementing persistent, structured storage. Conversations, preferences, learned context, and task history are stored as Markdown files in your local workspace. This means your AI assistant actually remembers who you are, understands your preferences, and builds increasingly useful context over time—a crucial difference from stateless API-based chat interfaces.

Communication Channels

One of ClawdBot's distinctive strengths is its unified multi-platform messaging architecture. Rather than maintaining separate bot accounts across different services, ClawdBot consolidates communication into a single gateway that routes messages from multiple channels to your AI agent.

Supported platforms include WhatsApp (via Baileys integration), Telegram, Discord, Slack, Signal, iMessage, Microsoft Teams, Matrix, Mattermost (via plugin), and a WebChat interface accessible through the browser. This architecture enables a remarkable workflow: you can start a conversation on WhatsApp from your phone, continue it on Discord from your computer, and the agent maintains full context of both interactions.

Group chat support includes mention-gating (so the bot only responds when explicitly @mentioned), sender attribution, session isolation per participant, and view-once media handling. The system intelligently routes different WhatsApp accounts, Slack workspaces, or Discord guilds to different agents if desired, enabling power users to maintain separate assistant personas for different life domains.
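
Mention-gating can be pictured with a small check like the one below. This is only a sketch of the idea, not ClawdBot's actual logic: the function name `should_respond` and the exact matching rule (a case-insensitive `@name` that is not part of a longer word) are assumptions made for illustration.

```python
import re


def should_respond(text: str, bot_name: str, is_group: bool) -> bool:
    """Decide whether the bot should answer a message (illustrative rule only)."""
    if not is_group:
        # Direct messages always reach the agent.
        return True
    # In group chats, require an explicit @mention of the bot's name.
    # (?<!\w) rejects things like "mail@name"; \b rejects "@namesake".
    pattern = rf"(?<!\w)@{re.escape(bot_name)}\b"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None
```

With this rule, a group message such as "hey @clawd, summarize this thread" would be handled, while ordinary group chatter would be ignored.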

Core Capabilities

Persistent, Evolving Memory

Unlike ChatGPT or Claude's web interface, which restarts each conversation fresh, ClawdBot maintains continuous memory. Your agent stores conversation history, learns your preferences, understands your context, and references past interactions. Memory files live locally in simple Markdown format—SOUL.md contains the agent's persona and instructions, AGENTS.md defines agent metadata, HEARTBEAT.md specifies proactive tasks, and USER.md captures your preferences and context.

This persistent memory has profound implications. The agent doesn't need you to reintroduce yourself, repeat context, or re-explain your preferences. After weeks or months of interaction, it develops a nuanced understanding of your working style, priorities, and communication patterns. Users report that their ClawdBot assistants develop distinct personalities and become progressively more helpful over time.
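
One practical consequence of plain Markdown storage is that memory can be read or extended with ordinary file operations. The sketch below appends a dated note to USER.md in a workspace directory. The function name `remember`, the bullet format, and the append-only approach are assumptions for illustration; the source only states that memory lives in local Markdown files.

```python
from datetime import date
from pathlib import Path


def remember(workspace: Path, note: str, filename: str = "USER.md") -> Path:
    """Append a dated bullet to a Markdown memory file (hypothetical layout)."""
    workspace.mkdir(parents=True, exist_ok=True)
    target = workspace / filename
    line = f"- {date.today().isoformat()}: {note.strip()}\n"
    # Append mode creates the file on first use and never rewrites old notes.
    with target.open("a", encoding="utf-8") as handle:
        handle.write(line)
    return target
```

Because the result is an ordinary text file, it can be inspected in any editor and tracked with version control.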

Proactive Intelligence via Heartbeats

The heartbeat system is where ClawdBot diverges dramatically from reactive chatbots. Rather than waiting for you to type a message, ClawdBot's agent periodically (every 30 minutes by default, configurable) awakens and checks if anything requires attention. If you've specified priorities in HEARTBEAT.md, the agent reviews them, assesses the current state of your email, calendar, or monitored systems, and proactively sends you notifications.

Real-world examples include:

  • Morning Briefings: Weather, calendar, priority tasks, and health metrics delivered at 8 AM
  • Flight Monitoring: Automatically checking flight confirmation emails and checking you in 24 hours before departure
  • Health Tracking: Reviewing data from fitness devices and alerting you when patterns deviate from baselines
  • Inbox Triaging: Summarizing unread emails by priority and creating actionable tasks
  • System Monitoring: Alerting you if watched websites change, servers go down, or specific conditions occur

The heartbeat system is rate-conscious—if nothing requires attention, the agent can respond with a simple acknowledgment that doesn't result in a message to you, saving API costs. Users report this feature alone justifies the setup effort, as it shifts the assistant from a tool you must remember to consult into an actual member of your team.
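
The acknowledgment behavior described above can be sketched roughly as follows. The sentinel value `HEARTBEAT_OK`, the prompt wording, and the `ask_agent` callable are all assumptions for illustration, not details confirmed by the source; the point is only that an "all clear" reply is swallowed instead of being forwarded to the user.

```python
from typing import Callable, Optional

ACK_TOKEN = "HEARTBEAT_OK"  # assumed sentinel, not taken from the source


def run_heartbeat(ask_agent: Callable[[str], str], checklist: str) -> Optional[str]:
    """Run one heartbeat cycle; return a message to deliver, or None to stay silent."""
    prompt = (
        "Heartbeat check. Review the following priorities:\n"
        f"{checklist}\n"
        f"Reply with {ACK_TOKEN} if nothing needs attention."
    )
    reply = ask_agent(prompt).strip()
    if reply == ACK_TOKEN:
        # Nothing to report: suppress delivery so the user is not pinged.
        return None
    return reply
```

A scheduler would call `run_heartbeat` on each interval and only send a notification when the return value is not `None`.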

Full System Integration

ClawdBot can interact with your machine at a deep level. The exec tool allows the agent to run shell commands, execute scripts, and automate system-level operations. The read/write tools enable file manipulation, data extraction, and document generation. Combined with the browser control tool (which operates a headless Chrome instance), these capabilities enable workflows that blur the line between personal assistant and autonomous agent.

Examples of system integration include:

  • Browser Automation: Logging into websites, scraping data, filling out forms, and taking screenshots
  • File Organization: Batch renaming files, organizing downloads, extracting data from documents
  • Code Execution: Running Python scripts, executing npm commands, deploying applications
  • Terminal Operations: Running git commands, managing databases, monitoring logs
  • API Integration: Calling external APIs, processing responses, storing results

Sandboxing options help contain the risk that comes with this level of system access. For untrusted or group-chat scenarios, you can enable Docker-based sandboxing that isolates the agent's execution environment, limiting accidental or malicious damage to the host system.

Browser Automation

The browser control capability deserves special attention as it's particularly powerful. ClawdBot can operate a dedicated Chrome instance, enabling workflows that would otherwise require manual intervention or complex scripting.

Use cases include:

  • Competitive Research: Visiting competitor websites, analyzing UI/UX, extracting pricing and feature comparisons
  • E-Commerce Automation: Adding items to shopping carts, checking inventory, monitoring prices
  • Web Scraping: Extracting structured data from websites for analysis
  • Form Filling: Completing complex forms with multi-step validation
  • Testing Automation: Running through user flows to validate application functionality
  • Screenshot Capture: Taking screenshots for analysis, documentation, or monitoring

The agent sees the rendered page (not just HTML), understands layouts, and can interact with dynamic content loaded by JavaScript—a significant advantage over simple API-based scraping.

Voice Integration

ClawdBot includes sophisticated voice capabilities that extend beyond text. Voice Wake allows always-on listening (on macOS, iOS, and Android) using customizable wake words. Once triggered, Talk Mode enables continuous voice conversation where the agent listens, processes, responds, and speaks back using ElevenLabs text-to-speech technology.

The voice subsystem includes:

  • Global wake word management: Customize trigger phrases across all your devices
  • Natural speech processing: Whisper-based speech-to-text with language support
  • Streaming TTS playback: Real-time voice generation that users describe as natural and expressive
  • Interrupt handling: If you start speaking while the agent is talking, it stops and listens
  • Voice control directives: The agent can specify voice parameters (speaker, pace, tone) for specific responses
  • Phone call integration: Some users have configured their ClawdBot to make outbound calls for scheduling or notifications

Voice mode turns ClawdBot from a text-centric tool into a conversational assistant you can use while driving, cooking, or otherwise unable to type.

Skills and Extensions Ecosystem

Skills are the primary mechanism for extending ClawdBot's capabilities. The system follows the AgentSkills specification and supports three kinds of skills: bundled (included with ClawdBot), managed (downloaded from ClawdHub), and workspace (custom-built).

Built-in skills typically include:

  • Web browsing and search
  • Email and calendar management (Gmail, Google Calendar)
  • File operations and document handling
  • Shell command execution
  • Home Assistant and smart home integration
  • Notion, Todoist, and task management integrations
  • Code repository access (GitHub, GitLab)
  • Database connections
  • Weather and data APIs

Popular community skills available on ClawdHub cover domains like:

  • Finance: Trading analysis, expense tracking, invoice generation
  • Content Creation: Image generation (Gemini, DALL-E), video processing, markdown conversion
  • Development: Docker management, Jira automation, deployment pipelines
  • E-Commerce: Shopping automation, price monitoring, inventory tracking
  • Health & Fitness: Data aggregation from wearables, workout tracking, nutrition logging

Custom skill creation is remarkably accessible. Users describe building specialized skills in minutes simply by describing what they need to the agent—ClawdBot's self-improving nature allows it to write its own skill code, test it, and deploy it to the marketplace.

Skills are gated at load-time through metadata, allowing requirements specification (binary dependencies, environment variables, configuration parameters) and enabling the system to inform you when a skill is missing prerequisites.
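
A load-time gate of this kind might look like the sketch below. The `SkillRequirements` structure and its field names are hypothetical and do not reflect the real skill metadata format; the check itself uses only the standard library (`shutil.which` for binaries, the process environment for variables).

```python
import os
import shutil
from dataclasses import dataclass, field
from typing import Callable, List, Mapping, Optional


@dataclass
class SkillRequirements:
    """Hypothetical requirements block declared by a skill."""
    bins: List[str] = field(default_factory=list)  # executables expected on PATH
    env: List[str] = field(default_factory=list)   # environment variables expected


def missing_prerequisites(
    req: SkillRequirements,
    environ: Optional[Mapping[str, str]] = None,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return a list of unmet requirements; an empty list means the skill can load."""
    environ = os.environ if environ is None else environ
    missing = [f"binary:{name}" for name in req.bins if which(name) is None]
    # Treat empty values as missing, since an empty API key is rarely usable.
    missing += [f"env:{name}" for name in req.env if not environ.get(name)]
    return missing
```

The returned list is what a loader could surface to the user when it reports that a skill is missing prerequisites.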

Multi-Agent Architecture

For complex workflows or household/team scenarios, ClawdBot supports running multiple isolated agents within a single gateway. Each agent is a completely separate brain with its own workspace, persona, session history, and authentication credentials.

Multi-agent setup enables:

  • Different personas: A "work" agent for professional tasks, a "personal" agent for life management, and a "creative" agent for writing
  • Specialized models: Routing WhatsApp (fast queries) to a lightweight model while Telegram (deep work) uses Claude Opus
  • Team use: Multiple family members or coworkers maintaining separate agents on shared infrastructure
  • Hierarchical automation: Research agents spawning sub-agents to investigate specific questions in parallel
  • Sandboxed environments: Different trust levels and tool permissions for different agents

Routing is deterministic and most-specific-match wins, allowing fine-grained control over which channels or individuals connect to which agent.
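
To make "most-specific-match wins" concrete, here is a minimal sketch. The `Binding` fields (channel, account, peer), the scoring rule (count of fields that are pinned down), and the tie-break by list order are assumptions chosen for illustration; ClawdBot's real matching rules may differ.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Binding:
    """Hypothetical routing rule; None means "match anything"."""
    agent: str
    channel: Optional[str] = None
    account: Optional[str] = None
    peer: Optional[str] = None

    def matches(self, channel: str, account: str, peer: str) -> bool:
        pairs = ((self.channel, channel), (self.account, account), (self.peer, peer))
        return all(want is None or want == got for want, got in pairs)

    def specificity(self) -> int:
        # More pinned-down fields means a more specific rule.
        return sum(value is not None for value in (self.channel, self.account, self.peer))


def route(bindings: List[Binding], channel: str, account: str, peer: str,
          default: str = "main") -> str:
    """Pick the agent for a message; the same inputs always give the same agent."""
    candidates = [b for b in bindings if b.matches(channel, account, peer)]
    if not candidates:
        return default
    # max() keeps the first maximal element, so ties resolve by list order.
    return max(candidates, key=Binding.specificity).agent
```

Under these assumptions, a rule pinned to one contact beats a rule for the whole account, which in turn beats a channel-wide rule.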

Canvas and Visual Workspace

For complex tasks, ClawdBot provides a "Canvas"—a visual, interactive workspace that the agent can manipulate in real-time. Built on the A2UI (AI-to-UI) framework, Canvas allows the agent to:

  • Generate rich visual outputs (tables, charts, diagrams)
  • Create interactive forms for user input
  • Display progress on long-running tasks
  • Render code with syntax highlighting
  • Build UI elements without leaving the chat interface

Users can interact with Canvas outputs, seeing what the agent is working on, adjusting parameters, and providing real-time feedback.

Advanced Features and Configuration

Fine-Grained Security and Access Control

ClawdBot takes security seriously with multiple layers of control:

Sandboxing: The system supports multiple sandbox modes—off (full access), per-agent (one sandbox per agent), or shared (one sandbox for all agents). Docker-based sandboxing provides process isolation and filesystem constraints.

Tool restrictions: Every agent can have a custom allow-list and deny-list of tools. A "family" agent might only allow read and message-sending tools, while a "personal" agent has full system access. Tool restrictions can be configured globally or per-agent.
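
The allow-list/deny-list idea can be sketched as a small policy function. The precedence used here (deny always wins; a missing allow-list means "everything not denied") and the tool names are assumptions for illustration rather than documented ClawdBot behavior.

```python
from typing import Iterable, Optional


def tool_allowed(tool: str,
                 allow: Optional[Iterable[str]] = None,
                 deny: Iterable[str] = ()) -> bool:
    """Return True if the agent may call the tool under the given lists."""
    if tool in set(deny):
        # An explicit deny overrides any allow entry.
        return False
    if allow is None:
        # No allow-list configured: permit anything that is not denied.
        return True
    return tool in set(allow)


# Hypothetical policies mirroring the "family" and "personal" agents in the text.
FAMILY_ALLOW = {"read", "message"}
PERSONAL_DENY: set = set()
```

For example, `tool_allowed("exec", allow=FAMILY_ALLOW)` is false for the restricted agent, while `tool_allowed("exec", deny=PERSONAL_DENY)` is true for the unrestricted one.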

Gateway authentication: Supports token-based auth with optional Tailscale Serve/Funnel for secure remote access. The system can run behind SSH tunnels or with optional password protection.

Session isolation: Different users sharing a WhatsApp account can be routed to different agents, maintaining separate session contexts.

Pairing: Channel connections (WhatsApp, Telegram, etc.) require explicit pairing/approval, preventing unauthorized access.

Node Architecture for Distributed Operations

Beyond the gateway, ClawdBot supports pairing remote "nodes"—iOS, Android, or macOS devices that can contribute device-specific capabilities:

  • Camera operations: Taking photos or screen recordings on demand
  • Screen capture: Recording the device screen for documentation or automation
  • Location access: Sharing current location for context-aware tasks
  • Notifications: Sending system notifications with custom content
  • System integration: macOS-specific operations like AppleScript execution

Nodes connect via Bonjour/mDNS for local discovery or WebSocket for remote pairing, enabling sophisticated workflows like "take a screenshot of what I'm looking at" or "remind me when I leave this location."

Configuration and Customization

ClawdBot's behavior is highly customizable through configuration files (JSON5 format):

Agent personality is defined through workspace files:

  • SOUL.md: Core instructions and persona definition
  • AGENTS.md: Metadata and system prompts
  • USER.md: User context and preferences
  • HEARTBEAT.md: Proactive task definitions
  • TOOLS.md: Tool-specific instructions
  • IDENTITY.md: Agent identity and introduction
  • BOOTSTRAP.md: Initial setup instructions (auto-created)

Model configuration allows specifying different models for different agents, setting thinking defaults, adjusting timeout behaviors, and configuring model failover.

Session management covers reset triggers (/new, /reset), context compaction, session scope (per-sender vs. global), and idle timeouts.
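
The difference between per-sender and global scope comes down to how a session is keyed. The sketch below shows one possible keying scheme along with a check for the reset triggers named above; the key format and function names are assumptions for illustration.

```python
RESET_TRIGGERS = {"/new", "/reset"}  # trigger strings taken from the text above


def session_key(agent: str, channel: str, sender: str, scope: str = "per-sender") -> str:
    """Build a lookup key for conversation history (hypothetical format)."""
    if scope == "global":
        # Everyone talking to this agent shares one history.
        return f"{agent}:global"
    if scope == "per-sender":
        # Each sender on each channel gets an isolated history.
        return f"{agent}:{channel}:{sender}"
    raise ValueError(f"unknown session scope: {scope!r}")


def is_reset(message: str) -> bool:
    """Return True when the whole message is one of the reset triggers."""
    return message.strip().lower() in RESET_TRIGGERS
```

With per-sender scope, two people messaging the same agent never see each other's context; with global scope they share it.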

Heartbeat tuning allows adjusting frequency, enabling/disabling proactive notifications, and defining what triggers activity.

The system supports modular configuration with $include directives, allowing you to split configuration across multiple files for organization and version control.
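
How an include mechanism like this might resolve can be sketched with plain dictionaries. Everything about the semantics here is assumed for illustration: that `$include` appears at the top level, that included files merge in order, that the including file's own keys override included ones, and that nested objects merge recursively. The `load` callable stands in for reading and parsing a file.

```python
from typing import Any, Callable, Dict, Tuple

Config = Dict[str, Any]


def deep_merge(base: Config, override: Config) -> Config:
    """Merge two configs; nested dicts merge recursively, other values are replaced."""
    merged = dict(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged


def resolve_includes(config: Config, load: Callable[[str], Config],
                     _seen: Tuple[str, ...] = ()) -> Config:
    """Expand top-level $include entries (assumed semantics, see lead-in)."""
    includes = config.get("$include", [])
    if isinstance(includes, str):
        includes = [includes]
    merged: Config = {}
    for path in includes:
        if path in _seen:
            raise ValueError(f"circular $include: {path}")
        merged = deep_merge(merged, resolve_includes(load(path), load, _seen + (path,)))
    # The including file's own keys take precedence over included content.
    own = {k: v for k, v in config.items() if k != "$include"}
    return deep_merge(merged, own)
```

Splitting agents, channels, and secrets into separate files this way keeps each file small enough to review in version control.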

Logging, Monitoring, and Troubleshooting

ClawdBot includes comprehensive operational tooling:

  • Status command: clawdbot status shows local status, credentials, sessions, and queued events
  • Deep diagnostics: clawdbot status --deep adds gateway health probes for each channel
  • Health monitoring: clawdbot health --json provides JSON output suitable for monitoring systems
  • Log files: Detailed logs written to /tmp/clawdbot/ with date-stamped filenames
  • Control UI: Web-based dashboard for configuration, session browsing, and diagnostics
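
A monitoring system could wrap the health command along these lines. The sketch assumes only that `clawdbot health --json` prints JSON on success and that a non-zero exit code signals a problem; the second assumption is not confirmed by the source, and no particular JSON fields are assumed.

```python
import json
import subprocess
from typing import Any, Callable, Dict, Sequence, Tuple


def gateway_health(
    cmd: Sequence[str] = ("clawdbot", "health", "--json"),
    runner: Callable[..., Any] = subprocess.run,
) -> Tuple[bool, Dict[str, Any]]:
    """Run the health command and return (ok, parsed_payload_or_error)."""
    proc = runner(list(cmd), capture_output=True, text=True, timeout=30)
    if proc.returncode != 0:
        # Assumed convention: non-zero exit means the check failed.
        return False, {"error": proc.stderr.strip()}
    try:
        payload = json.loads(proc.stdout)
    except json.JSONDecodeError:
        return False, {"error": "output was not valid JSON"}
    if not isinstance(payload, dict):
        return False, {"error": "expected a JSON object"}
    return True, payload
```

The `runner` parameter exists so the function can be exercised with a stub instead of a live gateway.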

The gateway can be configured as a LaunchAgent (macOS) or systemd service (Linux) for automatic startup.

Real-World Use Cases

The flexibility of ClawdBot's architecture has inspired diverse automation workflows:

Morning Intelligence Briefings: One user configured their assistant to deliver daily 8 AM briefings including weather, calendar summary, priority tasks from Todoist, recent health metrics from a fitness tracker, and curated news. The briefing is delivered via Telegram and integrates data from Gmail, Google Calendar, health APIs, and RSS feeds.

Autonomous Project Management: A startup founder uses ClawdBot to manage project workflow—when team members send updates via WhatsApp, the agent summarizes them, creates or updates Jira tickets, identifies blockers, and sends daily summaries to Slack. The system operates 24/7 without human intervention.

Intelligent Email Triage: The agent monitors Gmail, categorizes incoming mail by priority, auto-responds to common queries, flags urgent messages for immediate attention, and moves routine subscriptions to folders. Users report reaching inbox zero for the first time in years.

E-Commerce Automation: Several users configured ClawdBot to handle shopping—taking a photo of a recipe, extracting ingredients, mapping them to a grocery store's inventory, adding items to the cart, and even completing checkout with saved credentials.

Browser-Based Task Automation: One user configured the system to check hotel availability, flight prices, insurance claim statuses, and competitor websites daily, and to send an alert when conditions change.

Code Review and Testing: Developers are using ClawdBot to autonomously run tests on pull requests, analyze failures, suggest fixes, and even generate and commit code changes—with the agent reaching out via Discord when human review is needed.

Smart Home Integration: Combining ClawdBot with Home Assistant enables sophisticated home automation—voice commands via Telegram or Slack trigger complex scenes, the agent learns usage patterns, and can autonomously optimize temperature, lighting, and security based on presence and preferences.

Team Coordination: Some teams run shared ClawdBot instances with multiple agents representing different roles (project manager, developer lead, designer liaison), each maintaining separate communication channels and handling routing automatically.

Installation and Infrastructure Requirements

ClawdBot requires minimal infrastructure. The gateway needs Node.js 22+ and can run on:

  • Local machines: macOS, Windows (via WSL2), or Linux laptops/desktops
  • Small servers: Raspberry Pi (tested and working)
  • Cloud VPS: As small as $5/month instances on DigitalOcean, Hetzner, or similar providers
  • NAS devices: Some users run it on Synology or similar NAS hardware

The one-command installation handles all dependencies:

curl -fsSL https://install.clawdbot.com | sh

Or via npm:

npm install -g clawdbot@latest

From there, the interactive wizard walks you through onboarding—connecting messaging platforms, selecting your AI model, configuring initial skills, and setting up heartbeat tasks.

Memory footprint is modest (typically 200-300MB for the gateway) and API costs depend entirely on your chosen LLM provider—with efficient prompting, power users report monthly API costs ranging from $10-50 depending on usage.
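
A back-of-the-envelope estimate shows how usage translates into API spend. The helper below is a sketch only: the token count per call and the price per million tokens are placeholders you would replace with your provider's actual figures, and it ignores any difference between input and output token pricing.

```python
def heartbeats_per_day(interval_minutes: int = 30) -> int:
    """Number of heartbeat wake-ups in 24 hours at a fixed interval."""
    return (24 * 60) // interval_minutes


def monthly_cost_usd(calls_per_day: int, tokens_per_call: int,
                     usd_per_million_tokens: float, days: int = 30) -> float:
    """Estimate monthly spend from call volume and a flat per-token price."""
    total_tokens = calls_per_day * tokens_per_call * days
    return total_tokens * usd_per_million_tokens / 1_000_000
```

At the default 30-minute interval there are 48 heartbeats per day, so even a modest per-call token count adds up over a month; lengthening the interval or keeping HEARTBEAT.md short reduces the baseline.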

Pricing and Licensing

ClawdBot is completely free under the MIT open-source license. There are no subscription fees, no cloud dependency charges, and no vendor lock-in.

You only pay for:

  1. Infrastructure (optional): If you run it on a cloud VPS instead of your own hardware, typically $5-15/month
  2. LLM API usage: Charged by Anthropic, OpenAI, or your chosen provider based on token consumption

An optional managed hosting service, ClawCloud Run, offers tiers starting at $0/month for those who want managed deployment, though self-hosting remains the default recommendation.

Comparison with Alternatives

vs. ChatGPT / Claude Web: ClawdBot offers persistent memory, proactive automation, system access, and complete data privacy. ChatGPT excels at pure reasoning but is stateless and cloud-dependent.

vs. Traditional AI Agent Platforms (LangChain, AutoGPT): ClawdBot provides better UX through messaging integration, is production-ready with comprehensive tooling, and includes built-in security features. It's more accessible for non-technical users while remaining powerful for developers.

vs. Virtual Assistant Services (VA companies): ClawdBot costs a fraction of human VAs, operates 24/7, and scales instantly. It lacks human judgment for sensitive decisions but excels at information processing and routine automation.

vs. Siri / Google Assistant: ClawdBot matches or exceeds native assistants in automation capability, maintains better privacy (runs locally), and includes deep system access. Native assistants integrate more seamlessly with the phone's operating system but are cloud-dependent and more limited in what they can automate.

Limitations and Considerations

While powerful, ClawdBot has important limitations:

Learning curve: Setting up multi-agent configurations, writing custom skills, and configuring security sandboxing all require technical knowledge. The interactive wizard helps, but power-user features demand comfort with the CLI and configuration files.

Model dependency: Quality depends on your chosen AI model. Local models are free but less capable; premium APIs introduce per-token costs.

Legal/liability: An agent sending emails or making decisions on your behalf creates potential liability. Most setups require explicit human review for sensitive actions.

Infrastructure responsibility: Unlike cloud services, you're responsible for uptime, security, backups, and updates.

Unproven at scale: While early users love it, long-term behaviors of persistent multi-agent systems remain to be fully explored.

Future Directions and Emerging Use Cases

The ClawdBot community is actively building on the platform:

  • Autonomous multi-agent systems: Teams of specialized agents collaborating to solve complex problems
  • MCP integration: Deeper integration with Model Context Protocol for richer tool definitions
  • Mobile-first workflows: iOS/Android apps that bring the full power to phones
  • Enterprise adoption: Organizations exploring on-premises deployments for sensitive workflows
  • Swarm intelligence: Multiple ClawdBot instances communicating and coordinating across teams

The project is moving quickly—new releases roughly every week with new features, bug fixes, and community-built skills constantly arriving.

Conclusion

ClawdBot represents a qualitative shift in how personal AI assistants can work. By running locally, maintaining persistent memory, enabling proactive automation, and providing deep system integration, it transforms AI from a conversational interface into a genuine member of your productivity team.

For developers, automation enthusiasts, and anyone comfortable with self-hosting, ClawdBot opens possibilities that centralized cloud assistants cannot touch. The combination of privacy, customization, cost efficiency, and power makes it compelling for both individual use and team deployment.

Whether you're seeking to automate routine tasks, build a smarter information dashboard, orchestrate complex workflows, or simply experience what truly personal AI looks like, ClawdBot provides the technical foundation and community ecosystem to make it happen. As the project matures and the community grows, it's increasingly clear why early adopters describe ClawdBot as the future of personal AI—a future that's available today, running on your own hardware, under your complete control.
