π¦ PicoClaw Roadmap
Vision: To build the ultimate lightweight, secure, and fully autonomous AI Agent infrastructure.automate the mundane, unleash your creativity
π 1. Core Optimization: Extreme Lightweightβ
Our defining characteristic. We fight software bloat to ensure PicoClaw runs smoothly on the smallest embedded devices.
- Memory Footprint Reduction
- Goal: Run smoothly on 64MB RAM embedded boards (e.g., low-end RISC-V SBCs) with the core process consuming < 20MB.
- Context: RAM is expensive and scarce on edge devices. Memory optimization takes precedence over storage size.
- Action: Analyze memory growth between releases, remove redundant dependencies, and optimize data structures.
π‘οΈ 2. Security Hardening: Defense in Depthβ
Paying off early technical debt. We invite security experts to help build a "Secure-by-Default" agent.
-
Input Defense & Permission Control
- Prompt Injection Defense: Harden JSON extraction logic to prevent LLM manipulation.
- Tool Abuse Prevention: Strict parameter validation to ensure generated commands stay within safe boundaries.
- SSRF Protection: Built-in blocklists for network tools to prevent accessing internal IPs (LAN/Metadata services).
-
Sandboxing & Isolation
- Filesystem Sandbox: Restrict file R/W operations to specific directories only.
- Context Isolation: Prevent data leakage between different user sessions or channels.
- Privacy Redaction: Auto-redact sensitive info (API Keys, PII) from logs and standard outputs.
-
Authentication & Secrets
- Crypto Upgrade: Adopt modern algorithms like
ChaCha20-Poly1305for secret storage. - OAuth 2.0 Flow: Deprecate hardcoded API keys in the CLI; move to secure OAuth flows.
- Crypto Upgrade: Adopt modern algorithms like
π 3. Connectivity: Protocol-First Architectureβ
Connect every model, reach every platform.
-
Provider
- Architecture Upgrade: Refactor from "Vendor-based" to "Protocol-based" classification (e.g., OpenAI-compatible, Ollama-compatible). (Status: In progress by @Daming, ETA 5 days)
- Local Models: Deep integration with Ollama, vLLM, LM Studio, and Mistral (local inference).
- Online Models: Continued support for frontier closed-source models.
-
Channel
- IM Matrix: QQ, WeChat (Work), DingTalk, Feishu (Lark), Telegram, Discord, WhatsApp, LINE, Slack, Email, KOOK, Signal, ...
- Standards: Support for the OneBot protocol.
- attachment: Native handling of images, audio, and video attachments.
-
Skill Marketplace
- Discovery skills: Implement
find_skillto automatically discover and install skills from the [GitHub Skills Repo] or other registries.
- Discovery skills: Implement
π§ 4. Advanced Capabilities: From Chatbot to Agentic AIβ
Beyond conversationβfocusing on action and collaboration.
-
Operations
- MCP Support: Native support for the Model Context Protocol (MCP).
- Browser Automation: Headless browser control via CDP (Chrome DevTools Protocol) or ActionBook.
- Mobile Operation: Android device control (similar to BotDrop).
-
Multi-Agent Collaboration
- Basic Multi-Agent implement
- Model Routing: "Smart Routing" β dispatch simple tasks to small/local models (fast/cheap) and complex tasks to SOTA models (smart).
- Swarm Mode: Collaboration between multiple PicoClaw instances on the same network.
- AIEOS: Exploring AI-Native Operating System interaction paradigms.
π 5. Developer Experience (DevEx) & Documentationβ
Lowering the barrier to entry so anyone can deploy in minutes.
-
QuickGuide (Zero-Config Start)
- Interactive CLI Wizard: If launched without config, automatically detect the environment and guide the user through Token/Network setup step-by-step.
-
Comprehensive Documentation
- Platform Guides: Dedicated guides for Windows, macOS, Linux, and Android.
- Step-by-Step Tutorials: "Babysitter-level" guides for configuring Providers and Channels.
- AI-Assisted Docs: Using AI to auto-generate API references and code comments (with human verification to prevent hallucinations).
π€ 6. Engineering: AI-Powered Open Sourceβ
Born from Vibe Coding, we continue to use AI to accelerate development.
- AI-Enhanced CI/CD
- Integrate AI for automated Code Review, Linting, and PR Labeling.
- Bot Noise Reduction: Optimize bot interactions to keep PR timelines clean.
- Issue Triage: AI agents to analyze incoming issues and suggest preliminary fixes.
π¨ 7. Brand & Communityβ
- Logo Design: We are looking for a Mantis Shrimp (Stomatopoda) logo design!
- Concept: Needs to reflect "Small but Mighty" and "Lightning Fast Strikes."
π€ Call for Contributionsβ
We welcome community contributions to any item on this roadmap! Please comment on the relevant Issue or submit a PR. Let's build the best Edge AI Agent together!