zeroclaw

Author	SHA1	Message	Date
李龙 0668001470	d23abdb92d	test(config): centralize backward-compat fixtures	2026-03-24 15:29:03 +03:00
李龙 0668001470	a2fdf64a5e	fix(security): block runtime config state edits	2026-03-24 15:29:02 +03:00
Alix-007	d43ce63b4d	feat(skills): add read_skill for compact mode	2026-03-24 15:29:02 +03:00
Alix-007	40e2aaa318	Fix /new regression test lint scope	2026-03-24 15:29:02 +03:00
Alix-007	f9c7d18caf	Refresh skills after new channel sessions	2026-03-24 15:29:02 +03:00
Alix-007	68f4fbd550	fix(tools): normalize workspace-prefixed paths	2026-03-24 15:29:02 +03:00
Alix-007	b19b6822db	style(zai): satisfy rustfmt in tool_stream request	2026-03-24 15:29:02 +03:00
Alix-007	bf4c4d81a0	fix(zai): send tool_stream for tool-capable requests	2026-03-24 15:29:01 +03:00
Alix-007	8275649517	test(claude_code): isolate echo script per test run	2026-03-24 15:29:01 +03:00
李龙 0668001470	7283f3c6fe	test(config): move initialized log regression away from merge hotspot	2026-03-24 15:28:49 +03:00
李龙 0668001470	41f8773a3c	fix(config): avoid clippy used_underscore_binding	2026-03-24 15:28:48 +03:00
Alix-007	4544947a15	fix(config): log existing config as initialized	2026-03-24 15:28:47 +03:00
Argenis	296fff7af9	fix(config): enable compact_context by default (#3995 ) * fix: change compact_context default to true Local LLMs with limited context windows immediately run out of context when compact_context defaults to false. The system prompt alone can consume 25K+ tokens, exceeding even 55K context windows with history. Setting compact_context=true by default limits system prompt injection to 6000 chars and RAG results to 2 chunks, making the agent usable with smaller models out of the box. Fixes #3987 * docs: update compact_context default to true in config reference Update all locale variants (en, zh-CN, vi) to reflect the new default. * test: update tests to expect compact_context default of true Update assertions in schema.rs unit tests and config_persistence.rs component tests to match the new default value.	2026-03-24 15:27:55 +03:00
Argenis	c41009d29f	fix(cron): persist allowed_tools for agent jobs (#3993 ) Persist allowed_tools in cron_jobs table, threading it through CLI add/update and cron_add/cron_update tool APIs. Add regression coverage for store, tool, and CLI roundtrip paths. Fixups over original PR #3929: add allowed_tools to all_overdue_jobs SELECT (merge gap), resolve merge conflicts. Closes #3920 Supersedes #3929	2026-03-24 15:26:29 +03:00
Alix-007	b6bc332b68	feat(slack): add thread_replies channel option (#3930 ) Add a thread_replies option to Slack channel config (default true). When false, replies go to channel root instead of the originating thread. Closes #3888	2026-03-24 15:26:28 +03:00
Argenis	53802d6d04	fix(anthropic): always apply cache_control to system prompts (#3990 ) * fix: always use Blocks format for system prompts with cache_control System prompts under 3KB were wrapped in SystemPrompt::String which cannot carry cache_control headers, resulting in 0% cache hit rate on Haiku 4.5. Always use SystemPrompt::Blocks with ephemeral cache_control regardless of prompt size. Fixes #3977 * fix: lower conversation caching threshold from >4 to >1 messages The previous threshold of >4 non-system messages was too restrictive, delaying cache benefits until 5+ turns. Lower to >1 so caching kicks in after the first user+assistant exchange. Fixes #3977 * test: update anthropic cache tests for new thresholds and Blocks format - convert_messages_small_system_prompt now expects Blocks with cache_control instead of String variant - should_cache_conversation tests updated for >1 threshold - backward_compatibility test replaced with blocks-system test	2026-03-24 15:26:28 +03:00
Argenis	1ff411d8f3	fix(security): wire sandbox into shell command execution (#3989 ) * fix: add sandbox field to ShellTool struct Add `sandbox: Arc<dyn Sandbox>` field to `ShellTool` and a `new_with_sandbox()` constructor so callers can inject the configured sandbox backend. The existing `new()` constructor defaults to `NoopSandbox` for backward compatibility. Ref: #3983 * fix: apply sandbox wrapping in ShellTool::execute() Call `self.sandbox.wrap_command()` on the underlying std::process::Command (via `as_std_mut()`) after building the shell command and before clearing the environment. This ensures every shell command passes through the configured sandbox backend before execution. Ref: #3983 * fix: wire up sandbox creation at ShellTool callsites In `all_tools_with_runtime()`, create a sandbox from `root_config.security` via `create_sandbox()` and pass it to `ShellTool::new_with_sandbox()`. The `default_tools_with_runtime()` path retains `ShellTool::new()` which defaults to `NoopSandbox`. Ref: #3983 * test: add sandbox integration tests for ShellTool Verify that ShellTool can be constructed with a sandbox via `new_with_sandbox()`, that NoopSandbox leaves commands unmodified, and that command execution works end-to-end with a sandbox attached. Ref: #3983	2026-03-24 15:26:28 +03:00
Alix-007	6f08b15d8b	fix(cron): default channel delivery to active reply target	2026-03-24 15:26:28 +03:00
Alix-007	ffa1ee4d3b	fix(onboard): warn when Homebrew service uses another workspace	2026-03-24 15:26:28 +03:00
Alix-007	2ca9ca3285	fix(skills): narrow shell shebang detection (#3944 ) Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-24 15:26:27 +03:00
Giulio V	49d68e55f2	fix(cron): add startup catch-up and drop login shell flag (#3948 ) * fix(cron): add startup catch-up and drop login shell flag Problems: 1. When ZeroClaw started after downtime (late boot, daemon restart), overdue jobs were picked up via `due_jobs()` but limited by `max_tasks` per poll cycle — with many overdue jobs, catch-up could take many cycles. 2. Cron shell jobs used `sh -lc` (login shell), which loads the full user profile on every execution — slow and may cause unexpected side effects. Fixes: - Add `all_overdue_jobs()` store query without `max_tasks` limit - Add `catch_up_overdue_jobs()` startup phase that runs ALL overdue jobs once before entering the normal polling loop - Extract `build_cron_shell_command()` helper using `sh -c` (non-login) - Add structured tracing for catch-up progress - Add tests for all new functions * feat(cron): make catch-up configurable via API and control panel Add `catch_up_on_startup` boolean to `[cron]` config (default: true). When enabled, the scheduler runs all overdue jobs at startup before entering the normal polling loop. Users can toggle this from: - The Cron page toggle switch in the control panel - PATCH /api/cron/settings { "catch_up_on_startup": false } - The `[cron]` section of the TOML config editor Also adds GET /api/cron/settings endpoint to read cron subsystem settings without parsing the full config. * fix(config): add catch_up_on_startup to CronConfig test constructors The CI Lint job failed because the `cron_config_serde_roundtrip` test constructs CronConfig directly and was missing the new field.	2026-03-24 15:26:27 +03:00
Argenis	8db392de7c	fix(providers): exempt tool schema errors from non-retryable classification (#3978 ) * fix: exempt tool schema validation errors from non-retryable classification Groq returns 400 "tool call validation failed" which was classified as non-retryable by is_non_retryable(), preventing the provider-level fallback in compatible.rs from executing. Add is_tool_schema_error() to detect these errors and return false from is_non_retryable(), allowing the retry loop to pass control back to the provider's built-in fallback. Fixes #3757 * test: add unit tests for tool schema error detection in reliable.rs Verify is_tool_schema_error detects Groq-style validation failures and that is_non_retryable returns false for tool schema 400s while still returning true for other 400 errors like invalid API key. * fix: escape format braces in test string literals for cargo check The anyhow::anyhow! macro interprets curly braces as format placeholders. Use explicit format argument to pass JSON-containing strings in tests.	2026-03-24 15:26:26 +03:00
argenis de la rosa	64efb72605	fix(agent): enforce autonomy level in gateway and channel paths (#3952 ) - Channel tool filtering (`non_cli_excluded_tools`) now respects `autonomy.level = "full"` — full-autonomy agents keep all tools available regardless of channel. - Gateway `process_message` now creates and passes an `ApprovalManager` to `agent_turn`, so `ReadOnly`/`Supervised` policies are enforced instead of silently skipped. - Gateway also applies `non_cli_excluded_tools` filtering with the same full-autonomy bypass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:26:26 +03:00
argenis de la rosa	bbc470fe07	fix(config): warn when conversational_ai.enabled is set (#3958 ) The conversational_ai config section is parsed but not yet consumed by any runtime code. Emit a startup warning so users know the setting is ignored, and update the doc comment to mark it as reserved for future use. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:26:26 +03:00
Argenis	8ee3e71e90	fix(openrouter): respect provider_timeout_secs and improve error messages (#3973 ) * fix(openrouter): wire provider_timeout_secs through factory Apply the configured provider_timeout_secs to OpenRouterProvider in the provider factory, matching the pattern used for compatible providers. * fix(openrouter): add timeout_secs field to OpenRouterProvider Add a configurable timeout_secs field (default 120s) and a with_timeout_secs() builder method so the HTTP client timeout can be overridden via provider config instead of being hardcoded. * refactor(openrouter): improve response decode error messages Read the response body as text first, then parse with serde_json::from_str so that decode failures include a truncated snippet of the raw body for easier debugging. * test(openrouter): add timeout_secs configuration tests Verify that the default timeout is 120s and that with_timeout_secs correctly overrides it. * style: run rustfmt on openrouter.rs	2026-03-24 15:26:24 +03:00
Alix-007	8f241cd4b7	fix(openrouter): respect provider timeout config	2026-03-24 15:17:35 +03:00
Roman Tataurov	f01ec415a5	Fix models refresh	2026-03-24 15:17:35 +03:00
Argenis	96e2a324d1	fix: make channel system prompt respect autonomy.level = full (#3952 ) (#3970 ) When autonomy.level is set to "full", the channel/web system prompt no longer includes instructions telling the model to ask for permission before executing tools. Previously these safety lines were hardcoded regardless of autonomy config, causing the LLM to simulate approval dialogs in channel and web-interface modes even though the ApprovalManager correctly allowed execution. The fix adds an autonomy_level parameter to build_system_prompt_with_mode and conditionally omits the "ask before acting" instructions when the level is Full. Core safety rules (no data exfiltration, prefer trash) are always included.	2026-03-24 15:17:35 +03:00
Argenis	8856c5fa95	fix: omit experimental conversational_ai section from default config (#3969 ) The [conversational_ai] config section was serialized into every freshly-generated config.toml despite the feature being experimental and not yet wired into the agent runtime. This confused new users who found an undocumented section in their config. Add skip_serializing_if = "ConversationalAiConfig::is_disabled" so the section is only written when a user has explicitly enabled it. Existing configs that already contain the section continue to deserialize correctly via #[serde(default)]. Fixes #3958	2026-03-24 15:17:34 +03:00
Argenis	ae98c91596	feat(heartbeat): default interval 30→5min + prune heartbeat from auto-save (#3938 ) Lower the default heartbeat interval to 5 minutes to match the renewable partial wake-lock cadence. Add `[heartbeat task` to the memory auto-save skip filter so heartbeat prompts (both Phase 1 decision and Phase 2 task execution) do not pollute persistent conversation memory. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:34 +03:00
Argenis	84e4adbcd2	fix(tools): use resolve_tool_path for consistent path resolution (#3937 ) Replace workspace_dir.join(path) with resolve_tool_path(path) in file_write, file_edit, and pdf_read tools to correctly handle absolute paths within the workspace directory, preventing path doubling. Closes #3774	2026-03-24 15:17:34 +03:00
Argenis	7a65850d45	fix(config): add missing challenge_max_attempts field to OtpConfig (#3919 ) (#3936 ) The OtpConfig struct uses deny_unknown_fields but was missing the challenge_max_attempts field, causing zeroclaw config schema to fail with a TOML parse error when the field appeared in config files. Add challenge_max_attempts as an Option<u32>-style field with a default of 3 and a validation check ensuring it is greater than 0.	2026-03-24 15:17:34 +03:00
argenis de la rosa	9361f7e7eb	fix(gateway): move pairing code below dashboard URL in terminal banner Repositions the one-time pairing code display to appear directly below the dashboard URL for cleaner terminal output, and removes the duplicate display that was showing at the bottom of the route list. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:34 +03:00
Argenis	121e8b86a2	feat(skills): autonomous skill creation from multi-step tasks (#3916 ) Add SkillCreator module that persists successful multi-step task executions as reusable SKILL.toml definitions under the workspace skills directory. - SkillCreationConfig in [skills.skill_creation] (disabled by default) - Slug validation, TOML generation, embedding-based deduplication - LRU eviction when max_skills limit is reached - Agent loop integration post-success - Gated behind `skill-creation` compile-time feature flag Closes #3825.	2026-03-24 15:17:34 +03:00
Argenis	ba23480e01	fix: ensure SOUL.md and IDENTITY.md exist in non-tty sessions (#3915 ) When the workspace is created outside of `zeroclaw onboard` (e.g., via cron, daemon, or `< /dev/null`), SOUL.md and IDENTITY.md were never scaffolded, causing the agent to activate without identity files. Added `ensure_bootstrap_files()` in `Config::load_or_init()` that idempotently creates default SOUL.md and IDENTITY.md if missing. Closes #3819.	2026-03-24 15:17:33 +03:00
Argenis	cc463c0df4	feat(delegate): make sub-agent timeouts configurable via config.toml (#3909 ) Add `timeout_secs` and `agentic_timeout_secs` fields to `DelegateAgentConfig` so users can tune per-agent timeouts instead of relying on the hardcoded 120s / 300s defaults. Validation rejects values of 0 or above 3600s, matching the pattern used by MCP timeout validation. Closes #3898	2026-03-24 15:17:33 +03:00
Argenis	81d99f513c	feat(i18n): externalize tool descriptions for translation (#3912 ) Add a locale-aware tool description system that loads translations from TOML files in tool_descriptions/. This enables non-English users to see tool descriptions in their language. - Add src/i18n.rs module with ToolDescriptions loader, locale detection (ZEROCLAW_LOCALE, LANG, LC_ALL env vars), and English fallback chain - Add locale config field to Config struct for explicit locale override - Create tool_descriptions/en.toml with all 47 tool descriptions - Create tool_descriptions/zh-CN.toml with Chinese translations - Integrate with ToolsSection::build() and build_tool_instructions() to resolve descriptions from locale files before hardcoded fallback - Add PromptContext.tool_descriptions field for prompt-time resolution - Add AgentBuilder.tool_descriptions() setter for Agent construction - Include tool_descriptions/ in Cargo.toml package include list - Add 8 unit tests covering locale loading, fallback chains, env detection, and config override Closes #3901	2026-03-24 15:17:33 +03:00
argenis de la rosa	3430f9bf1a	fix(test): use PID-scoped script path to prevent ETXTBSY in CI The echo_provider() test helper writes a fake_claude.sh script to a shared temp directory. When lib and bin test binaries run in parallel (separate processes, separate OnceLock statics), one process can overwrite the script while the other is executing it, causing "Text file busy" (ETXTBSY). Scope the filename with PID to isolate each test process. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:33 +03:00
Argenis	23471f7357	fix: reset tool call dedup cache each iteration to prevent loops (#3910 ) The seen_tool_signatures HashSet was initialized outside the iteration loop, causing cross-iteration deduplication of legitimate tool calls. This triggered a self-correction spiral where the agent repeatedly attempted skipped calls until hitting max_iterations. Moving the HashSet inside the loop ensures deduplication only applies within a single iteration, as originally intended. Fixes #3798	2026-03-24 15:17:33 +03:00
Argenis	c6f94fda4f	fix(channels): respect ack_reactions config for Telegram channel (#3834 ) (#3913 ) The Telegram channel was ignoring the ack_reactions setting because it sent setMessageReaction API calls directly in its polling loop, bypassing the top-level channels_config.ack_reactions check. - Add optional ack_reactions field to TelegramConfig so it can be set under [channels_config.telegram] without "unknown key" warnings - Add ack_reactions field and with_ack_reactions() builder to TelegramChannel, defaulting to true - Guard try_add_ack_reaction_nonblocking() behind self.ack_reactions - Wire channel-level override with fallback to top-level default - Add config deserialization and channel behavior tests	2026-03-24 15:17:32 +03:00
Argenis	e556ad3d3e	fix: handle double-serialized schedule in cron_add and cron_update (#3860 ) (#3905 ) When LLMs pass the schedule parameter as a JSON string instead of a JSON object, serde fails with "invalid type: string, expected internally tagged enum Schedule". Add a deserialize_maybe_stringified helper that detects stringified JSON values and parses the inner string before deserializing, providing backward compatibility for both object and string representations. Fixes #3860	2026-03-24 15:17:32 +03:00
Argenis	ba7d371df4	fix: enable vision support for llamacpp provider (#3907 ) The llamacpp provider was instantiated with vision disabled by default, causing image transfers from Telegram to fail. Use new_with_vision() with vision enabled, matching the behavior of other compatible providers. Fixes #3802	2026-03-24 15:17:32 +03:00
Argenis	f44c3515d1	fix(tools): include tool_search instruction in deferred tools system prompt (#3826 ) (#3914 ) The deferred MCP tools section in the system prompt only listed tool names inside <available-deferred-tools> tags without any instruction telling the LLM to call tool_search to activate them. In daemon and Telegram mode, where conversations are shorter and less guided, the LLM never discovered it should call tool_search, so deferred tools were effectively unavailable. Add a "## Deferred Tools" heading with explicit instructions that the LLM MUST call tool_search before using any listed tool. This ensures the LLM knows to activate deferred tools in all modes (CLI, daemon, Telegram) consistently. Also add tests covering: - Instruction presence in the deferred section - Multiple-server deferred tool search - Cross-server keyword search ranking - Activation persistence across multiple tool_search calls - Idempotent re-activation	2026-03-24 15:17:32 +03:00
Argenis	031008ae31	fix(providers): recover from context window errors by truncating history (#3908 ) When a provider returns a context-size-exceeded error, truncate the oldest non-system messages from conversation history and retry instead of immediately bailing out. This enables local models with small context windows (llamafile, llama.cpp) to work by automatically fitting the conversation within available context. Closes #3894	2026-03-24 15:17:32 +03:00
Vasanth	6d77f48ee5	feat(agent): add runtime model switching via model_switch tool (#3853 ) Add support for switching AI models at runtime during a conversation. The model_switch tool allows users to: - Get current model state - List available providers - List models for a provider - Switch to a different model The switch takes effect immediately for the current conversation by recreating the provider with the new model after tool execution. Risk: Medium - internal state changes and provider recreation	2026-03-24 15:17:31 +03:00
Argenis	dab6edfc7c	fix(providers): preserve conversation context in Claude Code CLI (#3885 ) * fix(providers): preserve conversation context in Claude Code CLI provider Override chat_with_history to format full multi-turn conversation history into a single prompt for the claude CLI, instead of only forwarding the last user message. Closes #3878 * fix(providers): fix ETXTBSY race in claude_code tests Use OnceLock to initialize the fake_claude.sh test script exactly once, preventing "Text file busy" errors when parallel tests concurrently write and execute the same script file.	2026-03-24 15:17:31 +03:00
Argenis	88693dda59	fix(cron): prevent one-shot jobs from re-executing indefinitely (#3886 ) Handle Schedule::At jobs in reschedule_after_run by disabling them instead of rescheduling to a past timestamp. Also add a fallback in persist_job_result to disable one-shot jobs if removal fails. Closes #3868	2026-03-24 15:17:31 +03:00
Argenis	b09baba8c8	fix: pass route-specific api_key through channel provider creation (#3881 ) When using Channel mode with dynamic classification and routing, the route-specific `api_key` from `[[model_routes]]` was silently dropped. The system always fell back to the global `api_key`, causing 401 errors when routing to `custom:` providers that require distinct credentials. Root cause: `ChannelRouteSelection` only stored provider + model, and `get_or_create_provider` always used `ctx.api_key` (the global key). Changes: - Add `api_key` field to `ChannelRouteSelection` so the matched route's credential survives through to provider creation. - Update `get_or_create_provider` to accept and prefer a route-specific `api_key` over the global key. - Use a composite cache key (provider name + api_key hash) to prevent cache poisoning when multiple routes target the same provider with different credentials. - Wire the route api_key through query classification matching and the `/model` (SetModel) command path. Fixes #3838	2026-03-24 15:17:31 +03:00
argenis de la rosa	7f40746988	fix(plugins): update lockfile and fix ws.rs formatting Sync Cargo.lock with new Extism/WASM plugin dependencies and apply rustfmt line-wrap fix in gateway WebSocket handler. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:30 +03:00
argenis de la rosa	2aba569366	fix(plugins): integrate WASM tools into registry, add gateway routes and tests - Wire WASM plugin tools into all_tools_with_runtime() behind cfg(feature = "plugins-wasm"), discovering and registering tool-capable plugins from the configured plugins directory at startup. - Add /api/plugins gateway endpoint (cfg-gated) for listing plugin status. - Add mod plugins declaration to main.rs binary crate so crate::plugins resolves when the feature is enabled. - Add unit tests for PluginHost: empty dir, manifest discovery, capability filtering, lookup, and removal. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:30 +03:00
argenis de la rosa	7ce5421d12	feat(plugins): add PluginHost, WasmTool, and WasmChannel bridges Implement the core plugin infrastructure: - PluginHost: discovers plugins from the workspace plugins directory, loads manifest.toml files, supports install/remove/list/info operations - WasmTool: bridges WASM plugins to the Tool trait (execute stub pending Extism runtime wiring) - WasmChannel: bridges WASM plugins to the Channel trait (send/listen stubs pending Extism runtime wiring) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:29 +03:00
argenis de la rosa	8aa3ac704d	feat(plugins): add Extism dependency, feature flag, and plugin module skeleton Introduce the WASM plugin system foundation: - Add extism 1.9 as an optional dependency behind `plugins-wasm` feature - Create `src/plugins/` module with manifest types, error types, and stub host - Add `Plugin` CLI subcommands (list, install, remove, info) behind cfg gate - Add `PluginsConfig` to the config schema with sensible defaults All plugin code is behind `#[cfg(feature = "plugins-wasm")]` so the default build is unaffected. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:29 +03:00
argenis de la rosa	4e31d1dd3a	fix(pairing): add SQLite persistence, fix config defaults, align with plan - Add SQLite persistence to DeviceRegistry (backed by rusqlite) - Rename config fields: ttl_secs -> code_ttl_secs, max_pending -> max_pending_codes, max_attempts -> max_failed_attempts - Update defaults: code_length 6 -> 8, ttl_secs 300 -> 3600, max_pending 10 -> 3 - Add attempts tracking to PendingPairing struct - Add token_hash() and authenticate_and_hash() to PairingGuard - Fix route paths: /api/pairing/submit -> /api/pair, /api/devices/{id}/rotate -> /api/devices/{id}/token/rotate - Add QR code placeholder to Pairing.tsx - Pass workspace_dir to DeviceRegistry constructor Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:29 +03:00
argenis de la rosa	8df14402d2	fix(gateway): add new fields to test AppState and GatewayConfig constructors Add device_registry, pending_pairings to test AppState instances and pairing_dashboard to test GatewayConfig to fix compilation of tests after the new pairing dashboard fields were introduced. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:29 +03:00
argenis de la rosa	5b1be9615b	feat(gateway): extend WebSocket handshake with optional connect params Add ConnectParams struct for an optional first-frame connect handshake. If the first WebSocket message is {"type":"connect",...}, connection parameters (session_id, device_name, capabilities) are extracted and a "connected" ack is sent back. Old clients sending "message" first still work unchanged (backward-compatible). Extract process_chat_message() helper to avoid duplication between fallback first-message handling and the main message loop. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:29 +03:00
argenis de la rosa	0c1c9ca1a6	feat(gateway): add device registry and pairing API handlers Introduce DeviceRegistry, PairingStore, and five new API endpoints: - POST /api/pairing/initiate — generate a new pairing code - POST /api/pairing/submit — submit code with device metadata - GET /api/devices — list paired devices - DELETE /api/devices/{id} — revoke a paired device - POST /api/devices/{id}/rotate — rotate a device token Wire into AppState and gateway router. Registry is only created when require_pairing is enabled. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:29 +03:00
argenis de la rosa	2d8cfc69f1	feat(config): add PairingDashboardConfig to gateway schema Add PairingDashboardConfig struct with configurable code_length, ttl_secs, max_pending, max_attempts, and lockout_secs fields. Nested under GatewayConfig as `pairing_dashboard` with serde defaults. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:28 +03:00
argenis de la rosa	ea2a04d2a8	fix(cli): align self-test and update commands with implementation plan - Export commands module from lib.rs (pub mod commands) for external consumers - Add --force and --version flags to the Update CLI command - Wire version parameter through to check() and run() in update.rs, supporting targeted version fetches via GitHub releases/tags API - Add WebSocket handshake check (check_websocket_handshake) to the full self-test suite in self_test.rs Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:28 +03:00
argenis de la rosa	77a44c5217	feat(cli): add update command with 6-phase pipeline and rollback Add `zeroclaw update` command with a 6-phase self-update pipeline: 1. Preflight — check GitHub releases API for newer version 2. Download — fetch platform-specific binary to temp dir 3. Backup — copy current binary to .bak for rollback 4. Validate — size check + --version smoke test on download 5. Swap — overwrite current binary with new version 6. Smoke test — verify updated binary runs, rollback on failure Supports --check flag for update-check-only mode without installing. Includes version comparison logic with unit tests. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:28 +03:00
argenis de la rosa	9161b40653	feat(cli): add self-test command with quick and full modes Add `zeroclaw self-test` command with two modes: - Quick mode (--quick): 8 offline checks including config, workspace, SQLite, provider/tool/channel registries, security policy, and version - Full mode (default): adds gateway health and memory round-trip checks Creates src/commands/ module structure with self_test and update stubs. Adds indicatif and tempfile runtime dependencies for the update pipeline. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:28 +03:00
argenis de la rosa	94ed0f62a4	feat(cli): add status --format=exit-code for Docker healthcheck Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:28 +03:00
Argenis	fb01622d47	feat(gateway): persist WS chat sessions across restarts (#3813 ) Gateway WebSocket chat sessions were in-memory only — conversation history was lost on gateway restart, macOS sleep/wake, or client reconnect. This wires up the existing SessionBackend (SQLite) to the gateway WS handler so sessions survive restarts and reconnections. Changes: - Add delete_session() to SessionBackend trait + SQLite implementation - Add session_persistence and session_ttl_hours to GatewayConfig - Add Agent::seed_history() to hydrate agent from persisted messages - Initialize SqliteSessionBackend in run_gateway() when enabled - Send session_start message on WS connect with session_id + resumed - Persist user/assistant messages after each turn - Add GET /api/sessions and DELETE /api/sessions/{id} REST endpoints - Bump version to 0.5.0 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:27 +03:00
Argenis	1a3a2f8baf	fix(web): preserve provider runtime options in ws agent (#3807 ) Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-24 15:17:27 +03:00
Marijan Petričević	d86fd55a82	config/schema: add serde default to AutonomyConfig (#3691 ) Co-authored-by: Argenis <theonlyhennygod@gmail.com>	2026-03-24 15:17:26 +03:00
Argenis	0fa37f178c	fix(security): restore tokens.is_empty() guard, add re-pairing hint (#3738 ) Revert "always generate pairing code" to tighter security posture: codes are only generated on first startup when no tokens exist. Add a CLI hint to the gateway banner so operators know how to re-pair on demand. Fix install.sh to not use --new on fresh install (avoids invalidating the auto-generated code). Fix onboard to show an informational message instead of a throwaway PairingGuard. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:26 +03:00
Alix-007	0847e97b79	fix(channels): allow low-risk shell in non-interactive mode (#3771 ) Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-24 15:17:26 +03:00
Alix-007	fdf3ef526a	fix(daemon): preserve deferred MCP tools in /api/chat (#3790 ) Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-24 15:17:25 +03:00
Alix-007	7191172524	fix(agent): resolve deferred MCP tools by suffix (#3793 ) Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-24 15:17:25 +03:00
Alix-007	b7e3c356cb	feat(skills): support YAML frontmatter in SKILL.md (#3797 ) * feat(skills): support YAML frontmatter in SKILL.md * fix(skills): preserve nested open-skill names --------- Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-24 15:17:25 +03:00
Alix-007	3ab16560e0	fix(groq): fall back on tool validation 400s (#3778 ) Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com> Co-authored-by: Argenis <theonlyhennygod@gmail.com>	2026-03-24 15:17:24 +03:00
Argenis	c525a3340c	feat(runtime): add configurable reasoning effort (#3785 ) * feat(runtime): add configurable reasoning effort * fix(test): add missing reasoning_effort field in live test Add reasoning_effort: None to ProviderRuntimeOptions construction in openai_codex_vision_e2e.rs to fix E0063 compile error. --------- Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-24 15:17:24 +03:00
Alix-007	080d0c816f	fix(channels): hide tool-call notifications by default (#3779 ) Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com> Co-authored-by: Argenis <theonlyhennygod@gmail.com>	2026-03-24 15:17:24 +03:00
GhostC	555b3755fe	fix(skills): allow sibling markdown links within skills root (#3781 ) Made-with: Cursor	2026-03-24 15:17:24 +03:00
Giulio V	f203eed904	feat(multi): LinkedIn tool, WhatsApp voice notes, and Anthropic OAuth fix (#3604 ) * feat(tools): add native LinkedIn integration tool Add a config-gated LinkedIn tool that enables ZeroClaw to interact with LinkedIn's REST API via OAuth2. Supports creating posts, listing own posts, commenting, reacting, deleting posts, viewing engagement stats, and retrieving profile info. Architecture: - linkedin.rs: Tool trait impl with action-dispatched design - linkedin_client.rs: OAuth2 token management and API wrappers - Config-gated via [linkedin] enabled = false (default off) - Credentials loaded from workspace .env file - Automatic token refresh with line-targeted .env update 39 unit tests covering security enforcement, parameter validation, credential parsing, and token management. * feat(linkedin): configurable content strategy and API version - Expand LinkedInConfig with api_version and nested LinkedInContentConfig (rss_feeds, github_users, github_repos, topics, persona, instructions) - Add get_content_strategy tool action so agents can read config at runtime - Fix hardcoded LinkedIn API version 202402 (expired) → configurable, defaulting to 202602 - LinkedInClient accepts api_version as parameter instead of static header - 4 new tests (43 total), all passing * feat(linkedin): add multi-provider image generation for posts Add ImageGenerator with provider chain (DALL-E, Stability AI, Imagen, Flux) and SVG fallback card. LinkedIn tool create_post now supports generate_image parameter. Includes LinkedIn image upload (register → upload → reference), configurable provider priority, and 14 new tests. * feat(whatsapp): add voice note transcription and TTS voice replies - Add STT support: download incoming voice notes via wa-rs, transcribe with OpenAI Whisper (or Groq), send transcribed text to agent - Add TTS support: synthesize agent replies to Opus audio via OpenAI TTS, upload encrypted media, send as WhatsApp voice note (ptt=true) - Voice replies only trigger when user sends a voice note; text messages get text replies only. Flag is consumed after one use to prevent multiple voice notes per agent turn - Fix transcription module to support OpenAI API key (not just Groq): auto-detect provider from API URL, check ANTHROPIC_OAUTH_TOKEN / OPENAI_API_KEY / GROQ_API_KEY env vars in priority order - Add optional api_key field to TranscriptionConfig for explicit key - Add response_format: opus to OpenAI TTS for WhatsApp compatibility - Add channel capability note so agent knows TTS is automatic - Wire transcription + TTS config into WhatsApp Web channel builder * fix(providers): prefer ANTHROPIC_OAUTH_TOKEN over global api_key When the Anthropic provider is used alongside a non-Anthropic primary provider (e.g. custom: gateway), the global api_key would be passed as credential override, bypassing provider-specific env vars. This caused Claude Code subscription tokens (sk-ant-oat01-) to be ignored in favor of the unrelated gateway JWT. Fix: for the anthropic provider, check ANTHROPIC_OAUTH_TOKEN and ANTHROPIC_API_KEY env vars before falling back to the credential override. This mirrors the existing MiniMax OAuth pattern and enables subscription-based auth to work as a fallback provider. feat(linkedin): add scheduled post support via LinkedIn API Add scheduled_at parameter to create_post and create_post_with_image. When provided (RFC 3339 timestamp), the post is created as a DRAFT with scheduledPublishOptions so LinkedIn publishes it automatically at the specified time. This enables the cron job to schedule a week of posts in advance directly on LinkedIn. * fix(providers): prefer env vars for openai and groq credential resolution Generalize the Anthropic OAuth fix to also cover openai and groq providers. When used alongside a non-matching primary provider (e.g. a custom: gateway), the global api_key would be passed as credential override, causing auth failures. Now checks provider-specific env vars (OPENAI_API_KEY, GROQ_API_KEY) before falling back to the credential override. * fix(whatsapp): debounce voice replies to voice final answer only The voice note TTS was triggering on the first send() call, which was often intermediate tool output (URLs, JSON, web fetch results) rather than the actual answer. This produced incomprehensible voice notes. Fix: accumulate substantive replies (>30 chars, not URLs/JSON/code) in a pending_voice map. A spawned debounce task waits 4 seconds after the last substantive message, then synthesizes and sends ONE voice note with the final answer. Intermediate tool outputs are skipped. This ensures the user hears the actual answer in the correct language, not raw tool output in English. * fix(whatsapp): voice in = voice out, text in = text out Rewrite voice reply logic with clean separation: - Voice note received: ALL text output suppressed. Latest message accumulated silently. After 5s of no new messages, ONE voice note sent with the final answer. No tool outputs, no text, just voice. - Text received: normal text reply, no voice. Atomic debounce: multiple spawned tasks race but only one can extract the pending message (remove-inside-lock pattern). Prevents duplicate voice notes. * fix(whatsapp): voice replies send both text and voice note Voice note in → text replies sent normally in real-time PLUS one voice note with the final answer after 10s debounce. Only substantive natural-language messages are voiced (tool outputs, URLs, JSON, code blocks filtered out). Longer debounce (10s) ensures the agent completes its full tool chain before the voice note fires. Text in → text out only, no voice. * fix(channels): suppress tool narration and ack reactions - Add system prompt instruction telling the agent to NEVER narrate tool usage (no "Let me fetch..." or "I will use http_request...") - Disable ack_reactions (emoji reactions on incoming messages) - Users see only the final answer, no intermediate steps * docs(claude): add full CONTRIBUTING.md guidelines to CLAUDE.md Add PR template requirements, code naming conventions, architecture boundary rules, validation commands, and branch naming guidance directly to CLAUDE.md for AI assistant reference. * fix(docs): add blank lines around headings in CLAUDE.md for markdown lint * fix(channels): strengthen tool narration suppression and fix large_futures - Move anti-narration instruction to top of channel system prompt - Add emphatic instruction for WhatsApp/voice channels specifically - Add outbound message filter to strip tool-call-like patterns (⏳, 🔧) - Box::pin the two-phase heartbeat agent::run call (16664 bytes on Linux)	2026-03-24 15:17:23 +03:00
Giulio V	5f47de5087	feat(channels): add Reddit, Bluesky, and generic Webhook adapters (#3598 ) * feat(channels): add Reddit, Bluesky, and generic Webhook adapters - Reddit: OAuth2 polling for mentions/DMs/replies, comment and DM sending - Bluesky: AT Protocol session auth, notification polling, post replies - Webhook: Axum HTTP server for inbound, configurable outbound POST/PUT - All three follow existing channel patterns with tests * fix(channels): use neutral test fixtures and improve test naming in webhook	2026-03-24 15:17:23 +03:00
Giulio V	8148e8369d	feat(knowledge): add knowledge graph for expertise capture and reuse (#3596 ) * feat(knowledge): add knowledge graph for expertise capture and reuse SQLite-backed knowledge graph system for consulting firms to capture, organize, and reuse architecture decisions, solution patterns, lessons learned, and expert matching across client engagements. - KnowledgeGraph (src/memory/knowledge_graph.rs): node CRUD, edge creation, FTS5 full-text search, tag filtering, subgraph traversal, expert ranking by authored contributions, graph statistics - KnowledgeTool (src/tools/knowledge_tool.rs): Tool trait impl with capture, search, relate, suggest, expert_find, lessons_extract, and graph_stats actions - KnowledgeConfig (src/config/schema.rs): disabled by default, configurable db_path/max_nodes, cross_workspace_search off by default for client data isolation - Wired into tools factory (conditional on config.knowledge.enabled) 20 unit tests covering node CRUD, edge creation, search ranking, subgraph queries, expert ranking, and tool actions. * fix: address CodeRabbit review findings - Fix UTF-8 truncation panic in truncate_str by using char-based iteration instead of byte indexing - Add config validation for knowledge.max_nodes > 0 - Add subgraph depth boundary validation (must be > 0, capped at 100) * fix(knowledge): address remaining CodeRabbit review issues - MAJOR: Add db_path non-empty validation in Config::validate() - MAJOR: Reject tags containing commas in add_node (comma is separator) - MAJOR: Fix subgraph depth boundary (0..depth instead of 0..=depth) - MAJOR: Apply project and node_type filters consistently in both tag-only and similarity search paths * fix: correct subgraph traversal test assertion and sync CI workflows	2026-03-24 15:17:23 +03:00
Giulio V	4471f8f8f9	feat(tools): add Google Workspace CLI (gws) integration (#3616 ) * feat(tools): add Google Workspace CLI (gws) integration Adds GoogleWorkspaceTool for interacting with Google Drive, Sheets, Gmail, Calendar, Docs, and other Workspace services via CLI. - Config-gated (google_workspace.enabled) - Service allowlist for restricted access - Requires shell access for CLI delegation - Input validation against shell injection - Wrong-type rejection for all optional parameters - Config validation for allowed_services (empty, duplicate, malformed) - Registered in integrations registry and CLI discovery Closes #2986 * style: fix cargo fmt + clippy violations * feat(google-workspace): expand config with auth, rate limits, and audit settings * fix(tools): define missing GWS_TIMEOUT_SECS constant * fix: Box::pin large futures and resolve duplicate Default impl --------- Co-authored-by: argenis de la rosa <theonlyhennygod@gmail.com>	2026-03-24 15:17:23 +03:00
Giulio V	2202181b07	feat(stt): multi-provider STT with TranscriptionProvider trait (#3614 ) * feat(stt): add multi-provider STT with TranscriptionProvider trait Refactors single-endpoint transcription to support multiple providers: Groq (existing), OpenAI Whisper, Deepgram, AssemblyAI, and Google Cloud Speech-to-Text. Adds TranscriptionManager for provider routing with backward-compatible config fields. * style: fix cargo fmt + clippy violations * fix: Box::pin large futures and resolve merge conflicts with master --------- Co-authored-by: argenis de la rosa <theonlyhennygod@gmail.com>	2026-03-24 15:17:23 +03:00
Argenis	01e9231f61	test(channel): add QQ markdown msg_type regression test (#3752 ) Verify that QQ send body uses msg_type 2 with nested markdown object instead of msg_type 0 with top-level content. Adapted from #3668.	2026-03-24 15:17:22 +03:00
Giulio V	3f0e3ffe05	feat(providers): add Claude Code, Gemini CLI, and KiloCLI subprocess providers (#3615 ) * feat(providers): add Claude Code, Gemini CLI, and KiloCLI subprocess providers Adds three new local subprocess-based providers for AI CLI tools. Each provider spawns the CLI as a child process, communicates via stdin/stdout pipes, and parses responses into ChatResponse format. * fix: resolve clippy unnecessary_debug_formatting and rustfmt violations * fix: resolve remaining clippy unnecessary_debug_formatting in CLI providers * fix(providers): add AiAgent CLI category for subprocess providers	2026-03-24 15:17:22 +03:00
Chris Hengge	714b319ba1	fix(tool): expand cron_add and cron_update parameter schemas (#3671 ) The schedule field in cron_add used a bare {"type":"object"} with a description string encoding a tagged union in pseudo-notation. The patch field in cron_update was an opaque {"type":"object"} despite CronJobPatch having nine fully-typed fields. Both gaps cause weaker instruction-following models to produce malformed or missing nested JSON when invoking these tools. Changes: - cron_add: expand schedule into a oneOf discriminated union with explicit properties and required fields for each variant (cron/at/every), matching the Schedule enum in src/cron/types.rs exactly - cron_add: add descriptions to all previously undocumented top-level fields - cron_add: expand delivery from a bare inline comment to fully-specified properties with per-field descriptions - cron_update: expand patch from opaque object to full properties matching CronJobPatch (name, enabled, command, prompt, model, session_target, delete_after_run, schedule, delivery) - cron_update: schedule inside patch mirrors the same oneOf expansion - Both: add inline NOTE comments flagging that oneOf is correct for OpenAI-compatible APIs but SchemaCleanr::clean_for_gemini must be applied if Gemini native tool calling is ever wired up - Both: add schema-shape tests using the existing test_config/test_security helper pattern, covering oneOf variant structure, required fields, and delivery channel enum completeness No behavior changes. No new dependencies. Backward compatible: the runtime deserialization path (serde on Schedule/CronJobPatch) is unchanged. Co-authored-by: Argenis <theonlyhennygod@gmail.com>	2026-03-24 15:17:22 +03:00
Sid Jain	d4f4173c75	fix(slack): honor mention_only in runtime channel wiring (#3715 ) * feat(slack): wire mention_only group reply policy * feat(slack): expose mention_only in config and wizard defaults	2026-03-24 15:17:22 +03:00
Markus Bergholz	183fb3f2bb	Fix: Support Nextcloud Talk Activity Streams 2.0 webhook format (#3737 ) * fix * fix * format	2026-03-24 15:17:22 +03:00
Ricardo Madriz	9c30164aa3	fix(tools) Wire activated toolset into dispatch (#3747 ) * fix(tools): wire ActivatedToolSet into tool dispatch and spec advertisement When deferred MCP tools are activated via tool_search, they are stored in ActivatedToolSet but never consulted by the tool call loop. tool_specs is built once before the iteration loop and never refreshed, so the provider API tools[] parameter never includes activated tools. find_tool only searches the static registry, so execution dispatch also fails silently. Thread Arc<Mutex<ActivatedToolSet>> from creation sites through to run_tool_call_loop. Rebuild tool_specs each iteration to merge base registry specs with activated specs. Add fallback in execute_one_tool to check the activated set when the static registry lookup misses. Change ActivatedToolSet internal storage from Box<dyn Tool> to Arc<dyn Tool> so we can clone the Arc out of the mutex guard before awaiting tool.execute() (std::sync::MutexGuard is not Send). * fix(tools): add activated_tools field to new ChannelRuntimeContext test site	2026-03-24 15:17:21 +03:00
Chris Hengge	525bf47954	fix(integrations): wire Cron and Browser status to config fields (#3750 ) Both entries had hardcoded \|_\| IntegrationStatus::Available, ignoring the live config entirely. Users with cron.enabled = true or browser.enabled = true saw 'Available' on the /integrations dashboard card instead of 'Active'. Root cause: status_fn closures did not capture the Config argument. Fix: replace the \|_\| stubs with \|c\| closures that check c.cron.enabled and c.browser.enabled respectively, matching the pattern used by every other wired entry in the registry (Telegram, Discord, Shell, etc.). What did NOT change: ComingSoon entries, always-Active entries (Shell, File System), platform entries, or any other registry logic.	2026-03-24 15:17:21 +03:00
Giulio V	49eb3ced05	feat(security): add Merkle hash-chain audit trail (#3601 ) * feat(security): add Merkle hash-chain audit trail Each audit entry now includes a SHA-256 hash linking it to the previous entry (entry_hash, prev_hash, sequence), forming a tamper-evident chain. Modifying any entry invalidates all subsequent hashes. - Chain fields added to AuditEvent with #[serde(default)] for backward compat - AuditLogger tracks chain state and recovers from existing logs on restart - verify_chain() validates hash linkage, sequence continuity, and integrity - Five new tests: genesis seed, multi-entry verify, tamper detection, sequence gap detection, and cross-restart chain recovery * fix(security): replace personal name with neutral label in audit tests	2026-03-24 15:17:21 +03:00
Argenis	fc3af217ad	fix(agent): prevent duplicate tool schema injection in XML dispatcher (#3744 ) Remove duplicate tool listing from XmlToolDispatcher::prompt_instructions() since tool listing is already handled by ToolsSection in prompt.rs. The method now only emits the XML protocol envelope. Also fix UTF-8 char boundary panics in memory consolidation truncation by using char_indices() instead of manual byte-boundary scanning. Fixes #3643 Supersedes #3678 Co-authored-by: TJUEZ <TJUEZ@users.noreply.github.com>	2026-03-24 15:17:21 +03:00
伊姆	3ec4fab88f	fix(config): support socks proxy scheme for Clash Verge (#3001 ) Co-authored-by: imu <imu@sgcc.com.cn>	2026-03-24 15:17:21 +03:00
Giulio V	9320b21340	feat(whatsapp-web): add voice message transcription support (#3617 ) Adds audio message detection and transcription to WhatsApp Web channel. Voice messages (PTT) are downloaded, transcribed via the existing transcription subsystem (Groq Whisper), and delivered as text content. - TranscriptionConfig field with builder pattern - Duration limit enforcement before download - MIME type mapping for audio formats - Graceful error handling (skip on failure) - Preserves full retry/reconnect state machine from master	2026-03-24 15:17:20 +03:00
Sandeep Ghael	caba6bcbf8	fix(channel): resolve multi-room reply routing regression (#3224 ) (#3378 ) * fix(channel): resolve multi-room reply routing regression (#3224) PR #3224 (`f0f0f808`, "feat(matrix): add multi-room support") changed the channel name format in matrix.rs from "matrix" to "matrix:!roomId", but the channel lookup in mod.rs still does an exact match against channels_by_name, which is keyed by Channel::name() (returns "matrix"). This mismatch causes target_channel to always resolve to None for Matrix messages, silently dropping all replies. Fix: fall back to a prefix match on the base channel name (before ':') when the exact lookup fails. This preserves multi-room conversation isolation while correctly routing replies to the originating channel. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * style: apply cargo fmt to channel routing fix Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Sandeep (Claude) <sghael+claude@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 15:17:20 +03:00
Giulio V	1d0145f273	feat(tools): add browser delegation tool (#3610 ) * feat(tools): add browser delegation tool for corporate web app interaction Adds BrowserDelegateTool that delegates browser-based tasks to Claude Code (or other browser-capable CLIs) for interacting with corporate tools (Teams, Outlook, Jira, Confluence) via browser automation. Includes domain validation (allow/blocklist), task templates, Chrome profile persistence for SSO sessions, and timeout management. * fix: resolve clippy violation in browser delegation tool * fix(browser-delegate): validate URLs embedded in task text against domain policy Scan the task text for http(s):// URLs using regex and validate each against the allow/block domain lists before forwarding to the browser CLI subprocess. This prevents bypassing domain restrictions by embedding blocked URLs in the task parameter. * fix(browser-delegate): constrain URL schemes, gate on runtime, document config - Add has_shell_access gate so BrowserDelegateTool is only registered on shell-capable runtimes (skipped with warning on WASM/edge runtimes) - Add boundary tests for javascript: and data: URL scheme rejection - URL scheme validation (http/https only) and config docs were already addressed by a prior commit on this branch * fix(tools): address CodeRabbit review findings for browser delegation Remove dead `max_concurrent_tasks` config field and expand doc comments on the `[browser_delegate]` config section in schema.rs.	2026-03-24 15:17:20 +03:00
Christian Pojoni	d91e54a5d0	fix(tool+channel): revert invalid model set via model_routing_config (#3497 ) When the LLM hallucinates an invalid model ID through the model_routing_config tool's set_default action, the invalid model gets persisted to config.toml. The channel hot-reload then picks it up and every subsequent message fails with a non-retryable 404, permanently killing the connection with no user recovery path. Fix with two layers of defense: 1. Tool probe-and-rollback: after saving the new model, send a minimal chat request to verify the model is accessible. If the API returns a non-retryable error (404, auth failure, etc.), automatically restore the previous config and return a failure notice to the LLM. 2. Channel safety net: in maybe_apply_runtime_config_update, reject config reloads when warmup fails with a non-retryable error instead of applying the broken config anyway. Co-authored-by: Christian Pojoni <christian.pojoni@gmail.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-24 15:17:20 +03:00
DotViegas	101bfa1928	fix(providers): adjust temperature for OpenAI reasoning models (#2936 ) Some OpenAI models (o1, o3, o4, gpt-5 variants) only accept temperature=1.0 and return errors with other values like 0.7. This change automatically adjusts the temperature parameter based on the model being used. Changes: - Add adjust_temperature_for_model() function to detect reasoning models - Apply temperature adjustment in chat_with_system(), chat(), and chat_with_tools() - Preserve user-specified temperature for standard models (gpt-4o, gpt-4-turbo, etc.) - Force temperature=1.0 for reasoning models (o1, o3, o4, gpt-5, gpt-5-mini, gpt-5-nano, gpt-5.x-chat-latest) Testing: - Add 7 unit tests covering reasoning models, standard models, and edge cases - All tests pass successfully - Empirical testing documented in docs/openai-temperature-compatibility.md Impact: - Fixes temperature errors when using o1, o3, o4, and gpt-5 model families - No breaking changes - transparent adjustment for end users - Standard models continue to work with flexible temperature values Risk: Low - isolated change within OpenAI provider, well-tested Rollback: Revert this commit to restore previous behavior Co-authored-by: Argenis <theonlyhennygod@gmail.com>	2026-03-24 15:17:19 +03:00
Giulio V	4e74857d34	feat(observability): add Hands dashboard metrics and events (#3595 ) Add HandStarted, HandCompleted, and HandFailed event variants to ObserverEvent, and HandRunDuration, HandFindingsCount, HandSuccessRate metric variants to ObserverMetric. Update all observer backends (log, noop, verbose, prometheus, otel) to handle the new variants with appropriate instrumentation. Prometheus backend registers hand_runs counter, hand_duration histogram, and hand_findings counter. OTel backend creates spans and records metrics for hand runs.	2026-03-24 15:17:19 +03:00
smallwhite	6eef252e10	fix(telegram): avoid duplicate finalize_draft messages (#3259 )	2026-03-24 15:17:19 +03:00
Chris Hengge	8431a44ba4	fix(memory): serialize MemoryCategory as plain string and guard dashboard render crashes (#3051 ) The /memory dashboard page rendered a black screen when MemoryCategory::Custom was serialized by serde's derived impl as a tagged object {"custom":"..."} but the frontend expected a plain string. No navigation was possible without using the browser Back button. Changes: - src/memory/traits.rs: replace derived serde impls with custom serialize (delegates to Display, emits plain snake_case string) and deserialize (parses known variants by name, falls through to Custom(s) for unknown). Adds memory_category_serde_uses_snake_case and memory_category_custom_roundtrip tests. No persistent storage migration needed — all backends (SQLite, Markdown, Postgres) use their own category_to_str/str_to_category helpers and never read serde-serialized category values back from disk. - web/src/App.tsx: export ErrorBoundary class so render crashes surface a recoverable UI instead of a black screen. Adds aria-live="polite" to the pairing error paragraph for screen reader accessibility. - web/src/components/layout/Layout.tsx: wrap Outlet in ErrorBoundary keyed by pathname so the navigation shell stays mounted during a page crash and the boundary resets on route change. Co-authored-by: Argenis <theonlyhennygod@gmail.com>	2026-03-24 15:17:19 +03:00
Chris Hengge	c96f238038	fix(channel): bypass mention_only gate for Discord DMs (#2983 ) When mention_only is enabled, the bot correctly requires an @mention in guild (server) channels. However, Direct Messages have no guild_id and are inherently private and addressed to the bot — requiring a @mention in a DM is never correct and silently drops all DM messages. Changes: - src/channels/discord.rs: detect DMs via absence of guild_id in the gateway payload, compute effective_mention_only = self.mention_only && !is_dm, and pass that to normalize_incoming_content instead of self.mention_only. DMs bypass the mention gate; guild messages retain existing behaviour. - Adds three tests: DM bypasses mention gate, guild message without mention is rejected, guild message with mention passes and strips the mention tag. Co-authored-by: Argenis <theonlyhennygod@gmail.com>	2026-03-24 15:17:18 +03:00
Ericsunsk	b5af73cac6	fix(memory): filter autosave noise and scope recall/store by session (#3695 ) * fix(memory): filter autosave noise and scope memory by session * style: format rebase-resolved gateway and memory loader * fix(tests): update memory loader mock for session-aware context * fix(openai-codex): decode utf-8 safely across stream chunks	2026-03-24 15:17:18 +03:00
Vast-stars	7db8853085	fix(agent): remove bare URL → curl fallback in GLM-style tool call parser (#3694 ) * fix(agent): remove bare URL → curl fallback in GLM-style tool call parser The `parse_glm_style_tool_calls` function had a "Plain URL" fallback that converted any bare URL line (e.g. `https://example.com`) into a `shell` tool call running `curl -s '<url>'`. This caused: - False positives: normal URLs in LLM replies misinterpreted as tool calls - Swallowed replies: text with URLs not forwarded to the channel - Unintended shell commands: `curl` executed without user intent Explicit GLM-format tool calls like `browser_open/url>https://...` and `shell/command>...` are unaffected — only the bare URL catch-all is removed. * style: cargo fmt --------- Co-authored-by: argenis de la rosa <theonlyhennygod@gmail.com>	2026-03-24 15:17:18 +03:00
Argenis	66e8442ab8	feat(channels): add X/Twitter and Mochat channel integrations (#3735 ) * feat(channels): add X/Twitter and Mochat channel integrations Add two new channel implementations to close competitive gaps: - X/Twitter: Twitter API v2 with mentions polling, tweet threading (auto-splits at 280 chars), DM support, and rate limit handling - Mochat: HTTP polling-based integration with Mochat customer service platform, configurable poll interval, message dedup Both channels follow the existing Channel trait pattern with full config schema integration, health checks, and dedup. Closes competitive gap: NanoClaw had X/Twitter, Nanobot had Mochat. * fix(channels): use write! instead of format_push_string for clippy Replace url.push_str(&format!(...)) with write!(url, ...) to satisfy clippy::format_push_string lint on CI. * fix(channels): rename reply_to parameter to avoid legacy field grep The component test source_does_not_use_legacy_reply_to_field greps for "reply_to:" in source files. Rename the parameter to reply_tweet_id to pass this check.	2026-03-24 15:17:17 +03:00

1 2 3 4 5 ...

1250 Commits