Issue: #1420
Some LLM providers (e.g., xAI Grok) output tool calls in the format:
```tool file_write
{"path": "...", "content": "..."}
```
Previously, ZeroClaw only matched:
- ```tool_call
- ```tool-call
- ```toolcall
- ```invoke
This caused silent failures where:
1. Tool calls were not parsed
2. Agent reported success but no tools executed
3. LLM hallucinated tool execution results
Fix:
1. Added a new regex `MD_TOOL_NAME_RE` to match the ` ```tool <name>` format
2. Parse the tool name from the code block header
3. Parse JSON arguments from the block content
4. Updated `detect_tool_call_parse_issue()` to include this format
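A minimal sketch of the new fence-header matching (the real implementation uses the compiled regex `MD_TOOL_NAME_RE`; the function name here is illustrative and the logic is simplified to a dependency-free string parse):

```rust
// Given the opening line of a fenced block such as "```tool file_write",
// extract the tool name after the bare "tool" keyword. "tool_call",
// "tool-call", etc. have no trailing space and fall through to the
// existing aliases.
fn parse_tool_fence_header(header: &str) -> Option<String> {
    // Strip the opening backticks and surrounding whitespace.
    let info = header.trim_start_matches('`').trim();
    // Expect exactly "tool <name>".
    let rest = info.strip_prefix("tool ")?;
    let name = rest.trim();
    if name.is_empty() { None } else { Some(name.to_string()) }
}
```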
Added 3 tests:
- parse_tool_calls_handles_tool_name_fence_format
- parse_tool_calls_handles_tool_name_fence_shell
- parse_tool_calls_handles_multiple_tool_name_fences
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Thinking/reasoning models (Kimi K2.5, GLM-4.7, DeepSeek-R1) return a
reasoning_content field in assistant messages containing tool calls.
ZeroClaw was silently dropping this field when constructing conversation
history, causing provider APIs to reject follow-up requests with 400
errors: "thinking is enabled but reasoning_content is missing in
assistant tool call message".
Add reasoning_content: Option<String> as an opaque pass-through at every
layer of the pipeline: ChatResponse, ConversationMessage, NativeMessage
structs, parse/convert/build functions, and dispatcher. The field is
skipped during serialization when None, so it is invisible for
non-thinking models.
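A minimal sketch of the pass-through (struct and field names follow the commit text; the real structs carry more fields, and the serde attribute is shown as a comment to keep the sketch dependency-free). The invariant is that every conversion copies reasoning_content through unchanged:

```rust
#[derive(Clone, Debug)]
struct ChatResponse {
    content: String,
    // #[serde(skip_serializing_if = "Option::is_none")]
    reasoning_content: Option<String>,
}

#[derive(Clone, Debug)]
struct ConversationMessage {
    role: String,
    content: String,
    reasoning_content: Option<String>,
}

// When appending the assistant turn to history, the field must survive;
// dropping it here is exactly the bug that triggered the 400 errors.
fn to_history(resp: &ChatResponse) -> ConversationMessage {
    ConversationMessage {
        role: "assistant".to_string(),
        content: resp.content.clone(),
        reasoning_content: resp.reasoning_content.clone(),
    }
}
```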
Closes #1327
Models like GLM-4.7 emit malformed tool call formats that the existing
parser cannot handle: cross-alias close tags (e.g. <tool_call>...</invoke>),
shortened bodies (tool>value), YAML-style multi-line, and attribute-style
(tool key="value"). This adds defense-in-depth parsing for these formats
so tool calls are not silently dropped.
Changes:
- Add TOOL_CALL_CLOSE_TAGS constant for cross-alias close tag matching
- Add default_param_for_tool() for shortened body parameter inference
- Add parse_glm_shortened_body() for 3 GLM sub-formats inside tags
- Extend parse_tool_calls() with cross-alias resolution and GLM fallbacks
- Merge duplicate match arms in map_tool_name_alias() for clippy compliance
- Add 13 focused tests covering all new parsing paths
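A hedged sketch of the shortened-body fallback. The real code (parse_glm_shortened_body, default_param_for_tool) handles three sub-formats; this shows only the "tool>value" case, and the per-tool default parameters are an illustrative subset:

```rust
// Map a tool to the parameter a bare value should fill.
fn default_param_for_tool(tool: &str) -> Option<&'static str> {
    match tool {
        "shell" => Some("command"),
        "file_read" => Some("path"),
        _ => None,
    }
}

// "shell>ls -la"  ->  ("shell", "command", "ls -la")
fn parse_shortened_body(body: &str) -> Option<(String, String, String)> {
    let (tool, value) = body.trim().split_once('>')?;
    let tool = tool.trim();
    let param = default_param_for_tool(tool)?;
    Some((tool.to_string(), param.to_string(), value.trim().to_string()))
}
```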
Port the progress streaming code from the fork's 75fdeb0 commit.
The upstream run_tool_call_loop only uses on_delta for final response
streaming, missing real-time feedback during tool execution.
Added progress sends at 4 points in the tool loop:
- "Thinking..." / "Thinking (round N)..." before each LLM call
- "Got N tool call(s) (Xs)" after LLM responds with tool calls
- Tool start: "⏳ tool_name: hint..." before each tool execution
- Tool complete: "✅ tool_name (Xs)" or "❌ tool_name (Xs)" after
Also added DRAFT_CLEAR_SENTINEL handling in the channel draft updater
so progress lines are cleared before the final answer streams in.
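A rough sketch of the progress-label formatting (wording taken from the list above; the function names and the round-numbering convention are assumptions, not the real API):

```rust
// Label sent before each LLM call; rounds are assumed 1-based here.
fn thinking_label(round: usize) -> String {
    if round == 1 {
        "Thinking...".to_string()
    } else {
        format!("Thinking (round {round})...")
    }
}

// Label sent after a tool finishes, with elapsed seconds.
fn tool_done_label(name: &str, ok: bool, secs: f64) -> String {
    let mark = if ok { "✅" } else { "❌" };
    format!("{mark} {name} ({secs:.1}s)")
}
```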
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Daemon heartbeat and cron tasks called agent::run() which hardcoded
channel_name as "cli" and always created an ApprovalManager, causing
[Y]es / [N]o / [A]lways stdin prompts on the unattended daemon terminal.
Add interactive parameter to agent::run(): CLI passes true (preserving
approval flow), daemon/cron pass false (no ApprovalManager, channel
marked as "daemon").
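A simplified sketch of the new signature (the real agent::run takes many more parameters; the tuple return here is only to show the two effects of the flag):

```rust
struct ApprovalManager; // stand-in for the real approval machinery

// interactive=true: CLI behavior, ApprovalManager created.
// interactive=false: daemon/cron behavior, no stdin prompts possible.
fn run(interactive: bool) -> (Option<ApprovalManager>, &'static str) {
    if interactive {
        (Some(ApprovalManager), "cli")
    } else {
        (None, "daemon")
    }
}
```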
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Upstream main now derives schemars::JsonSchema on all config structs.
Our HooksConfig and BuiltinHooksConfig were missing it, causing CI
Build (Smoke) failure when the merge commit was compiled.
- C1: Use real tool success boolean instead of starts_with("Error")
heuristic in after_tool_call hook
- C2: Wire HookRunner from config into ChannelRuntimeContext so hooks
actually fire in daemon/channel mode (was hardcoded to None)
- I1: Suppress unused_imports warning on HookHandler public API re-export
- I3: Remove session_memory and boot_script config fields that had no
backing implementation (YAGNI); keep only command_logger which is wired
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Thread Option<&HookRunner> into run_tool_call_loop with hook fire points
for LLM input, before/after tool calls. Add hooks field to
ChannelRuntimeContext for message received/sending interception.
Build HookRunner from config in run_gateway and fire gateway_start.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Address clippy lints (redundant continue, as-cast, match arms, elided
lifetimes, format vs write!) and reformat long cfg attributes and assert
macros to pass `cargo fmt --check` and `cargo clippy -D warnings`.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add comprehensive tool name alias mapping:
- fileread -> file_read
- filewrite -> file_write
- memoryrecall -> memory_recall
- bash/sh/cmd -> shell
- etc.
Apply to all new parsers (XML attribute, Perl, FunctionCall).
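The mapping reduces to a single match; this sketch covers only the aliases listed above (the real table is larger):

```rust
// Normalize LLM-emitted tool names to canonical registry names;
// unknown names pass through unchanged.
fn map_tool_name_alias(name: &str) -> &str {
    match name {
        "fileread" => "file_read",
        "filewrite" => "file_write",
        "memoryrecall" => "memory_recall",
        "bash" | "sh" | "cmd" => "shell",
        other => other,
    }
}
```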
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add parser for <FunctionCall> style that MiniMax also uses:
<FunctionCall>
file_read
<code>path>/Users/.../file.md</code>
</FunctionCall>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add parsers for two additional tool call formats that MiniMax LLM uses:
- XML attribute style: <minimax:toolcall><invoke name="shell"><parameter name="command">ls</parameter></invoke></minimax:toolcall>
- Perl/hash-ref style: {tool => "shell", args => { --command "ls" }}
Previously these were sent as plain text to Telegram channel instead of
being executed as tool calls.
Also fixes build warnings:
- Add #[allow(unused_imports)] to cost/mod.rs and onboard/mod.rs re-exports
- Change channels::handle_command visibility to pub(crate)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add input_tokens and output_tokens fields to ObserverEvent::LlmResponse
so per-call token data flows through all observer backends. Prometheus
gains three new counters (llm_requests_total, tokens_input_total,
tokens_output_total) for granular token tracking by provider/model.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add a lightweight TokenUsage struct to providers::traits with
input_tokens and output_tokens fields. Add usage: Option<TokenUsage>
to ChatResponse and update all construction sites across providers
and agent modules with usage: None.
This is the first step toward capturing token usage data from LLM
API responses. Currently all sites set usage: None — subsequent
commits will parse actual usage from each provider's response format.
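The shape of the addition (field names from the commit text; the derives and the trimmed-down ChatResponse are illustrative):

```rust
#[derive(Clone, Copy, Debug, Default, PartialEq)]
struct TokenUsage {
    input_tokens: u64,
    output_tokens: u64,
}

// The real ChatResponse has more fields; only the new one is shown.
struct ChatResponse {
    usage: Option<TokenUsage>,
}
```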
- Remove duplicate `chat` method in reliable.rs (E0201)
- Fix `futures` → `futures_util` imports in agent.rs and loop_.rs (E0433)
- Gate PostgresMemory behind `memory-postgres` feature in cli.rs (E0433)
- Fix regex backreference in XML tool parser (unsupported by regex crate)
- Add missing `skills_prompt_mode` argument in test
- Apply rustfmt to files with formatting issues on main
Gateway channels (WhatsApp, Linq, Nextcloud Talk) were returning raw
<tool> tags without executing tools or showing results. The CLI
correctly executed tools and returned results.
Root cause: gateway handlers used run_gateway_chat_with_multimodal which
explicitly disabled tools for simple chat-only mode.
Fix: Create run_gateway_chat_with_tools() which uses process_message()
for full tool support, while keeping run_gateway_chat_simple() for
the webhook endpoint to maintain backward compatibility with tests.
Changes:
- Add run_gateway_chat_with_tools() for channel handlers (uses process_message)
- Keep run_gateway_chat_simple() for webhook endpoint (uses state.provider)
- Remove unused provider_label variables from channel handlers
- Remove unused imports (ChatMessage, ProviderCapabilityError)
- Fix pre-existing test compilation issue (missing SkillsPromptInjectionMode)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove duplicate chat method in ReliableProvider impl (E0201)
The second chat fn (lines 662-769) was an exact duplicate of the
first (lines 540-647) in the same impl block.
- Gate PostgresMemory usage in memory CLI behind memory-postgres feature (E0433)
super::PostgresMemory is only exported when the feature is enabled;
the Postgres match arm now compiles to an explicit bail when the
feature is off.
- Replace futures::future::join_all with futures_util::future::join_all (E0433)
  The crate depends on futures-util, not futures. Fixed in both
  agent.rs and loop_.rs.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
ReliableProvider was missing a chat() override, causing it to fall through
to the default Provider::chat() trait implementation. The default
implementation delegates to chat_with_history() which returns a plain
String and wraps it in ChatResponse with tool_calls: Vec::new() — so
native tool calling was completely broken through the retry/failover
wrapper even though the underlying provider properly supports it.
Changes:
- Add chat() with full retry/backoff/failover logic matching existing
chat_with_system(), chat_with_history(), and chat_with_tools() overrides
- Include context_window_exceeded early-exit matching other method patterns
- Add 7 focused tests: delegation with tool calls, retry recovery,
supports_native_tools propagation, aggregated error reporting,
model failover, non-retryable error skip, and system prompt zero-XML
verification
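A generic sketch of the retry shape the new chat() override follows (backoff, failover, and error types are heavily simplified; the real method mirrors the existing chat_with_tools() override):

```rust
// Retry a fallible call up to max_attempts times, aggregating every
// error for the final report instead of keeping only the last one.
fn with_retry<T, E>(
    max_attempts: u32,
    mut call: impl FnMut() -> Result<T, E>,
) -> Result<T, Vec<E>> {
    let mut errors = Vec::new();
    for _ in 0..max_attempts {
        match call() {
            Ok(v) => return Ok(v),
            Err(e) => errors.push(e), // collect for aggregated reporting
        }
    }
    Err(errors)
}
```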
On non-CLI channels (Telegram, Discord, etc.), tools like shell and
file_write cannot receive interactive approval and are auto-denied,
causing the LLM to see confusing error responses and fabricate answers.
Add a new config option `non_cli_excluded_tools` under `[autonomy]`
that removes specified tools from the tool specs sent to the LLM on
non-CLI channels. This prevents the model from attempting tool calls
that would fail, forcing it to use data already in the system prompt.
The change filters tool_specs in run_tool_call_loop when the
excluded_tools parameter is non-empty. CLI channels are unaffected.
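A hypothetical config snippet (the tool names are illustrative, not a recommended default):

```toml
[autonomy]
non_cli_excluded_tools = ["shell", "file_write"]
```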
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
When autonomy is set to "supervised", the approval gate only prompted
interactively on CLI. On Telegram and other channels, all tool calls
were silently auto-approved with ApprovalResponse::Yes, including
high-risk tools like shell — completely bypassing supervised mode.
On non-CLI channels where interactive prompting is not possible, deny
tool calls that require approval instead of auto-approving. Users can
expand the auto_approve list in config to explicitly allow specific
tools on non-interactive channels.
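A simplified sketch of the corrected gate (enum and parameter names condensed from the commit text; the real CLI arm runs the interactive prompt rather than returning a constant):

```rust
#[derive(Debug, PartialEq)]
enum ApprovalResponse { Yes, No }

fn gate(requires_approval: bool, is_cli: bool, auto_approved: bool) -> ApprovalResponse {
    if !requires_approval || auto_approved {
        return ApprovalResponse::Yes;
    }
    if is_cli {
        ApprovalResponse::Yes // placeholder: the real code prompts here
    } else {
        ApprovalResponse::No // previously this arm silently auto-approved
    }
}
```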
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
When parallel_tools is enabled, both code branches in execute_tools()
ran the same sequential for loop. The parallel path was a no-op.
Use futures::future::join_all to execute tool calls concurrently when
parallel_tools is true. The futures crate is already a dependency.
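The actual fix uses futures::future::join_all in async code; this dependency-free sketch shows the same before/after shape with scoped threads standing in for concurrent futures:

```rust
use std::thread;

fn execute_tools(parallel: bool, calls: Vec<i32>) -> Vec<i32> {
    let run_one = |c: i32| c * 2; // stand-in for one tool execution
    if parallel {
        // Previously this branch was the same sequential loop as below,
        // making parallel_tools a no-op.
        thread::scope(|s| {
            let handles: Vec<_> = calls
                .iter()
                .map(|&c| s.spawn(move || run_one(c)))
                .collect();
            handles.into_iter().map(|h| h.join().unwrap()).collect()
        })
    } else {
        calls.into_iter().map(run_one).collect()
    }
}
```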
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
AnthropicProvider declared supports_native_tools() = true but did not
override chat_with_tools(). The default trait implementation drops all
conversation history (sends only system + last user message), breaking
multi-turn conversations on Telegram and other channels.
Changes:
- Override chat_with_tools() in AnthropicProvider: converts OpenAI-format
tool JSON to ToolSpec and delegates to chat() which preserves full
message history
- Skip build_tool_instructions() XML protocol when provider supports
native tools (saves ~12k chars in system prompt)
- Remove duplicate Tool Use Protocol section from build_system_prompt()
for native-tool providers
- Update Your Task section to encourage conversational follow-ups
instead of XML tool_call tags when using native tools
- Add tracing::warn for malformed tool definitions in chat_with_tools
Every user message was auto-saved to memory regardless of length,
flooding the store with trivial entries like "ok", "thanks", "hi".
These noise entries competed with real memories during recall, degrading
relevance — especially with keyword-only search.
Skip auto-saving messages shorter than 20 characters. Applied to both
the channel path (channels/mod.rs) and CLI agent path (agent/loop_.rs).
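The gate itself is a one-liner (the 20-character threshold is from the commit text; the function name is illustrative):

```rust
// Only auto-save messages long enough to carry recallable content;
// counts chars rather than bytes so multibyte text is not penalized.
fn should_auto_save(message: &str) -> bool {
    message.chars().count() >= 20
}
```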
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Replace bare .unwrap() calls with descriptive .expect() messages in
src/agent/agent.rs and src/tools/shell.rs test modules. Adds meaningful
failure context for memory creation, agent builder, and tool execution
assertions. Addresses audit finding on test assertion quality (§5.2).
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Adds section markers and decision-point comments to the three most complex
control-flow modules. Comments explain loop invariants, retry/fallback
strategy, security policy precedence rules, and error handling rationale.
This improves maintainability by making the reasoning behind complex
branches explicit for reviewers and future contributors.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>