Port the progress streaming code from the fork's 75fdeb0 commit.
The upstream run_tool_call_loop uses on_delta only for final-response
streaming, so there is no real-time feedback during tool execution.
Add progress sends at four points in the tool loop:
- "Thinking..." / "Thinking (round N)..." before each LLM call
- "Got N tool call(s) (Xs)" after LLM responds with tool calls
- Tool start: "⏳ tool_name: hint..." before each tool execution
- Tool complete: "✅ tool_name (Xs)" or "❌ tool_name (Xs)" after
Also add DRAFT_CLEAR_SENTINEL handling in the channel draft updater
so progress lines are cleared before the final answer streams in.
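A rough shape of the send points (sketch only; `on_progress` is a
hypothetical stand-in for however the ported code surfaces progress
lines to the channel draft):

    fn announce_round(round: usize, on_progress: &dyn Fn(String)) {
        // 1. Before each LLM call
        if round == 0 {
            on_progress("Thinking...".to_string());
        } else {
            on_progress(format!("Thinking (round {})...", round + 1));
        }
        // 2. After the LLM reply:  "Got N tool call(s) (Xs)"
        // 3. Before each tool:     "⏳ tool_name: hint..."
        // 4. After each tool:      "✅ tool_name (Xs)" / "❌ tool_name (Xs)"
    }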
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Daemon heartbeat and cron tasks called agent::run(), which hardcoded
channel_name as "cli" and always created an ApprovalManager, causing
[Y]es / [N]o / [A]lways stdin prompts on the unattended daemon terminal.
Add interactive parameter to agent::run(): CLI passes true (preserving
approval flow), daemon/cron pass false (no ApprovalManager, channel
marked as "daemon").
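Sketched with simplified types (the real signature also carries config
and channel state):

    struct ApprovalManager;
    impl ApprovalManager {
        fn new() -> Self { ApprovalManager }
    }

    fn approvals_for(interactive: bool) -> (Option<ApprovalManager>, &'static str) {
        // CLI keeps the stdin prompt; daemon/cron get neither the
        // ApprovalManager nor the "cli" channel label.
        let manager = interactive.then(ApprovalManager::new);
        let channel = if interactive { "cli" } else { "daemon" };
        (manager, channel)
    }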
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Upstream main now derives schemars::JsonSchema on all config structs.
Our HooksConfig and BuiltinHooksConfig were missing it, causing a CI
Build (Smoke) failure when the merge commit was compiled.
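The fix is one derive per struct, roughly (fields illustrative except
command_logger, the only wired option):

    use schemars::JsonSchema;
    use serde::{Deserialize, Serialize};

    #[derive(Debug, Clone, Default, Serialize, Deserialize, JsonSchema)]
    pub struct HooksConfig {
        #[serde(default)]
        pub builtin: BuiltinHooksConfig,
    }

    #[derive(Debug, Clone, Default, Serialize, Deserialize, JsonSchema)]
    pub struct BuiltinHooksConfig {
        #[serde(default)]
        pub command_logger: bool,
    }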
- C1: Use real tool success boolean instead of starts_with("Error")
heuristic in after_tool_call hook
- C2: Wire HookRunner from config into ChannelRuntimeContext so hooks
actually fire in daemon/channel mode (was hardcoded to None)
- I1: Suppress unused_imports warning on HookHandler public API re-export
- I3: Remove session_memory and boot_script config fields that had no
backing implementation (YAGNI); keep only command_logger which is wired
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Thread Option<&HookRunner> into run_tool_call_loop with hook fire points
for LLM input, before/after tool calls. Add hooks field to
ChannelRuntimeContext for message received/sending interception.
Build HookRunner from config in run_gateway and fire gateway_start.
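Rough shape of the fire points (event names and types simplified):

    struct HookRunner;
    impl HookRunner {
        fn fire(&self, event: &str) {
            // dispatch to hooks registered for `event`
            let _ = event;
        }
    }

    fn run_tool_call_loop(hooks: Option<&HookRunner>) {
        if let Some(h) = hooks { h.fire("llm_input"); }
        // ... LLM call happens here; then, per tool call:
        if let Some(h) = hooks { h.fire("before_tool_call"); }
        // ... execute the tool ...
        if let Some(h) = hooks { h.fire("after_tool_call"); }
    }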
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Address clippy lints (redundant continue, as-cast, match arms, elided
lifetimes, format! vs write!) and reformat long cfg attributes and assert
macros to pass `cargo fmt --check` and `cargo clippy -- -D warnings`.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add comprehensive tool name alias mapping:
- fileread -> file_read
- filewrite -> file_write
- memoryrecall -> memory_recall
- bash/sh/cmd -> shell
- etc.
Apply to all new parsers (XML attribute, Perl, FunctionCall).
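The normalization is a plain match, roughly (alias set abbreviated):

    fn canonical_tool_name(raw: &str) -> String {
        match raw.to_ascii_lowercase().as_str() {
            "fileread" => "file_read".to_string(),
            "filewrite" => "file_write".to_string(),
            "memoryrecall" => "memory_recall".to_string(),
            "bash" | "sh" | "cmd" => "shell".to_string(),
            other => other.to_string(),
        }
    }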
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add a parser for the <FunctionCall> style that MiniMax also uses:
<FunctionCall>
file_read
<code>path>/Users/.../file.md</code>
</FunctionCall>
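A sketch of how such a block can be matched with the regex crate
(pattern simplified; the real parser also normalizes aliases and
handles multiple <code> entries):

    use regex::Regex;

    fn parse_function_call(text: &str) -> Option<(String, String)> {
        let re = Regex::new(
            r"(?s)<FunctionCall>\s*(\w+)\s*<code>(.*?)</code>\s*</FunctionCall>",
        )
        .ok()?;
        let caps = re.captures(text)?;
        Some((caps[1].to_string(), caps[2].to_string()))
    }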
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add parsers for two additional tool call formats that MiniMax LLM uses:
- XML attribute style: <minimax:toolcall><invoke name="shell"><parameter name="command">ls</parameter></invoke></minimax:toolcall>
- Perl/hash-ref style: {tool => "shell", args => { --command "ls" }}
Previously these were sent as plain text to the Telegram channel instead
of being executed as tool calls.
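For the hash-ref style, a single-argument sketch (again the regex
crate; pattern illustrative):

    use regex::Regex;

    fn parse_perl_style(text: &str) -> Option<(String, String, String)> {
        // Matches e.g. {tool => "shell", args => { --command "ls" }}
        let re = Regex::new(
            r#"\{tool\s*=>\s*"(\w+)"\s*,\s*args\s*=>\s*\{\s*--(\w+)\s+"([^"]*)"\s*\}\s*\}"#,
        )
        .ok()?;
        let c = re.captures(text)?;
        Some((c[1].to_string(), c[2].to_string(), c[3].to_string()))
    }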
Also fixes build warnings:
- Add #[allow(unused_imports)] to cost/mod.rs and onboard/mod.rs re-exports
- Change channels::handle_command visibility to pub(crate)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add input_tokens and output_tokens fields to ObserverEvent::LlmResponse
so per-call token data flows through all observer backends. Prometheus
gains three new counters (llm_requests_total, tokens_input_total,
tokens_output_total) for granular token tracking by provider/model.
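The widened variant, roughly (other fields and variants elided):

    pub enum ObserverEvent {
        LlmResponse {
            provider: String,
            model: String,
            input_tokens: Option<u64>,
            output_tokens: Option<u64>,
        },
        // ... remaining variants unchanged
    }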
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add a lightweight TokenUsage struct to providers::traits with
input_tokens and output_tokens fields. Add usage: Option<TokenUsage>
to ChatResponse and update all construction sites across providers
and agent modules with usage: None.
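Sketched as described (derives and integer width assumed):

    #[derive(Debug, Clone, Copy, PartialEq, Eq)]
    pub struct TokenUsage {
        pub input_tokens: u64,
        pub output_tokens: u64,
    }

    pub struct ChatResponse {
        pub text: String,
        pub tool_calls: Vec<String>, // simplified; real type is richer
        pub usage: Option<TokenUsage>, // None at every call site for now
    }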
This is the first step toward capturing token usage data from LLM
API responses. Currently all sites set usage: None — subsequent
commits will parse actual usage from each provider's response format.
- Remove duplicate `chat` method in reliable.rs (E0201)
- Fix `futures` → `futures_util` imports in agent.rs and loop_.rs (E0433)
- Gate PostgresMemory behind `memory-postgres` feature in cli.rs (E0433)
- Fix regex backreference in XML tool parser (unsupported by the regex
  crate; workaround sketched after this list)
- Add missing `skills_prompt_mode` argument in test
- Apply rustfmt to files with formatting issues on main
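For the backreference item above: the regex crate cannot express
<(\w+)>(.*?)</\1>, so one workaround is to capture both tag names and
compare them (a sketch, not the exact parser code):

    use regex::Regex;

    fn matching_tag(text: &str) -> Option<(String, String)> {
        let re = Regex::new(r"(?s)<(\w+)>(.*?)</(\w+)>").ok()?;
        re.captures_iter(text)
            .find(|c| c[1] == c[3]) // closing tag must echo the opener
            .map(|c| (c[1].to_string(), c[2].to_string()))
    }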
Gateway channels (WhatsApp, Linq, Nextcloud Talk) were returning raw
<tool> tags without executing tools or showing results. The CLI
correctly executed tools and returned results.
Root cause: gateway handlers used run_gateway_chat_with_multimodal, which
explicitly disabled tools for simple chat-only mode.
Fix: Create run_gateway_chat_with_tools() which uses process_message()
for full tool support, while keeping run_gateway_chat_simple() for
the webhook endpoint to maintain backward compatibility with tests.
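Sketch of the split (hypothetical stubs; the real functions take
gateway state and channel context):

    async fn process_message(msg: &str) -> String {
        // full agent loop: parse tool calls, execute, feed results back
        format!("agent-handled: {msg}")
    }

    async fn run_gateway_chat_with_tools(msg: &str) -> String {
        process_message(msg).await
    }

    async fn run_gateway_chat_simple(msg: &str) -> String {
        // chat-only path retained for the webhook endpoint and its tests
        format!("plain chat: {msg}")
    }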
Changes:
- Add run_gateway_chat_with_tools() for channel handlers (uses process_message)
- Keep run_gateway_chat_simple() for webhook endpoint (uses state.provider)
- Remove unused provider_label variables from channel handlers
- Remove unused imports (ChatMessage, ProviderCapabilityError)
- Fix pre-existing test compilation issue (missing SkillsPromptInjectionMode)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove duplicate chat method in ReliableProvider impl (E0201)
The second chat fn (lines 662-769) was an exact duplicate of the
first (lines 540-647) in the same impl block.
- Gate PostgresMemory usage in memory CLI behind memory-postgres feature (E0433)
super::PostgresMemory is only exported when the feature is enabled;
the Postgres match arm now compiles to an explicit bail when the
feature is off.
- Replace futures::future::join_all with futures_util::future::join_all (E0433)
  The crate depends on futures-util, not futures. Fixed in both
  agent.rs and loop_.rs.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
ReliableProvider was missing a chat() override, causing it to fall through
to the default Provider::chat() trait implementation. The default
implementation delegates to chat_with_history() which returns a plain
String and wraps it in ChatResponse with tool_calls: Vec::new() — so
native tool calling was completely broken through the retry/failover
wrapper even though the underlying provider properly supports it.
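The failure mode, reduced to a sketch (names and types simplified):

    trait Provider {
        fn chat_with_history(&self) -> String;
        // Default method: wraps the plain text and always reports
        // zero tool calls, which is what ReliableProvider fell into.
        fn chat(&self) -> (String, Vec<String>) {
            (self.chat_with_history(), Vec::new())
        }
    }

    struct ReliableProvider;
    impl Provider for ReliableProvider {
        fn chat_with_history(&self) -> String { String::new() }
        // Fix: add a chat() override here that runs the usual
        // retry/backoff/failover and forwards the inner provider's
        // real response, tool calls included.
    }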
Changes:
- Add chat() with full retry/backoff/failover logic matching existing
chat_with_system(), chat_with_history(), and chat_with_tools() overrides
- Include context_window_exceeded early-exit matching other method patterns
- Add 7 focused tests: delegation with tool calls, retry recovery,
supports_native_tools propagation, aggregated error reporting,
model failover, non-retryable error skip, and system prompt zero-XML
verification
On non-CLI channels (Telegram, Discord, etc.), tools like shell and
file_write cannot receive interactive approval and are auto-denied,
causing the LLM to see confusing error responses and fabricate answers.
Add a new config option `non_cli_excluded_tools` under `[autonomy]`
that removes specified tools from the tool specs sent to the LLM on
non-CLI channels. This prevents the model from attempting tool calls
that would fail, forcing it to use data already in the system prompt.
The change filters tool_specs in run_tool_call_loop when the
excluded_tools parameter is non-empty. CLI channels are unaffected.
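Illustrative config and filter (tool names are examples; only the
option name comes from this change):

    // [autonomy]
    // non_cli_excluded_tools = ["shell", "file_write"]

    fn filter_tool_specs(specs: Vec<String>, excluded: &[String]) -> Vec<String> {
        if excluded.is_empty() {
            return specs; // CLI passes an empty list, so it is untouched
        }
        specs.into_iter().filter(|n| !excluded.contains(n)).collect()
    }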
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
When autonomy is set to "supervised", the approval gate only prompted
interactively on CLI. On Telegram and other channels, all tool calls
were silently auto-approved with ApprovalResponse::Yes, including
high-risk tools like shell — completely bypassing supervised mode.
On non-CLI channels where interactive prompting is not possible, deny
tool calls that require approval instead of auto-approving. Users can
expand the auto_approve list in config to explicitly allow specific
tools on non-interactive channels.
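The gate, sketched with reduced types:

    enum ApprovalResponse { Yes, No }

    fn approve(tool: &str, is_cli: bool, auto_approve: &[String]) -> ApprovalResponse {
        if auto_approve.iter().any(|t| t == tool) {
            return ApprovalResponse::Yes; // explicit config opt-in
        }
        if is_cli {
            ApprovalResponse::Yes // stands in for the interactive prompt
        } else {
            ApprovalResponse::No // was a silent Yes before this fix
        }
    }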
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
AnthropicProvider declared supports_native_tools() = true but did not
override chat_with_tools(). The default trait implementation drops all
conversation history (sends only system + last user message), breaking
multi-turn conversations on Telegram and other channels.
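Reduced to a sketch, the default-method trap looks like this (the real
methods also carry tool JSON; types simplified):

    trait Provider {
        fn chat(&self, history: &[String]) -> String;
        // Default impl: forwards only the tail of the conversation,
        // which is the history-dropping behavior described above.
        fn chat_with_tools(&self, history: &[String]) -> String {
            self.chat(&history[history.len().saturating_sub(1)..])
        }
    }

    struct AnthropicProvider;
    impl Provider for AnthropicProvider {
        fn chat(&self, history: &[String]) -> String {
            format!("saw {} turns", history.len())
        }
        // Override: hand the full history to chat().
        fn chat_with_tools(&self, history: &[String]) -> String {
            self.chat(history)
        }
    }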
Changes:
- Override chat_with_tools() in AnthropicProvider: converts OpenAI-format
tool JSON to ToolSpec and delegates to chat() which preserves full
message history
- Skip build_tool_instructions() XML protocol when provider supports
native tools (saves ~12k chars in system prompt)
- Remove duplicate Tool Use Protocol section from build_system_prompt()
for native-tool providers
- Update Your Task section to encourage conversational follow-ups
instead of XML tool_call tags when using native tools
- Add tracing::warn for malformed tool definitions in chat_with_tools
Every user message was auto-saved to memory regardless of length,
flooding the store with trivial entries like "ok", "thanks", "hi".
These noise entries competed with real memories during recall, degrading
relevance — especially with keyword-only search.
Skip auto-saving messages shorter than 20 characters. Applied to both
the channel path (channels/mod.rs) and CLI agent path (agent/loop_.rs).
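The guard is a one-liner at both call sites, roughly (threshold from
the commit; helper name illustrative):

    const MIN_AUTO_SAVE_CHARS: usize = 20;

    fn should_auto_save(message: &str) -> bool {
        // chars(), not bytes, so multibyte text is not over-counted
        message.chars().count() >= MIN_AUTO_SAVE_CHARS
    }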
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Adds section markers and decision-point comments to the three most complex
control-flow modules. Comments explain loop invariants, retry/fallback
strategy, security policy precedence rules, and error handling rationale.
This improves maintainability by making the reasoning behind complex
branches explicit for reviewers and future contributors.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Implement chat_with_tools() on CompatibleProvider so OpenAI-compatible
endpoints (OpenRouter, local LLMs, etc.) can use structured tool calling
instead of prompt-injected tool descriptions.
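The request-side conversion targets the OpenAI tools array shape,
roughly (helper name from this commit; body illustrative):

    use serde_json::{json, Value};

    fn tools_to_openai_format_from_specs(
        specs: &[(String, String, Value)], // (name, description, parameters)
    ) -> Vec<Value> {
        specs
            .iter()
            .map(|(name, desc, params)| {
                json!({
                    "type": "function",
                    "function": {
                        "name": name,
                        "description": desc,
                        "parameters": params,
                    }
                })
            })
            .collect()
    }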
Changes:
- CompatibleProvider: capabilities() reports native_tool_calling, new
chat_with_tools() sends tools in API request and parses tool_calls
from response, chat() bridges to chat_with_tools() when ToolSpecs
are provided
- RouterProvider: chat_with_tools() delegation with model hint resolution
- loop_.rs: expose tools_to_openai_format as pub(crate), add
tools_to_openai_format_from_specs for ToolSpec-based conversion
Adds 9 new tests and updates 1 existing test.
- Add strip_tool_call_tags() to finalize_draft to prevent Markdown
parse failures from tool-call tags reaching Telegram API
- Deduplicate parse_reply_target() call in update_draft (was called
twice, discarding thread_id both times)
- Replace body.as_object_mut().unwrap() mutation with separate
plain_body JSON literal (eliminates unwrap in runtime path)
- Clean up per-chat rate-limit HashMap entry in finalize_draft to
prevent unbounded growth over long uptimes
- Extract magic number 80 to STREAM_CHUNK_MIN_CHARS constant in
agent loop
Previously on_delta sent the entire completed response as a single
message, defeating the purpose of the streaming draft updates. Now
the text is split into ~80-char chunks on whitespace boundaries
(UTF-8 safe via split_inclusive) and sent progressively through the
channel, so Telegram draft edits show text arriving incrementally.
The consumer in process_channel_message already accumulates chunks
and calls update_draft with the full text so far, and Telegram's
rate-limiting (draft_update_interval_ms) throttles editMessageText
calls to avoid API spam.
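The chunking, sketched (byte length used as a cheap stand-in for the
~80-char threshold):

    const STREAM_CHUNK_MIN_CHARS: usize = 80;

    fn chunk_on_whitespace(text: &str) -> Vec<String> {
        let mut chunks = Vec::new();
        let mut current = String::new();
        // split_inclusive keeps the whitespace attached to each piece
        // and never cuts inside a UTF-8 code point.
        for piece in text.split_inclusive(char::is_whitespace) {
            current.push_str(piece);
            if current.len() >= STREAM_CHUNK_MIN_CHARS {
                chunks.push(std::mem::take(&mut current));
            }
        }
        if !current.is_empty() {
            chunks.push(current);
        }
        chunks
    }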