Add cascading fallback to file_read tool: UTF-8 → PDF text extraction (via pdf-extract) → lossy UTF-8 conversion. Binary files no longer produce errors; PDFs return extracted text, other binaries get lossy output with U+FFFD replacement characters. Changes: - Cargo.toml: add rag-pdf to default features - file_read.rs: cascading fallback logic + try_extract_pdf_text helper - file_read.rs: update tool description - test_document.pdf: replace empty fixture with PDF containing "Hello PDF" - Tests: remove file_read_rejects_binary_pdf, add unit + e2e tests for PDF extraction and lossy binary reads (including live OpenAI Codex e2e) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| fixtures | ||
| agent_e2e.rs | ||
| agent_loop_robustness.rs | ||
| channel_routing.rs | ||
| config_persistence.rs | ||
| dockerignore_test.rs | ||
| memory_comparison.rs | ||
| memory_restart.rs | ||
| otel_dependency_feature_regression.rs | ||
| provider_resolution.rs | ||
| provider_schema.rs | ||
| reply_target_field_regression.rs | ||
| telegram_attachment_fallback.rs | ||
| whatsapp_webhook_security.rs | ||