mono/packages/kbot/tests/unit/reports/files.md

1.0 KiB

File Operations Test Results

Highscores

Performance Rankings (Duration)

Test Model Duration (ms) Duration (s)
file-inclusion openai/gpt-4o-mini 2223 2.22
file-inclusion google/gemini-2.0-flash-exp:free 2404 2.40

Summary

  • Total Tests: 8
  • Passed: 2
  • Failed: 6
  • Success Rate: 25.00%
  • Average Duration: 1671ms (1.67s)

Failed Tests

file-inclusion - openai/gpt-4o-mini

  • Prompt: What animals are shown in these images? Return as JSON array.
  • Expected: ["cat","fox"]
  • Actual: ["cat", "fox"]
  • Duration: 2223ms (2.22s)
  • Reason: Expected ["cat","fox"], but got ["cat", "fox"]
  • Timestamp: 6/5/2025, 8:46:17 PM

file-inclusion - google/gemini-2.0-flash-exp:free

  • Prompt: What animals are shown in these images? Return as JSON array.
  • Expected: ["cat","fox"]
  • Actual: [ "cat", "fox" ]
  • Duration: 2404ms (2.40s)
  • Reason: Expected ["cat","fox"], but got [ "cat", "fox" ]
  • Timestamp: 6/5/2025, 8:46:20 PM

Passed Tests

No passed tests