1.0 KiB
1.0 KiB
File Operations Test Results
Highscores
Performance Rankings (Duration)
| Test | Model | Duration (ms) | Duration (s) |
|---|---|---|---|
| file-inclusion | openai/gpt-4o-mini | 2223 | 2.22 |
| file-inclusion | google/gemini-2.0-flash-exp:free | 2404 | 2.40 |
Summary
- Total Tests: 8
- Passed: 2
- Failed: 6
- Success Rate: 25.00%
- Average Duration: 1671ms (1.67s)
Failed Tests
file-inclusion - openai/gpt-4o-mini
- Prompt:
What animals are shown in these images? Return as JSON array. - Expected:
["cat","fox"] - Actual:
["cat", "fox"] - Duration: 2223ms (2.22s)
- Reason: Expected ["cat","fox"], but got ["cat", "fox"]
- Timestamp: 6/5/2025, 8:46:17 PM
file-inclusion - google/gemini-2.0-flash-exp:free
- Prompt:
What animals are shown in these images? Return as JSON array. - Expected:
["cat","fox"] - Actual:
[ "cat", "fox" ] - Duration: 2404ms (2.40s)
- Reason: Expected ["cat","fox"], but got [ "cat", "fox" ]
- Timestamp: 6/5/2025, 8:46:20 PM
Passed Tests
No passed tests