958 B
958 B
Web URL Support Test Results
Highscores
Performance Rankings (Duration)
| Test | Model | Duration (ms) | Duration (s) |
|---|---|---|---|
| web_wikipedia | openai/gpt-3.5-turbo | 4125 | 4.13 |
| web_json | openai/gpt-3.5-turbo | 1033 | 1.03 |
Summary
- Total Tests: 2
- Passed: 0
- Failed: 2
- Success Rate: 0.00%
- Average Duration: 2579ms (2.58s)
Failed Tests
web_wikipedia - openai/gpt-3.5-turbo
- Prompt:
Does the content have information about Kenya? Answer with only "yes" or "no". - Expected:
yes - Actual: ``
- Duration: 4125ms (4.13s)
- Reason: Model returned empty response
- Timestamp: 4/6/2025, 5:48:31 PM
web_json - openai/gpt-3.5-turbo
- Prompt:
Is this data in JSON format? Answer with only "yes" or "no". - Expected:
yes - Actual: ``
- Duration: 1033ms (1.03s)
- Reason: Model returned empty response
- Timestamp: 4/6/2025, 5:48:33 PM
Passed Tests
No passed tests