5.5 KiB
5.5 KiB
Language Test Results
Failed Tests
german - deepseek/deepseek-chat:free
- Prompt:
translate "hello" to German. Return only the translated word, no explanation. - Expected:
hallo - Actual: ``
- Duration: 985ms
- Error Type: Error
- Error Code: UNKNOWN
- Error Message: Model returned empty response
- Reason: Model returned empty response
- Timestamp: 4/1/2025, 1:05:50 PM
german - google/gemini-2.0-flash-exp:free
- Prompt:
translate "hello" to German. Return only the translated word, no explanation. - Expected:
hallo - Actual: ``
- Duration: 746ms
- Error Type: Error
- Error Code: UNKNOWN
- Error Message: Model returned empty response
- Reason: Model returned empty response
- Timestamp: 4/1/2025, 1:05:51 PM
german - gpt-4
- Prompt:
translate "hello" to German. Return only the translated word, no explanation. - Expected:
hallo - Actual: ``
- Duration: 1067ms
- Reason: Unknown error occurred
- Timestamp: 4/1/2025, 1:05:52 PM
german - anthropic/claude-2.0
- Prompt:
translate "hello" to German. Return only the translated word, no explanation. - Expected:
hallo - Actual: ``
- Duration: 1253ms
- Reason: Unknown error occurred
- Timestamp: 4/1/2025, 1:47:26 PM
spanish - deepseek/deepseek-chat:free
- Prompt:
translate "yes" to Spanish. Return only the translated word, no explanation. - Expected:
sí - Actual: ``
- Duration: 678ms
- Error Type: Error
- Error Code: UNKNOWN
- Error Message: Model returned empty response
- Reason: Model returned empty response
- Timestamp: 4/1/2025, 1:05:53 PM
spanish - google/gemini-2.0-flash-exp:free
- Prompt:
translate "yes" to Spanish. Return only the translated word, no explanation. - Expected:
sí - Actual: ``
- Duration: 744ms
- Error Type: Error
- Error Code: UNKNOWN
- Error Message: Model returned empty response
- Reason: Model returned empty response
- Timestamp: 4/1/2025, 1:05:53 PM
spanish - gpt-4
- Prompt:
translate "yes" to Spanish. Return only the translated word, no explanation. - Expected:
sí - Actual: ``
- Duration: 1125ms
- Reason: Unknown error occurred
- Timestamp: 4/1/2025, 1:05:55 PM
spanish - anthropic/claude-2.0
- Prompt:
translate "yes" to Spanish. Return only the translated word, no explanation. - Expected:
sí - Actual: ``
- Duration: 932ms
- Reason: Unknown error occurred
- Timestamp: 4/1/2025, 1:47:27 PM
french - deepseek/deepseek-chat:free
- Prompt:
translate "no" to French. Return only the translated word, no explanation. - Expected:
non - Actual: ``
- Duration: 626ms
- Error Type: Error
- Error Code: UNKNOWN
- Error Message: Model returned empty response
- Reason: Model returned empty response
- Timestamp: 4/1/2025, 1:05:55 PM
french - gpt-4
- Prompt:
translate "no" to French. Return only the translated word, no explanation. - Expected:
non - Actual: ``
- Duration: 1341ms
- Reason: Unknown error occurred
- Timestamp: 4/1/2025, 1:05:57 PM
french - google/gemini-2.0-flash-exp:free
- Prompt:
translate "no" to French. Return only the translated word, no explanation. - Expected:
non - Actual: ``
- Duration: 729ms
- Error Type: Error
- Error Code: UNKNOWN
- Error Message: Model returned empty response
- Reason: Model returned empty response
- Timestamp: 4/1/2025, 1:05:56 PM
french - anthropic/claude-2.0
- Prompt:
translate "no" to French. Return only the translated word, no explanation. - Expected:
non - Actual: ``
- Duration: 864ms
- Reason: Unknown error occurred
- Timestamp: 4/1/2025, 1:47:28 PM
Passed Tests
german_translation - deepseek/deepseek-chat:free
- Prompt:
translate "hello" to German. Return only the translation, no explanation. - Expected:
hallo - Actual:
Hallo - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:01 PM
german_translation - google/gemini-2.0-flash-exp:free
- Prompt:
translate "hello" to German. Return only the translation, no explanation. - Expected:
hallo - Actual:
Hallo - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:02 PM
german_translation - gpt-4
- Prompt:
translate "hello" to German. Return only the translation, no explanation. - Expected:
hallo - Actual:
Hallo - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:32 PM
spanish_translation - deepseek/deepseek-chat:free
- Prompt:
translate "yes" to Spanish. Return only the translation, no explanation. - Expected:
sí - Actual:
sí - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:05 PM
spanish_translation - google/gemini-2.0-flash-exp:free
- Prompt:
translate "yes" to Spanish. Return only the translation, no explanation. - Expected:
sí - Actual:
sí - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:06 PM
spanish_translation - gpt-4
- Prompt:
translate "yes" to Spanish. Return only the translation, no explanation. - Expected:
sí - Actual:
sí - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:35 PM
french_translation - deepseek/deepseek-chat:free
- Prompt:
translate "no" to French. Return only the translation, no explanation. - Expected:
non - Actual:
non - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:08 PM
french_translation - google/gemini-2.0-flash-exp:free
- Prompt:
translate "no" to French. Return only the translation, no explanation. - Expected:
non - Actual:
non - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:10 PM
french_translation - gpt-4
- Prompt:
translate "no" to French. Return only the translation, no explanation. - Expected:
non - Actual:
non - Duration: undefinedms
- Timestamp: 4/1/2025, 12:56:37 PM