AI Concepts
Benchmark
Definition
A standardised test used to compare AI model performance on tasks such as reasoning, coding, or knowledge.
In Plain English
A common exam for AI models so we can compare them fairly.
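As a rough illustration of the "common exam" idea, the sketch below runs the same fixed questions through two models and scores both by the same rule. Everything in it is an assumption made for illustration: the three-item `BENCHMARK` list, the exact-match scoring rule, and the toy `model_a` and `model_b` functions, which stand in for real AI models. Real benchmarks are far larger and often use more nuanced scoring.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical mini-benchmark: a fixed list of (question, expected answer) pairs.
BENCHMARK: List[Tuple[str, str]] = [
    ("2 + 2", "4"),
    ("Capital of France", "Paris"),
    ("Reverse 'abc'", "cba"),
]


def score(model: Callable[[str], str],
          items: List[Tuple[str, str]] = BENCHMARK) -> float:
    """Return the fraction of items answered exactly right (assumed scoring rule)."""
    correct = sum(1 for question, expected in items
                  if model(question).strip() == expected)
    return correct / len(items)


def model_a(question: str) -> str:
    """Toy stand-in for a model that knows all three answers."""
    answers = {"2 + 2": "4", "Capital of France": "Paris", "Reverse 'abc'": "cba"}
    return answers.get(question, "")


def model_b(question: str) -> str:
    """Toy stand-in for a model that gets one right, one wrong, and skips one."""
    answers = {"2 + 2": "4", "Capital of France": "Lyon"}
    return answers.get(question, "")


def compare(models: Dict[str, Callable[[str], str]]) -> Dict[str, float]:
    """Score every model on the same items so the results are comparable."""
    return {name: score(fn) for name, fn in models.items()}


if __name__ == "__main__":
    for name, result in compare({"model_a": model_a, "model_b": model_b}).items():
        print(f"{name}: {result:.2f}")
```

Because both models see identical questions and are graded identically, the difference in their scores reflects the models rather than the test.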