AI Concepts

Benchmark

Definition

A standardised set of tasks, scored the same way for every model, used to compare AI model performance in areas such as reasoning, coding, or knowledge.

In Plain English

A common exam that every AI model takes, so their results can be compared fairly.
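
To make the idea concrete, here is a minimal sketch of a benchmark in Python. Everything in it is hypothetical and for illustration only: the four questions, the exact-match scoring rule, and the two toy "models" (plain functions standing in for real AI systems). Real benchmarks are far larger and often use more careful scoring, but the principle is the same: every model answers the same fixed questions and is graded by the same rule.

```python
from typing import Callable, List, Tuple

# Hypothetical benchmark: a fixed list of (question, expected answer) pairs.
BENCHMARK: List[Tuple[str, str]] = [
    ("2 + 2", "4"),
    ("capital of France", "Paris"),
    ("opposite of hot", "cold"),
    ("3 * 3", "9"),
]


def score(model: Callable[[str], str],
          items: List[Tuple[str, str]] = BENCHMARK) -> float:
    """Return the fraction of items the model answers exactly right."""
    correct = sum(1 for question, expected in items
                  if model(question).strip() == expected)
    return correct / len(items)


# Toy stand-ins for real models (assumptions, not real systems).
def model_a(question: str) -> str:
    answers = {
        "2 + 2": "4",
        "capital of France": "Paris",
        "opposite of hot": "cold",
        "3 * 3": "6",  # deliberately wrong
    }
    return answers.get(question, "")


def model_b(question: str) -> str:
    answers = {"2 + 2": "4", "capital of France": "Lyon"}
    return answers.get(question, "")


# Same questions, same scoring rule, so the two scores are comparable.
results = {"model_a": score(model_a), "model_b": score(model_b)}
print(results)
```

Because both models are graded on identical items with an identical rule, the difference in their scores reflects the models rather than the test, which is what makes the comparison fair.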