The Tracking AI website evaluates various AI models on two question sets: the Mensa Norway IQ Test and a separate set of offline questions, the latter used to avoid overlap with the models' training data.