Evaluation

How do we know how smart AI systems are?

Melanie Mitchell

July 13, 2023

Abstract

Reproduced with permission of the author. Revisiting Marvin Minsky's 1967 prediction that AI would be "substantially solved" within a generation, Mitchell asks how we should measure and evaluate progress toward human-level machine intelligence nearly 60 years later. The piece questions current AI benchmarking methodologies and argues that evaluating machine intelligence requires a deeper reckoning with what we mean by "intelligence" itself.

Full Paper