Evaluation
How do we know how smart AI systems are?
Abstract
Reproduced with permission of the author.
Revisiting Marvin Minsky's 1967 prediction that AI would be "substantially solved" within a generation, Mitchell asks how we should measure and evaluate progress toward human-level machine intelligence nearly 60 years later. The piece questions current AI benchmarking methodologies and argues that evaluating machine intelligence requires a deeper reckoning with what we mean by "intelligence" itself.
Full Paper