ADeLe: Predicting and explaining AI performance across tasks
At a glance AI benchmarks report performance on specific tasks but provide limited insight into underlying capabilities; ADeLe evaluates models by scoring both tasks and models across 18 core abilities, enabling direct comparison between task demands and model capabilities. Using these ability scores, the
Read more