Dynabench: Rethinking Benchmarking in NLP

Facebook NLP Research

Abstract

We introduce Dynabench, an open-source platform for dynamic dataset creation and model benchmarking. Dynabench runs in a web browser and supports human-and-model-in-the-loop dataset creation: annotators seek to create examples that a target model will misclassify, but that another person will not.

 

 

To finish reading, please visit source site