Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP

Summary

This is a publication. If there is no link to the publication on this page, you can try the pre-formated search via the search engines listed on this page.

Authors: Anya Belz, Craig Thomson, Ehud Reiter, Simon Mille

Journal title: Findings of the Association for Computational Linguistics: ACL 2023

Journal publisher: Association for Computational Linguistics

Published year: 2023

DOI identifier: 10.18653/v1/2023.findings-acl.226