How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models

Summary

Authors: Phillip Rust, Jonas Pfeiffer, Ivan Vulić, Sebastian Ruder, Iryna Gurevych

Journal title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)

Journal publisher: Association for Computational Linguistics

Published year: 2021

Published pages: 3118-3135

DOI identifier: 10.18653/v1/2021.acl-long.243