Predicting attention sparsity in transformers

Summary

This is a publication. If there is no link to the publication on this page, you can try the pre-formated search via the search engines listed on this page.

Authors: Marcos Treviso, António Góis, Patrick Fernandes, Erick Fonseca, André F. T. Martins

Journal title: ACL Workshop on Structured Prediction for Natural Language Processing (SPNLP'22)

Journal publisher: ACL Anthology

Published year: 2022