Increasing Learning Efficiency of Self-Attention Networks through Direct Position Interactions, Learnable Temperature, and Convoluted Attention

Summary

Authors: Philipp Dufter, Martin Schmitt, Hinrich Schütze

Journal title: Proceedings of the 28th International Conference on Computational Linguistics

Journal number: December 2020

Journal publisher: Association for Computational Linguistics

Published year: 2020

DOI identifier: 10.5282/ubm/epub.74088