PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices

Summary

Authors: Osawa, Kazuki; Li, Shigang; Hoefler, Torsten

Journal title: arXiv

Journal number: 30

Journal publisher: arXiv

Published year: 2022

DOI identifier: 10.48550/arxiv.2211.14133

ISSN: 2331-8422