Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers

Summary

Authors: Azzam Haidar, Stanimire Tomov, Jack Dongarra, Nick Higham

Journal title: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis

Journal publisher: Association for Computing Machinery

Published year: 2019