2 code implementations • 12 Sep 2022 • Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, Naveen Mellempudi, Stuart Oberman, Mohammad Shoeybi, Michael Siu, Hao Wu
FP8 is a natural progression for accelerating deep learning training inference beyond the 16-bit formats common in modern processors.
no code implementations • 11 Jun 2021 • Darko Stosic, Dusan Stosic, Irena Vodenska, H. Eugene Stanley, Tatijana Stosic
From the time-dependent multifractal analysis we find that multifractal spectra for Monday returns are much wider than for other days during periods of financial crises.
no code implementations • 10 Jun 2021 • Dusan Stosic, Darko Stosic, Teresa B. Ludermir, Borko Stosic
In this work we present a simple approach based on concepts from statistical physics to learn optimal distance metric for a given problem.
no code implementations • 27 May 2021 • Darko Stosic, Dusan Stosic
While larger neural models are pushing the boundaries of what deep learning can do, often more weights are needed to train models rather than to run inference for tasks.
2 code implementations • 16 Apr 2021 • Asit Mishra, Jorge Albericio Latorre, Jeff Pool, Darko Stosic, Dusan Stosic, Ganesh Venkatesh, Chong Yu, Paulius Micikevicius
We present the design and behavior of Sparse Tensor Cores, which exploit a 2:4 (50%) sparsity pattern that leads to twice the math throughput of dense matrix units.