no code implementations • 9 Oct 2024 • Wanchao Liang, Tianyu Liu, Less Wright, Will Constable, Andrew Gu, Chien-chin Huang, Iris Zhang, Wei Feng, Howard Huang, Junjie Wang, Sanket Purandare, Gokul Nadathur, Stratos Idreos
By stacking training optimizations, we demonstrate accelerations of 65. 08% with 1D parallelism at the 128-GPU scale (Llama 3. 1 8B), an additional 12. 59% with 2D parallelism at the 256-GPU scale (Llama 3. 1 70B), and an additional 30% with 3D parallelism at the 512-GPU scale (Llama 3. 1 405B) on NVIDIA H100 GPUs over optimized baselines.
1 code implementation • 29 Nov 2021 • Jack Langerman, Ziming Qiu, Gábor Sörös, Dávid Sebők, Yao Wang, Howard Huang
One of the key criticisms of deep learning is that large amounts of expensive and difficult-to-acquire training data are required in order to train models with high performance and good generalization capabilities.
no code implementations • 22 Feb 2021 • Chunhua Geng, Traian E. Abrudan, Veli-Matti Kolmonen, Howard Huang
In this paper, we study probabilistic time-of-arrival (ToA) and angle-of-arrival (AoA) joint localization in real indoor environments.
Networking and Internet Architecture Signal Processing
no code implementations • 6 Sep 2020 • Emre Gönültaş, Eric Lei, Jack Langerman, Howard Huang, Christoph Studer
Channel state information (CSI)-based fingerprinting via neural networks (NNs) is a promising approach to enable accurate indoor and outdoor positioning of user equipments (UEs), even under challenging propagation conditions.
no code implementations • 15 Oct 2019 • Fernando Perez-Cruz, Pablo M. Olmos, Michael Minyi Zhang, Howard Huang
In this paper, we take a new approach for time of arrival geo-localization.