Search Results for author: Guanhua Wang

Found 19 papers, 7 papers with code

Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping

1 code implementation · 23 Sep 2024 · Guanhua Wang, Chengming Zhang, Zheyu Shen, Ang Li, Olatunji Ruwase

Given the popularity of generative AI, Large Language Models (LLMs) often consume hundreds or thousands of GPUs for parallelizing and accelerating the training process.

FastPersist: Accelerating Model Checkpointing in Deep Learning

no code implementations · 19 Jun 2024 · Guanhua Wang, Olatunji Ruwase, Bing Xie, Yuxiong He

Model checkpoints are critical Deep Learning (DL) artifacts that enable fault tolerance for training and downstream applications, such as inference.

Deep Learning model

Artificial Intelligence for Neuro MRI Acquisition: A Review

no code implementations · 10 Jun 2024 · Hongjia Yang, Guanhua Wang, Ziyu Li, Haoxiang Li, Jialan Zheng, Yuxin Hu, Xiaozhi Cao, Congyu Liao, Huihui Ye, Qiyuan Tian

Magnetic resonance imaging (MRI) has significantly benefited from the resurgence of artificial intelligence (AI).

3D MR Fingerprinting for Dynamic Contrast-Enhanced Imaging of Whole Mouse Brain

no code implementations · 1 May 2024 · Yuran Zhu, Guanhua Wang, Yuning Gu, Walter Zhao, Jiahao Lu, Junqing Zhu, Christina J. MacAskill, Andrew Dupuis, Mark A. Griswold, Dan Ma, Chris A. Flask, Xin Yu

We present the first dynamic and multi-parametric approach for quantitatively tracking contrast agent transport in the mouse brain using 3D MRF.

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

no code implementations · 22 Apr 2024 · Marah Abdin, Jyoti Aneja, Hany Awadalla, Ahmed Awadallah, Ammar Ahmad Awan, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Qin Cai, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Weizhu Chen, Yen-Chun Chen, Yi-Ling Chen, Hao Cheng, Parul Chopra, Xiyang Dai, Matthew Dixon, Ronen Eldan, Victor Fragoso, Jianfeng Gao, Mei Gao, Min Gao, Amit Garg, Allie Del Giorno, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Wenxiang Hu, Jamie Huynh, Dan Iter, Sam Ade Jacobs, Mojan Javaheripi, Xin Jin, Nikos Karampatziakis, Piero Kauffmann, Mahoud Khademi, Dongwoo Kim, Young Jin Kim, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Yunsheng Li, Chen Liang, Lars Liden, Xihui Lin, Zeqi Lin, Ce Liu, Liyuan Liu, Mengchen Liu, Weishung Liu, Xiaodong Liu, Chong Luo, Piyush Madan, Ali Mahmoudzadeh, David Majercak, Matt Mazzola, Caio César Teodoro Mendes, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Liliang Ren, Gustavo de Rosa, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Yelong Shen, Swadheen Shukla, Xia Song, Masahiro Tanaka, Andrea Tupini, Praneetha Vaddamanu, Chunyu Wang, Guanhua Wang, Lijuan Wang, Shuohang Wang, Xin Wang, Yu Wang, Rachel Ward, Wen Wen, Philipp Witte, Haiping Wu, Xiaoxia Wu, Michael Wyatt, Bin Xiao, Can Xu, Jiahang Xu, Weijian Xu, Jilong Xue, Sonali Yadav, Fan Yang, Jianwei Yang, Yifan Yang, ZiYi Yang, Donghan Yu, Lu Yuan, Chenruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone.

Ranked #5 on MMR total on MRR-Benchmark (using extra training data)

Language Modeling, Language Modelling (+3 more)

Free Space Optical Communication for Inter-Satellite Link: Architecture, Potentials and Trends

no code implementations · 26 Oct 2023 · Guanhua Wang, Fang Yang, Jian Song, Zhu Han

The sixth-generation (6G) network is expected to achieve global coverage based on the space-air-ground integrated network, and the latest satellite network will play an important role in it.

Scheduling

ZeRO++: Extremely Efficient Collective Communication for Giant Model Training

1 code implementation · 16 Jun 2023 · Guanhua Wang, Heyang Qin, Sam Ade Jacobs, Connor Holmes, Samyam Rajbhandari, Olatunji Ruwase, Feng Yan, Lei Yang, Yuxiong He

Zero Redundancy Optimizer (ZeRO) has been used to train a wide range of large language models on massive GPUs clusters due to its ease of use, efficiency, and good scalability.

Quantization

Adaptive Sampling for Linear Sensing Systems via Langevin Dynamics

no code implementations · 27 Feb 2023 · Guanhua Wang, Douglas C. Noll, Jeffrey A. Fessler

Adaptive or dynamic signal sampling in sensing systems can adapt subsequent sampling strategies based on acquired signals, thereby potentially improving image quality and speed.

Composing MPC with LQR and Neural Network for Amortized Efficiency and Stable Control

no code implementations · 14 Dec 2021 · Fangyu Wu, Guanhua Wang, Siyuan Zhuang, Kehan Wang, Alexander Keimer, Ion Stoica, Alexandre Bayen

The proposed scheme does not require pre-computation and can improve the amortized running time of the composed MPC with a well-trained neural network.

Computational Efficiency, Model Predictive Control

Efficient approximation of Jacobian matrices involving a non-uniform fast Fourier transform (NUFFT)

2 code implementations · 4 Nov 2021 · Guanhua Wang, Jeffrey A. Fessler

In fact, we show that model-based image reconstruction (MBIR) methods with suitably optimized imaging parameters can perform nearly as well as CNN-based methods.

Image Reconstruction, Stochastic Optimization

Blind Primed Supervised (BLIPS) Learning for MR Image Reconstruction

2 code implementations · 11 Apr 2021 · Anish Lahiri, Guanhua Wang, Saiprasad Ravishankar, Jeffrey A. Fessler

We also compare the proposed method to alternative approaches for combining dictionary-based methods with supervised learning in MR image reconstruction.

Dictionary Learning, Image Reconstruction

B-spline Parameterized Joint Optimization of Reconstruction and K-space Trajectories (BJORK) for Accelerated 2D MRI

2 code implementations · 27 Jan 2021 · Guanhua Wang, Tianrui Luo, Jon-Fredrik Nielsen, Douglas C. Noll, Jeffrey A. Fessler

Though trained with neural network-based reconstruction, the proposed trajectory also leads to improved image quality with compressed sensing-based reconstruction.

Image Reconstruction

ResiliNet: Failure-Resilient Inference in Distributed Neural Networks

no code implementations · 18 Feb 2020 · Ashkan Yousefpour, Brian Q. Nguyen, Siddartha Devic, Guanhua Wang, Aboudy Kreidieh, Hans Lobel, Alexandre M. Bayen, Jason P. Jue

Nevertheless, when a neural network is partitioned and distributed among physical nodes, failure of physical nodes causes the failure of the neural units that are placed on those nodes, which results in a significant performance drop.

Federated Learning

Blink: Fast and Generic Collectives for Distributed ML

no code implementations · 11 Oct 2019 · Guanhua Wang, Shivaram Venkataraman, Amar Phanishayee, Jorgen Thelin, Nikhil Devanur, Ion Stoica

Model parameter synchronization across GPUs introduces high overheads for data-parallel training at scale.

Image Classification

KLEESPECTRE: Detecting Information Leakage through Speculative Cache Attacks via Symbolic Execution

1 code implementation · 2 Sep 2019 · Guanhua Wang, Sudipta Chattopadhyay, Arnab Kumar Biswas, Tulika Mitra, Abhik Roychoudhury

Spectre attacks disclosed in early 2018 expose data leakage scenarios via cache side channels.

Cryptography and Security

Gathering Cyber Threat Intelligence from Twitter Using Novelty Classification

no code implementations · 3 Jul 2019 · Ba Dung Le, Guanhua Wang, Mehwish Nasim, Ali Babar

Our analysis of the classification results suggests that Cyber threat relevant tweets on Twitter do not often include the CVE identifier of the related threats.

Binary Classification, Classification (+2 more)

Task-GAN for Improved GAN based Image Restoration

no code implementations · ICLR 2019 · Jiahong Ouyang, Guanhua Wang, Enhao Gong, Kevin Chen, John Pauly, Greg Zaharchuk

Deep Learning (DL) algorithms based on Generative Adversarial Network (GAN) have demonstrated great potentials in computer vision tasks such as image restoration.

Diagnostic, Generative Adversarial Network (+4 more)

oo7: Low-overhead Defense against Spectre Attacks via Program Analysis

2 code implementations · 16 Jul 2018 · Guanhua Wang, Sudipta Chattopadhyay, Ivan Gotovchits, Tulika Mitra, Abhik Roychoudhury

In this paper, we propose oo7, a static analysis approach that can mitigate Spectre attacks by detecting potentially vulnerable code snippets in program binaries and protecting them against the attack by patching them.

Cryptography and Security
