CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery

no code implementations • 5 Nov 2020 • Kiwan Maeng, Shivam Bharuka, Isabel Gao, Mark C. Jeffrey, Vikram Saraph, Bor-Yiing Su, Caroline Trippel, Jiyan Yang, Mike Rabbat, Brandon Lucia, Carole-Jean Wu

To the best of our knowledge, this paper is the first to perform a data-driven, in-depth analysis of applying partial recovery to recommendation models, and it identifies a trade-off between accuracy and performance.

Enhancing Stratospheric Weather Analyses and Forecasts by Deploying Sensors from a Weather Balloon

no code implementations • 4 Dec 2019 • Kiwan Maeng, Iskender Kushan, Brandon Lucia, Ashish Kapoor

We propose a framework to collect stratospheric data by releasing a contrail of tiny sensor devices as a weather balloon ascends.

