no code implementations • 26 May 2021 • Zhaoxia Deng, Jongsoo Park, Ping Tak Peter Tang, Haixin Liu, Jie Yang, Hector Yuen, Jianyu Huang, Daya Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, Satish Nadathur, Changkyu Kim, Maxim Naumov, Sam Naghshineh, Mikhail Smelyanskiy
We share in this paper our search strategies for adapting reference recommendation models to low-precision hardware, our optimization of low-precision compute kernels, and the design and development of a tool chain that maintains our models' accuracy throughout their lifespan, during which topic trends and users' interests inevitably evolve.
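As a minimal sketch of the kind of low-precision technique involved (the function names and the row-wise scaling scheme here are my own illustrative assumptions, not the paper's production kernels), an embedding table can be quantized to int8 with a per-row scale:

```python
import numpy as np

def quantize_rowwise_int8(weights):
    """Quantize each embedding row to int8 with a per-row scale (illustrative sketch)."""
    scales = np.abs(weights).max(axis=1, keepdims=True) / 127.0   # per-row scale
    scales = np.maximum(scales, 1e-8)                             # guard against all-zero rows
    codes = np.clip(np.round(weights / scales), -127, 127).astype(np.int8)
    return codes, scales

def lookup_dequantized(codes, scales, row_ids):
    # Fetch and dequantize only the requested rows.
    return codes[row_ids].astype(np.float32) * scales[row_ids]

table = np.random.randn(1000, 16).astype(np.float32)   # toy embedding table
codes, scales = quantize_rowwise_int8(table)
rows = lookup_dequantized(codes, scales, np.array([3, 42, 7]))
```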
no code implementations • 21 Oct 2020 • Jie Amy Yang, Jianyu Huang, Jongsoo Park, Ping Tak Peter Tang, Andrew Tulloch
We propose a novel change to embedding tables using a cache memory architecture, where the majority of rows in an embedding table are trained in low precision, while the most frequently or recently accessed rows are cached and trained in full precision.
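A minimal sketch of the idea, assuming an fp16 backing store, an LRU cache, and hypothetical class and method names (this is not the paper's implementation):

```python
import numpy as np
from collections import OrderedDict

class CachedEmbedding:
    """Toy mixed-precision embedding: fp16 backing store plus an LRU cache
    of full-precision (fp32) rows for frequently/recently accessed indices."""

    def __init__(self, num_rows, dim, cache_size=64):
        self.store = np.random.randn(num_rows, dim).astype(np.float16)
        self.cache = OrderedDict()           # row_id -> fp32 row
        self.cache_size = cache_size

    def lookup(self, row_id):
        if row_id in self.cache:             # hot row: serve the fp32 copy
            self.cache.move_to_end(row_id)
            return self.cache[row_id]
        row = self.store[row_id].astype(np.float32)
        self.cache[row_id] = row             # promote to full precision
        if len(self.cache) > self.cache_size:
            evict_id, evicted = self.cache.popitem(last=False)
            self.store[evict_id] = evicted.astype(np.float16)   # write back in low precision
        return row

    def update(self, row_id, grad, lr=0.01):
        # Hot rows accumulate training updates in fp32; evicted rows fall back to fp16.
        row = self.lookup(row_id)
        row -= lr * grad
        self.cache[row_id] = row
```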
no code implementations • 18 Oct 2020 • Vipul Gupta, Dhruv Choudhary, Ping Tak Peter Tang, Xiaohan Wei, Xing Wang, Yuzhen Huang, Arun Kejariwal, Kannan Ramchandran, Michael W. Mahoney
This is done by identifying and updating only the neurons of the network that are most relevant to each training sample.
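A rough sketch of per-sample selective updating; the selection criterion below (largest activation magnitude) and the names are my own stand-ins, since the paper's actual relevance mechanism differs:

```python
import numpy as np

def selective_update(W, x, grad_h, k=8, lr=0.01):
    """Update only the k hidden units most activated by this sample (illustrative)."""
    h = np.maximum(W @ x, 0.0)                 # ReLU activations for this sample
    top = np.argsort(np.abs(h))[-k:]           # "most relevant" = largest activation (assumption)
    W[top] -= lr * np.outer(grad_h[top], x)    # rank-1 update on the selected rows only
    return W

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 32))
x = rng.standard_normal(32)
grad_h = rng.standard_normal(64)               # stand-in for a backpropagated gradient
selective_update(W, x, grad_h)
```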
no code implementations • ICLR 2019 • Tsung-Han Lin, Ping Tak Peter Tang
A dynamical neural network consists of a set of interconnected neurons that interact continuously over time.
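As a generic illustration of such continuous-time interaction (a standard leaky recurrent model, Euler-integrated; the specific dynamics studied in the paper differ in detail):

```python
import numpy as np

def simulate(W, x, steps=200, dt=0.05, tau=1.0):
    """Euler-integrate a leaky, recurrently connected network:
        tau * du/dt = -u + W @ f(u) + x
    where f is a pointwise nonlinearity and x is a constant input drive."""
    f = lambda u: np.maximum(u, 0.0)     # ReLU firing-rate nonlinearity
    u = np.zeros_like(x)
    for _ in range(steps):
        u += (dt / tau) * (-u + W @ f(u) + x)
    return f(u)                          # steady-state firing rates

rng = np.random.default_rng(0)
W = 0.1 * rng.standard_normal((5, 5))    # recurrent weights
print(simulate(W, rng.standard_normal(5)))
```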
no code implementations • 23 May 2018 • Tsung-Han Lin, Ping Tak Peter Tang
A dynamical neural network consists of a set of interconnected neurons that interact continuously over time.
no code implementations • ICML 2018 • Raghu Bollapragada, Dheevatsa Mudigere, Jorge Nocedal, Hao-Jun Michael Shi, Ping Tak Peter Tang
The standard L-BFGS method relies on gradient approximations that are not dominated by noise, so that search directions are descent directions, the line search is reliable, and quasi-Newton updating yields useful quadratic models of the objective function.
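For context, the quasi-Newton direction in question is produced by the classic L-BFGS two-loop recursion, sketched below. With noisy small-batch gradients, the curvature pairs (s_i, y_i) can violate s_i @ y_i > 0, which is exactly the failure mode the paper addresses; this sketch assumes clean pairs:

```python
import numpy as np

def two_loop_direction(grad, s_list, y_list):
    """Return -H_k @ grad, where H_k is the implicit L-BFGS inverse-Hessian
    approximation built from stored pairs s_i = x_{i+1}-x_i, y_i = g_{i+1}-g_i."""
    q = grad.copy()
    alphas = []
    for s, y in zip(reversed(s_list), reversed(y_list)):
        rho = 1.0 / (y @ s)
        a = rho * (s @ q)
        q -= a * y
        alphas.append((a, rho, s, y))
    s, y = s_list[-1], y_list[-1]
    q *= (s @ y) / (y @ y)               # initial Hessian scaling from newest pair
    for a, rho, s, y in reversed(alphas):
        b = rho * (y @ q)
        q += (a - b) * s
    return -q

# Toy usage on f(x) = 0.5 * x @ A @ x, with two stored curvature pairs.
A = np.diag([1.0, 10.0])
xs = [np.array([1.0, 1.0]), np.array([0.8, 0.5]), np.array([0.7, 0.2])]
grads = [A @ x for x in xs]
s_list = [xs[i + 1] - xs[i] for i in range(2)]
y_list = [grads[i + 1] - grads[i] for i in range(2)]
d = two_loop_direction(grads[-1], s_list, y_list)
```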
no code implementations • 15 May 2017 • Ping Tak Peter Tang, Tsung-Han Lin, Mike Davies
With a moderate but well-defined assumption, we prove that the SNN indeed solves sparse coding.
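The sparse coding problem in question is the standard objective min_a 0.5*||x - D a||^2 + lam*||a||_1. Below is a rate-based, Euler-discretized LCA-style solver for it, the kind of neural dynamics such results connect to spiking networks; the paper analyzes a spiking realization, so treat this as a sketch of the underlying problem rather than the paper's network:

```python
import numpy as np

def lca_sparse_code(D, x, lam=0.1, steps=500, dt=0.05):
    """Euler-discretized LCA dynamics for  min_a 0.5*||x - D a||^2 + lam*||a||_1:
        du/dt = D.T @ x - u - (D.T @ D - I) @ a,   a = soft_threshold(u, lam)
    """
    n = D.shape[1]
    gram = D.T @ D - np.eye(n)               # lateral inhibition between units
    drive = D.T @ x                           # feedforward input drive
    soft = lambda u: np.sign(u) * np.maximum(np.abs(u) - lam, 0.0)
    u = np.zeros(n)
    for _ in range(steps):
        u += dt * (drive - u - gram @ soft(u))
    return soft(u)

rng = np.random.default_rng(0)
D = rng.standard_normal((32, 64))
D /= np.linalg.norm(D, axis=0)                # unit-norm dictionary columns
a_true = np.zeros(64)
a_true[[3, 17]] = [1.0, -0.7]                 # a genuinely sparse code
a_hat = lca_sparse_code(D, D @ a_true, lam=0.1)
```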
1 code implementation • 28 Feb 2017 • Sheng Li, Jongsoo Park, Ping Tak Peter Tang
Sparse methods and the use of Winograd convolutions are two orthogonal approaches, each of which significantly accelerates convolution computations in modern CNNs.
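To make the Winograd side concrete, here is the standard 1D F(2,3) transform (Lavin & Gray's matrices): two outputs of a 3-tap convolution from four multiplies instead of six. A sparse filter g makes the transformed filter G @ g cheap as well, which hints at how the two approaches can compose; how the paper actually combines them is more involved:

```python
import numpy as np

# Standard Winograd F(2,3) transform matrices.
BT = np.array([[1, 0, -1, 0],
               [0, 1,  1, 0],
               [0, -1, 1, 0],
               [0, 1,  0, -1]], dtype=np.float64)
G  = np.array([[1.0, 0.0, 0.0],
               [0.5, 0.5, 0.5],
               [0.5, -0.5, 0.5],
               [0.0, 0.0, 1.0]], dtype=np.float64)
AT = np.array([[1, 1, 1, 0],
               [0, 1, -1, -1]], dtype=np.float64)

def winograd_f23(d, g):
    """d: 4 input samples, g: 3 filter taps -> 2 convolution outputs."""
    return AT @ ((G @ g) * (BT @ d))          # 4 elementwise multiplies

d = np.array([1.0, 2.0, 3.0, 4.0])
g = np.array([0.5, 0.0, -0.5])                # an already-sparse filter
ref = np.array([d[0:3] @ g, d[1:4] @ g])      # direct (correlation-style) convolution
assert np.allclose(winograd_f23(d, g), ref)
```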
9 code implementations • 15 Sep 2016 • Nitish Shirish Keskar, Dheevatsa Mudigere, Jorge Nocedal, Mikhail Smelyanskiy, Ping Tak Peter Tang
The stochastic gradient descent (SGD) method and its variants are algorithms of choice for many Deep Learning tasks.
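For reference, the basic iteration being discussed is plain minibatch SGD, sketched generically below (the paper's subject is how the choice of batch size affects the minima such iterations converge to, which this sketch does not capture):

```python
import numpy as np

def sgd(grad_fn, theta, data, batch_size=32, lr=0.1, epochs=5, seed=0):
    """Plain minibatch SGD: theta <- theta - lr * (average gradient over a batch)."""
    rng = np.random.default_rng(seed)
    n = len(data)
    for _ in range(epochs):
        perm = rng.permutation(n)
        for start in range(0, n, batch_size):
            batch = perm[start:start + batch_size]
            g = np.mean([grad_fn(theta, data[i]) for i in batch], axis=0)
            theta = theta - lr * g
    return theta

# Example: recover the mean of the data by minimizing squared error.
data = np.random.default_rng(1).normal(3.0, 1.0, size=256)
grad = lambda theta, x: 2.0 * (theta - x)
print(sgd(grad, theta=0.0, data=data))        # converges near 3.0
```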
1 code implementation • 4 Aug 2016 • Jongsoo Park, Sheng Li, Wei Wen, Ping Tak Peter Tang, Hai Li, Yiran Chen, Pradeep Dubey
Pruning CNNs in a way that increases inference speed often imposes specific sparsity structures, thus limiting the achievable sparsity levels.
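As an illustration of the structured-sparsity constraint described above (a generic magnitude-based channel-pruning sketch, not the paper's method): zeroing whole output channels keeps the remaining compute dense and fast, but caps how sparse the network can get.

```python
import numpy as np

def prune_output_channels(W, sparsity=0.5):
    """Structured magnitude pruning: zero entire output channels (rows of a
    [out_channels, in_channels * k * k] weight matrix) with the smallest L2 norms."""
    norms = np.linalg.norm(W, axis=1)         # one norm per output channel
    cutoff = np.quantile(norms, sparsity)     # prune the weakest fraction
    mask = norms > cutoff
    return W * mask[:, None], mask

W = np.random.randn(64, 3 * 3 * 3)            # toy conv layer, flattened
W_pruned, kept = prune_output_channels(W, sparsity=0.5)
print(f"kept {kept.sum()}/{len(kept)} channels")
```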