Search Results for author: Wenjie Zi

Found 4 papers, 2 papers with code

Training a Vision Transformer from scratch in less than 24 hours with 1 GPU

1 code implementation9 Nov 2022 Saghar Irandoust, Thibaut Durand, Yunduz Rakhmangulova, Wenjie Zi, Hossein Hajimirsadeghi

We introduce some algorithmic improvements to enable training a ViT model from scratch with limited hardware (1 GPU) and time (24 hours) resources.

Optimizing Deeper Transformers on Small Datasets

1 code implementation ACL 2021 Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie Chi Kit Cheung, Simon J. D. Prince, Yanshuai Cao

This work shows that this does not always need to be the case: with proper initialization and optimization, the benefits of very deep transformers can carry over to challenging tasks with small datasets, including Text-to-SQL semantic parsing and logical reading comprehension.

Reading Comprehension Semantic Parsing +2

Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus

no code implementations LREC 2016 SoHyun Park, Afsaneh Fazly, Annie Lee, Br Seibel, on, Wenjie Zi, Paul Cook

We then propose a supervised approach to classify out-of-vocabulary terms according to these categories, drawing on features based on word embeddings, and linguistic knowledge of common properties of out-of-vocabulary terms.

General Classification Word Embeddings

Cannot find the paper you are looking for? You can Submit a new open access paper.