Search Results for author: Belinda Zeng

Found 15 papers, 2 papers with code

Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks

no code implementations • NAACL (ACL) 2022 • Weiyi Lu, Sunny Rajagopalan, Priyanka Nigam, Jaspreet Singh, Xiaodi Sun, Yi Xu, Belinda Zeng, Trishul Chilimbi

However, one issue that often arises in MTL is the convergence speed between tasks varies due to differences in task difficulty, so it can be a challenge to simultaneously achieve the best performance on all tasks with a single model checkpoint.

Knowledge Distillation Multi-Task Learning

Paper
Add Code

Semantic Aligned Multi-modal Transformer for Vision-LanguageUnderstanding: A Preliminary Study on Visual QA

no code implementations • NAACL (maiworkshop) 2021 • Han Ding, Li Erran Li, Zhiting Hu, Yi Xu, Dilek Hakkani-Tur, Zheng Du, Belinda Zeng

Recent vision-language understanding approaches adopt a multi-modal transformer pre-training and finetuning paradigm.

Question Answering Visual Question Answering

Paper
Add Code

VidLA: Video-Language Alignment at Scale

no code implementations • 21 Mar 2024 • Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi

To effectively address this limitation, we instead keep the network architecture simple and use a set of data tokens that operate at different temporal resolutions in a hierarchical manner, accounting for the temporally hierarchical nature of videos.

Language Modelling Visual Grounding

Paper
Add Code

Robust Multi-Task Learning with Excess Risks

no code implementations • 3 Feb 2024 • Yifei He, Shiji Zhou, Guojun Zhang, Hyokun Yun, Yi Xu, Belinda Zeng, Trishul Chilimbi, Han Zhao

To overcome this limitation, we propose Multi-Task Learning with Excess Risks (ExcessMTL), an excess risk-based task balancing method that updates the task weights by their distances to convergence instead.

Multi-Task Learning

Paper
Add Code

Better Representations via Adversarial Training in Pre-Training: A Theoretical Perspective

no code implementations • 26 Jan 2024 • Yue Xing, Xiaofeng Lin, Qifan Song, Yi Xu, Belinda Zeng, Guang Cheng

Pre-training is known to generate universal representations for downstream tasks in large-scale deep learning such as large language models.

Adversarial Robustness Contrastive Learning +1

Paper
Add Code

ForeSeer: Product Aspect Forecasting Using Temporal Graph Embedding

no code implementations • 7 Oct 2023 • Zixuan Liu, Gaurush Hiranandani, Kun Qian, Eddie W. Huang, Yi Xu, Belinda Zeng, Karthik Subbian, Sheng Wang

ForeSeer transfers reviews from similar products on a large product graph and exploits these reviews to predict aspects that might emerge in future reviews.

Graph Embedding Link Prediction

Paper
Add Code

Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

no code implementations • 5 Jun 2023 • Han Xie, Da Zheng, Jun Ma, Houyu Zhang, Vassilis N. Ioannidis, Xiang Song, Qing Ping, Sheng Wang, Carl Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi

Model pre-training on large text corpora has been demonstrated effective for various downstream applications in the NLP domain.

Graph Mining Language Modelling

Paper
Add Code

Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning

no code implementations • CVPR 2023 • Qian Jiang, Changyou Chen, Han Zhao, Liqun Chen, Qing Ping, Son Dinh Tran, Yi Xu, Belinda Zeng, Trishul Chilimbi

Hence we advocate that the key of better performance lies in meaningful latent modality structures instead of perfect modality alignment.

Few-Shot Image Classification Open-Ended Question Answering +6

Paper
Add Code

Efficient and effective training of language and graph neural network models

no code implementations • 22 Jun 2022 • Vassilis N. Ioannidis, Xiang Song, Da Zheng, Houyu Zhang, Jun Ma, Yi Xu, Belinda Zeng, Trishul Chilimbi, George Karypis

The effectiveness in our framework is achieved by applying stage-wise fine-tuning of the BERT model first with heterogenous graph information and then with a GNN model.

Edge Classification Language Modelling +1

Paper
Add Code

DynaMaR: Dynamic Prompt with Mask Token Representation

no code implementations • 7 Jun 2022 • Xiaodi Sun, Sunny Rajagopalan, Priyanka Nigam, Weiyi Lu, Yi Xu, Belinda Zeng, Trishul Chilimbi

In this paper, we propose an improvement to prompt-based fine-tuning that addresses these two issues.

Language Modelling Sentence

Paper
Add Code

Multi-modal Alignment using Representation Codebook

no code implementations • CVPR 2022 • Jiali Duan, Liqun Chen, Son Tran, Jinyu Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi

Aligning signals from different modalities is an important step in vision-language representation learning as it affects the performance of later stages such as cross-modality fusion.

Representation Learning Retrieval

Paper
Add Code

Vision-Language Pre-Training with Triple Contrastive Learning

1 code implementation • CVPR 2022 • Jinyu Yang, Jiali Duan, Son Tran, Yi Xu, Sampath Chanda, Liqun Chen, Belinda Zeng, Trishul Chilimbi, Junzhou Huang

Besides CMA, TCL introduces an intra-modal contrastive objective to provide complementary benefits in representation learning.

Ranked #3 on Zero-Shot Cross-Modal Retrieval on COCO 2014

Contrastive Learning Cross-Modal Retrieval +6

252

Paper
Code

Magic Pyramid: Accelerating Inference with Early Exiting and Token Pruning

no code implementations • 30 Oct 2021 • Xuanli He, Iman Keivanloo, Yi Xu, Xiang He, Belinda Zeng, Santosh Rajagopalan, Trishul Chilimbi

To achieve this, we propose a novel idea, Magic Pyramid (MP), to reduce both width-wise and depth-wise computation via token pruning and early exiting for Transformer-based models, particularly BERT.

text-classification Text Classification

Paper
Add Code

MLIM: Vision-and-Language Model Pre-training with Masked Language and Image Modeling

no code implementations • 24 Sep 2021 • Tarik Arici, Mehmet Saygin Seyfioglu, Tal Neiman, Yi Xu, Son Train, Trishul Chilimbi, Belinda Zeng, Ismail Tutar

Vision-and-Language Pre-training (VLP) improves model performance for downstream tasks that require image and text inputs.

Image Reconstruction Language Modelling +1

Paper
Add Code

Simpler, Faster, Stronger: Breaking The log-K Curse On Contrastive Learners With FlatNCE

1 code implementation • 2 Jul 2021 • Junya Chen, Zhe Gan, Xuan Li, Qing Guo, Liqun Chen, Shuyang Gao, Tagyoung Chung, Yi Xu, Belinda Zeng, Wenlian Lu, Fan Li, Lawrence Carin, Chenyang Tao

InfoNCE-based contrastive representation learners, such as SimCLR, have been tremendously successful in recent years.

Contrastive Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.