Search Results for author: Rui Hou

Found 28 papers, 14 papers with code

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

no code implementations • 19 Dec 2023 • Shraman Pramanick, Guangxing Han, Rui Hou, Sayan Nag, Ser-Nam Lim, Nicolas Ballas, Qifan Wang, Rama Chellappa, Amjad Almahairi

In this work, we introduce VistaLLM, a powerful visual system that addresses coarse- and fine-grained VL tasks over single and multiple input images using a unified framework.

Attribute Language Modelling +1

Paper
Add Code

Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction

1 code implementation • 19 Dec 2023 • Da Luo, Yanglei Gan, Rui Hou, Run Lin, Qiao Liu, Yuxiang Cai, Wannian Gao

Specifically, our framework involves a symmetrical contrastive objective that encompasses both sentence-anchored and label-anchored contrastive losses.

Contrastive Learning Relation +2

Paper
Code

RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training

1 code implementation • 7 Dec 2023 • Jaehyung Kim, Yuning Mao, Rui Hou, Hanchao Yu, Davis Liang, Pascale Fung, Qifan Wang, Fuli Feng, Lifu Huang, Madian Khabsa

Under a unified evaluation of fine-tuned LMs by incorporating four representative perspectives of model robustness, we demonstrate the effectiveness of RoAST compared to state-of-the-art fine-tuning methods on six different types of LMs, which indicates its usefulness in practice.

Adversarial Robustness

Paper
Code

MART: Improving LLM Safety with Multi-round Automatic Red-Teaming

no code implementations • 13 Nov 2023 • Suyu Ge, Chunting Zhou, Rui Hou, Madian Khabsa, Yi-Chia Wang, Qifan Wang, Jiawei Han, Yuning Mao

Specifically, an adversarial LLM and a target LLM interplay with each other in an iterative manner, where the adversarial LLM aims to generate challenging prompts that elicit unsafe responses from the target LLM, while the target LLM is fine-tuned with safety aligned data on these adversarial prompts.

Instruction Following Response Generation

Paper
Add Code

Co-training and Co-distillation for Quality Improvement and Compression of Language Models

no code implementations • 6 Nov 2023 • Hayeon Lee, Rui Hou, Jongpil Kim, Davis Liang, Hongbo Zhang, Sung Ju Hwang, Alexander Min

2) The enhanced performance of the larger model further boosts the performance of the smaller model.

Data Augmentation Knowledge Distillation

Paper
Add Code

Effective Long-Context Scaling of Foundation Models

1 code implementation • 27 Sep 2023 • Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

We also examine the impact of various design choices in the pretraining process, including the data mix and the training curriculum of sequence lengths -- our ablation experiments suggest that having abundant long texts in the pretrain dataset is not the key to achieving strong performance, and we empirically verify that long context continual pretraining is more efficient and similarly effective compared to pretraining from scratch with long sequences.

Continual Pretraining Language Modelling

274

Paper
Code

Aspect-oriented Opinion Alignment Network for Aspect-Based Sentiment Classification

1 code implementation • 22 Aug 2023 • Xueyi Liu, Rui Hou, Yanglei Gan, Da Luo, Changlin Li, Xiaojun Shi, Qiao Liu

In addition, we design a multi-perspective attention mechanism that align relevant opinion information with respect to the given aspect.

Management Sentiment Analysis +1

Paper
Code

Llama 2: Open Foundation and Fine-Tuned Chat Models

14 code implementations • 18 Jul 2023 • Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters.

Ranked #2 on Question Answering on PubChemQA

Arithmetic Reasoning +5

52,402

Paper
Code

A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models

1 code implementation • 26 May 2023 • Hayeon Lee, Rui Hou, Jongpil Kim, Davis Liang, Sung Ju Hwang, Alexander Min

Distillation from Weak Teacher (DWT) is a method of transferring knowledge from a smaller, weaker teacher model to a larger student model to improve its performance.

Knowledge Distillation

124,457

Paper
Code

Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes

no code implementations • 22 May 2023 • Kuan-Hao Huang, Liang Tan, Rui Hou, Sinong Wang, Amjad Almahairi, Ruty Rinott

Fine-tuning a large pre-trained language model for each downstream task causes computational burdens in the inference time due to several times of forward passes.

Language Modelling

Paper
Add Code

Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization

1 code implementation • 6 May 2023 • Anastasia Razdaibiedina, Yuning Mao, Rui Hou, Madian Khabsa, Mike Lewis, Jimmy Ba, Amjad Almahairi

In this work, we introduce Residual Prompt Tuning - a simple and efficient method that significantly improves the performance and stability of prompt tuning.

Paper
Code

MMViT: Multiscale Multiview Vision Transformers

no code implementations • 28 Apr 2023 • Yuchen Liu, Natasha Ong, Kaiyan Peng, Bo Xiong, Qifan Wang, Rui Hou, Madian Khabsa, Kaiyue Yang, David Liu, Donald S. Williamson, Hanchao Yu

Our model encodes different views of the input signal and builds several channel-resolution feature stages to process the multiple views of the input at different resolutions in parallel.

Image Classification

Paper
Add Code

SVT: Supertoken Video Transformer for Efficient Video Understanding

no code implementations • 1 Apr 2023 • Chenbin Pan, Rui Hou, Hanchao Yu, Qifan Wang, Senem Velipasalar, Madian Khabsa

Whether by processing videos with fixed resolution from start to end or incorporating pooling and down-scaling strategies, existing video transformers process the whole video content throughout the network without specially handling the large portions of redundant information.

Video Understanding

Paper
Add Code

Progressive Prompts: Continual Learning for Language Models

2 code implementations • 29 Jan 2023 • Anastasia Razdaibiedina, Yuning Mao, Rui Hou, Madian Khabsa, Mike Lewis, Amjad Almahairi

We introduce Progressive Prompts - a simple and efficient approach for continual learning in language models.

Continual Learning

Paper
Code

XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models

2 code implementations • 25 Jan 2023 • Davis Liang, Hila Gonen, Yuning Mao, Rui Hou, Naman Goyal, Marjan Ghazvininejad, Luke Zettlemoyer, Madian Khabsa

Large multilingual language models typically rely on a single vocabulary shared across 100+ languages.

named-entity-recognition Named Entity Recognition +4

Paper
Code

Uniform Masking Prevails in Vision-Language Pretraining

no code implementations • 10 Dec 2022 • Siddharth Verma, Yuchen Lu, Rui Hou, Hanchao Yu, Nicolas Ballas, Madian Khabsa, Amjad Almahairi

Masked Language Modeling (MLM) has proven to be an essential component of Vision-Language (VL) pretraining.

Image-text matching Language Modelling +2

Paper
Add Code

Query Rewriting for Effective Misinformation Discovery

no code implementations • 14 Oct 2022 • Ashkan Kazemi, Artem Abzaliev, Naihao Deng, Rui Hou, Scott A. Hale, Verónica Pérez-Rosas, Rada Mihalcea

We propose a novel system to help fact-checkers formulate search queries for known misinformation claims and effectively search across multiple social media platforms.

Misinformation reinforcement-learning +2

Paper
Add Code

IDPG: An Instance-Dependent Prompt Generation Method

no code implementations • NAACL 2022 • Zhuofeng Wu, Sinong Wang, Jiatao Gu, Rui Hou, Yuxiao Dong, V. G. Vinod Vydiswaran, Hao Ma

Prompt tuning is a new, efficient NLP transfer learning paradigm that adds a task-specific prompt in each input instance during the model training stage.

Language Modelling Natural Language Understanding +2

Paper
Add Code

UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning

1 code implementation • ACL 2022 • Yuning Mao, Lambert Mathias, Rui Hou, Amjad Almahairi, Hao Ma, Jiawei Han, Wen-tau Yih, Madian Khabsa

Recent parameter-efficient language model tuning (PELT) methods manage to match the performance of fine-tuning with much fewer trainable parameters and perform especially well when training data is limited.

Language Modelling Model Selection

Paper
Code

Weakly Supervised 3D Classification of Chest CT using Aggregated Multi-Resolution Deep Segmentation Features

1 code implementation • 31 Oct 2020 • Anindo Saha, Fakrul I. Tushar, Khrystyna Faryna, Vincent M. D'Anniballe, Rui Hou, Maciej A. Mazurowski, Geoffrey D. Rubin, Joseph Y. Lo

Second, segmentation and classification models are connected with two different feature aggregation strategies to enhance the classification performance.

3D Classification Binary Classification +4

Paper
Code

Classification of Multiple Diseases on Body CT Scans using Weakly Supervised Deep Learning

1 code implementation • 3 Aug 2020 • Fakrul Islam Tushar, Vincent M. D'Anniballe, Rui Hou, Maciej A. Mazurowski, Wanyi Fu, Ehsan Samei, Geoffrey D. Rubin, Joseph Y. Lo

Purpose: To design multi-disease classifiers for body CT scans for three different organ systems using automatically extracted labels from radiology text reports. Materials & Methods: This retrospective study included a total of 12, 092 patients (mean age 57 +- 18; 6, 172 women) for model development and testing (from 2012-2017).

Computed Tomography (CT) General Classification

Paper
Code

Semantically-Guided Representation Learning for Self-Supervised Monocular Depth

1 code implementation • ICLR 2020 • Vitor Guizilini, Rui Hou, Jie Li, Rares Ambrus, Adrien Gaidon

Instead of using semantic labels and proxy losses in a multi-task approach, we propose a new architecture leveraging fixed pretrained semantic segmentation networks to guide self-supervised representation learning via pixel-adaptive convolutions.

Depth Prediction Monocular Depth Estimation +3

1,197

Paper
Code

Measuring the Utilization of Public Open Spaces by Deep Learning: a Benchmark Study at the Detroit Riverfront

no code implementations • 4 Feb 2020 • Peng Sun, Rui Hou, Jerome Lynch

It is appealing to make use of surveillance cameras and to extract user-related information through computer vision.

Activity Recognition Pedestrian Detection +1

Paper
Add Code

Real-Time Panoptic Segmentation from Dense Detections

no code implementations • CVPR 2020 • Rui Hou, Jie Li, Arjun Bhargava, Allan Raventos, Vitor Guizilini, Chao Fang, Jerome Lynch, Adrien Gaidon

Panoptic segmentation is a complex full scene parsing task requiring simultaneous instance and semantic segmentation at high resolution.

Clustering object-detection +4

Paper
Add Code

Towards Automatic Detection of Misinformation in Online Medical Videos

no code implementations • 4 Sep 2019 • Rui Hou, Verónica Pérez-Rosas, Stacy Loeb, Rada Mihalcea

Recent years have witnessed a significant increase in the online sharing of medical information, with videos representing a large fraction of such online sources.

Misinformation

Paper
Add Code

An Efficient 3D CNN for Action/Object Segmentation in Video

no code implementations • 21 Jul 2019 • Rui Hou, Chen Chen, Rahul Sukthankar, Mubarak Shah

Convolutional Neural Network (CNN) based image segmentation has made great progress in recent years.

Ranked #64 on Semi-Supervised Video Object Segmentation on DAVIS 2016

Action Segmentation Image Segmentation +6

Paper
Add Code

An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos

no code implementations • 30 Nov 2017 • Rui Hou, Chen Chen, Mubarak Shah

A video is first divided into equal length clips and next for each clip a set of tube proposals are generated based on 3D CNN features.

Action Detection Action Segmentation +4

Paper
Add Code

Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos

1 code implementation • ICCV 2017 • Rui Hou, Chen Chen, Mubarak Shah

A video is first divided into equal length clips and for each clip a set of tube proposals are generated next based on 3D Convolutional Network (ConvNet) features.

Ranked #1 on Action Detection on UCF Sports

Action Detection Image Classification +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.