Search Results for author: Ziyang Luo

Found 21 papers, 10 papers with code

Easy and Efficient Transformer: Scalable Inference Solution For Large NLP Model

no code implementations • NAACL (ACL) 2022 • Gongzheng li, Yadong Xi, Jingzhen Ding, Duan Wang, Ziyang Luo, Rongsheng Zhang, Bai Liu, Changjie Fan, Xiaoxi Mao, Zeng Zhao

To fill such a gap, we introduce a scalable inference solution: Easy and Efficient Transformer (EET), including a series of transformer inference optimization at the algorithm and implementation levels.

Inference Optimization

Paper
Add Code

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

1 code implementation • 15 Apr 2024 • Kaixin Li, Yuchen Tian, Qisheng Hu, Ziyang Luo, Jing Ma

Programming often involves converting detailed and complex specifications into code, a process during which developers typically utilize visual aids to more effectively convey concepts.

Code Generation Visual Reasoning

Paper
Code

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

no code implementations • 30 Mar 2024 • Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak, Aleksandr Drozd, Jordan Clive, Kshitij Gupta, Liangyu Chen, Qi Sun, Ken Tsui, Noah Persaud, Nour Fahmy, Tianlong Chen, Mohit Bansal, Nicolo Monti, Tai Dang, Ziyang Luo, Tien-Tung Bui, Roberto Navigli, Virendra Mehta, Matthew Blumberg, Victor May, Huu Nguyen, Sampo Pyysalo

Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility.

Continual Pretraining Language Modelling

Paper
Add Code

Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models

1 code implementation • 24 Jan 2024 • Hongzhan Lin, Ziyang Luo, Wei Gao, Jing Ma, Bo wang, Ruichao Yang

Then we propose to fine-tune a small language model as the debate judge for harmfulness inference, to facilitate multimodal fusion between the harmfulness rationales and the intrinsic multimodal information within memes.

Language Modelling Text Generation

Paper
Code

GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse

no code implementations • 3 Jan 2024 • Hongzhan Lin, Ziyang Luo, Bo wang, Ruichao Yang, Jing Ma

The exponential growth of social media has profoundly transformed how information is created, disseminated, and absorbed, exceeding any precedent in the digital age.

Paper
Add Code

Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Models

1 code implementation • 9 Dec 2023 • Hongzhan Lin, Ziyang Luo, Jing Ma, Long Chen

The age of social media is rife with memes.

Multimodal Reasoning

Paper
Code

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

1 code implementation • 25 Nov 2023 • Ziyang Luo, Nian Liu, Wangbo Zhao, Xuguang Yang, Dingwen Zhang, Deng-Ping Fan, Fahad Khan, Junwei Han

Salient object detection (SOD) and camouflaged object detection (COD) are related yet distinct binary mapping tasks.

Model Optimization object-detection +3

Paper
Code

VST++: Efficient and Stronger Visual Saliency Transformer

no code implementations • 18 Oct 2023 • Nian Liu, Ziyang Luo, Ni Zhang, Junwei Han

Our previous work, the Visual Saliency Transformer (VST), addressed this constraint from a transformer-based sequence-to-sequence perspective, to unify RGB and RGB-D SOD.

object-detection Object Detection +1

Paper
Add Code

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

2 code implementations • 14 Jun 2023 • Ziyang Luo, Can Xu, Pu Zhao, Qingfeng Sun, Xiubo Geng, Wenxiang Hu, Chongyang Tao, Jing Ma, QIngwei Lin, Daxin Jiang

Moreover, our model even outperforms the largest closed LLMs, Anthropic's Claude and Google's Bard, on HumanEval and HumanEval+.

Ranked #3 on Code Generation on CodeContests (Test Set pass@1 metric)

Code Generation

8,871

Paper
Code

Augmented Large Language Models with Parametric Knowledge Guiding

1 code implementation • 8 May 2023 • Ziyang Luo, Can Xu, Pu Zhao, Xiubo Geng, Chongyang Tao, Jing Ma, QIngwei Lin, Daxin Jiang

We demonstrate that our PKG framework can enhance the performance of "black-box" LLMs on a range of domain knowledge-intensive tasks that require factual (+7. 9%), tabular (+11. 9%), medical (+3. 0%), and multimodal (+8. 1%) knowledge.

Paper
Code

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval

1 code implementation • 6 Feb 2023 • Ziyang Luo, Pu Zhao, Can Xu, Xiubo Geng, Tao Shen, Chongyang Tao, Jing Ma, Qingwen Lin, Daxin Jiang

The conventional dense retrieval paradigm relies on encoding images and texts into dense representations using dual-stream encoders, however, it faces challenges with low retrieval speed in large-scale retrieval scenarios.

Retrieval Text Retrieval

Paper
Code

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval

1 code implementation • ICCV 2023 • Ziyang Luo, Pu Zhao, Can Xu, Xiubo Geng, Tao Shen, Chongyang Tao, Jing Ma, QIngwei Lin, Daxin Jiang

To address this issue, we propose a novel sparse retrieval paradigm for ITR that exploits sparse representations in the vocabulary space for images and texts.

Image Classification Retrieval +2

Paper
Code

Zero-Shot Rumor Detection with Propagation Structure via Prompt Learning

1 code implementation • 2 Dec 2022 • Hongzhan Lin, Pengyao Yi, Jing Ma, Haiyun Jiang, Ziyang Luo, Shuming Shi, Ruifang Liu

The spread of rumors along with breaking events seriously hinders the truth in the era of social media.

Domain Adaptation

Paper
Code

A Coarse-to-fine Cascaded Evidence-Distillation Neural Network for Explainable Fake News Detection

1 code implementation • COLING 2022 • Zhiwei Yang, Jing Ma, Hechang Chen, Hongzhan Lin, Ziyang Luo, Yi Chang

Existing fake news detection methods aim to classify a piece of news as true or false and provide veracity explanations, achieving remarkable performances.

Fake News Detection

Paper
Code

DecBERT: Enhancing the Language Understanding of BERT with Causal Attention Masks

no code implementations • Findings (NAACL) 2022 • Ziyang Luo, Yadong Xi, Jing Ma, Zhiwei Yang, Xiaoxi Mao, Changjie Fan, Rongsheng Zhang

In contrast, Transformer Decoder with the causal attention masks is naturally sensitive to the word order.

Language Modelling Position

Paper
Add Code

I-Tuning: Tuning Frozen Language Models with Image for Lightweight Image Captioning

no code implementations • 14 Feb 2022 • Ziyang Luo, Zhipeng Hu, Yadong Xi, Rongsheng Zhang, Jing Ma

Different to these heavy-cost models, we introduce a lightweight image captioning framework (I-Tuning), which contains a small number of trainable parameters.

Image Captioning Language Modelling

Paper
Add Code

A Frustratingly Simple Approach for End-to-End Image Captioning

no code implementations • 30 Jan 2022 • Ziyang Luo, Yadong Xi, Rongsheng Zhang, Jing Ma

Before training the captioning models, an extra object detector is utilized to recognize the objects in the image at first.

Image Captioning Object +1

Paper
Add Code

Analyzing the Implicit Position Encoding Ability of Transformer Decoder

no code implementations • 29 Sep 2021 • Ziyang Luo, Yadong Xi, Jing Ma, Xiaoxi Mao, Changjie Fan

A common limitation of Transformer Encoder's self-attention mechanism is that it cannot automatically capture the information of word order, so one needs to feed the explicit position encodings into the target model.

Language Modelling Position

Paper
Add Code

Gender Bias Hidden Behind Chinese Word Embeddings: The Case of Chinese Adjectives

no code implementations • ACL (GeBNLP) 2021 • Meichun Jiao, Ziyang Luo

Gender bias in word embeddings gradually becomes a vivid research field in recent years.

Word Embeddings

Paper
Add Code

Have Attention Heads in BERT Learned Constituency Grammar?

no code implementations • EACL 2021 • Ziyang Luo

Our results suggest that SMS tasks decrease the average CGI ability of upper layers, while NLI tasks increase it.

Natural Language Inference Natural Language Understanding +2

Paper
Add Code

Positional Artefacts Propagate Through Masked Language Model Embeddings

no code implementations • ACL 2021 • Ziyang Luo, Artur Kulmizev, Xiaoxi Mao

In this work, we demonstrate that the contextualized word vectors derived from pretrained masked language model-based encoders share a common, perhaps undesirable pattern across layers.

Language Modelling Sentence +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.