Search Results for author: Ziyang Luo

Found 21 papers, 10 papers with code

Easy and Efficient Transformer: Scalable Inference Solution For Large NLP Model

no code implementations NAACL (ACL) 2022 Gongzheng Li, Yadong Xi, Jingzhen Ding, Duan Wang, Ziyang Luo, Rongsheng Zhang, Bai Liu, Changjie Fan, Xiaoxi Mao, Zeng Zhao

To fill such a gap, we introduce a scalable inference solution: Easy and Efficient Transformer (EET), including a series of transformer inference optimization at the algorithm and implementation levels.

Inference Optimization

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

1 code implementation 15 Apr 2024 Kaixin Li, Yuchen Tian, Qisheng Hu, Ziyang Luo, Jing Ma

Programming often involves converting detailed and complex specifications into code, a process during which developers typically utilize visual aids to more effectively convey concepts.

Code Generation Visual Reasoning

Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models

1 code implementation 24 Jan 2024 Hongzhan Lin, Ziyang Luo, Wei Gao, Jing Ma, Bo Wang, Ruichao Yang

Then we propose to fine-tune a small language model as the debate judge for harmfulness inference, to facilitate multimodal fusion between the harmfulness rationales and the intrinsic multimodal information within memes.

Language Modelling Text Generation

GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse

no code implementations 3 Jan 2024 Hongzhan Lin, Ziyang Luo, Bo Wang, Ruichao Yang, Jing Ma

The exponential growth of social media has profoundly transformed how information is created, disseminated, and absorbed, exceeding any precedent in the digital age.

VST++: Efficient and Stronger Visual Saliency Transformer

no code implementations 18 Oct 2023 Nian Liu, Ziyang Luo, Ni Zhang, Junwei Han

Our previous work, the Visual Saliency Transformer (VST), addressed this constraint from a transformer-based sequence-to-sequence perspective, to unify RGB and RGB-D SOD.

Object Detection +1

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

2 code implementations 14 Jun 2023 Ziyang Luo, Can Xu, Pu Zhao, Qingfeng Sun, Xiubo Geng, Wenxiang Hu, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang

Moreover, our model even outperforms the largest closed LLMs, Anthropic's Claude and Google's Bard, on HumanEval and HumanEval+.

Ranked #3 on Code Generation on CodeContests (Test Set pass@1 metric)

Code Generation

Augmented Large Language Models with Parametric Knowledge Guiding

1 code implementation 8 May 2023 Ziyang Luo, Can Xu, Pu Zhao, Xiubo Geng, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang

We demonstrate that our PKG framework can enhance the performance of "black-box" LLMs on a range of domain knowledge-intensive tasks that require factual (+7.9%), tabular (+11.9%), medical (+3.0%), and multimodal (+8.1%) knowledge.

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval

1 code implementation 6 Feb 2023 Ziyang Luo, Pu Zhao, Can Xu, Xiubo Geng, Tao Shen, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang

The conventional dense retrieval paradigm relies on encoding images and texts into dense representations using dual-stream encoders; however, it faces challenges with low retrieval speed in large-scale retrieval scenarios.

Retrieval Text Retrieval

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval

1 code implementation ICCV 2023 Ziyang Luo, Pu Zhao, Can Xu, Xiubo Geng, Tao Shen, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang

To address this issue, we propose a novel sparse retrieval paradigm for ITR that exploits sparse representations in the vocabulary space for images and texts.

Image Classification Retrieval +2

Zero-Shot Rumor Detection with Propagation Structure via Prompt Learning

1 code implementation 2 Dec 2022 Hongzhan Lin, Pengyao Yi, Jing Ma, Haiyun Jiang, Ziyang Luo, Shuming Shi, Ruifang Liu

The spread of rumors along with breaking events seriously hinders the truth in the era of social media.

Domain Adaptation

A Coarse-to-fine Cascaded Evidence-Distillation Neural Network for Explainable Fake News Detection

1 code implementation COLING 2022 Zhiwei Yang, Jing Ma, Hechang Chen, Hongzhan Lin, Ziyang Luo, Yi Chang

Existing fake news detection methods aim to classify a piece of news as true or false and provide veracity explanations, achieving remarkable performances.

Fake News Detection

I-Tuning: Tuning Frozen Language Models with Image for Lightweight Image Captioning

no code implementations 14 Feb 2022 Ziyang Luo, Zhipeng Hu, Yadong Xi, Rongsheng Zhang, Jing Ma

Different to these heavy-cost models, we introduce a lightweight image captioning framework (I-Tuning), which contains a small number of trainable parameters.

Image Captioning Language Modelling

A Frustratingly Simple Approach for End-to-End Image Captioning

no code implementations 30 Jan 2022 Ziyang Luo, Yadong Xi, Rongsheng Zhang, Jing Ma

Before training the captioning models, an extra object detector is utilized to recognize the objects in the image at first.

Image Captioning Object +1

Analyzing the Implicit Position Encoding Ability of Transformer Decoder

no code implementations 29 Sep 2021 Ziyang Luo, Yadong Xi, Jing Ma, Xiaoxi Mao, Changjie Fan

A common limitation of Transformer Encoder's self-attention mechanism is that it cannot automatically capture the information of word order, so one needs to feed the explicit position encodings into the target model.

Language Modelling Position

Positional Artefacts Propagate Through Masked Language Model Embeddings

no code implementations ACL 2021 Ziyang Luo, Artur Kulmizev, Xiaoxi Mao

In this work, we demonstrate that the contextualized word vectors derived from pretrained masked language model-based encoders share a common, perhaps undesirable pattern across layers.

Language Modelling Sentence +3
