Search Results for author: Haoyu Song

Found 13 papers, 10 papers with code

RGB-Event based Pedestrian Attribute Recognition: A Benchmark Dataset and An Asymmetric RWKV Fusion Framework

1 code implementation14 Apr 2025 Xiao Wang, Haiyang Wang, Shiao Wang, Qiang Chen, Jiandong Jin, Haoyu Song, Bo Jiang, Chenglong Li

In this paper, we revisit these issues and propose a novel multi-modal RGB-Event attribute recognition task by drawing inspiration from the advantages of event cameras in low-light, high-speed, and low-power consumption.

Attribute Pedestrian Attribute Recognition

VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition

1 code implementation28 Dec 2024 Lan Chen, Haoxiang Yang, Pengpeng Shao, Haoyu Song, Xiao Wang, Zhicheng Zhao, YaoWei Wang, Yonghong Tian

Inspired by the successful application of large models, the introduction of such large models can also be considered to further enhance the performance of multi-modal tasks.

parameter-efficient fine-tuning

A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation

no code implementations26 Oct 2024 Haoyu Song, Wei-Nan Zhang, Kaiyan Zhang, Ting Liu

To this end, we propose a novel stack-propagation framework for learning a generation and understanding pipeline. Specifically, the framework stacks a Transformer encoder and two Transformer decoders, where the first decoder models response generation and the second serves as a regularizer and jointly models response generation and consistency understanding.

Dialogue Generation Response Generation

SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks

1 code implementation10 Oct 2024 Haiyang Wang, Qian Zhu, Mowen She, Yabo Li, Haoyu Song, Minghe Xu, Xiao Wang

To address this issue, in this paper, we propose a Spiking Neural Network (SNN) based framework for energy-efficient attribute recognition.

Attribute Knowledge Distillation +1

MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection

1 code implementation16 Aug 2024 Pengfei Cai, Yan Song, Kang Li, Haoyu Song, Ian McLoughlin

Sound event detection (SED) methods that leverage a large pre-trained Transformer encoder network have shown promising performance in recent DCASE challenges.

 Ranked #1 on Sound Event Detection on DESED (PSDS1 metric, using extra training data)

Event Detection Sound Event Detection

Language Models are General-Purpose Interfaces

1 code implementation13 Jun 2022 Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei

Experimental results across various language-only and vision-language benchmarks show that our model outperforms or is competitive with specialized models on finetuning, zero-shot generalization, and few-shot learning.

Causal Language Modeling Few-Shot Learning +7

Visually-Augmented Language Modeling

1 code implementation20 May 2022 Weizhi Wang, Li Dong, Hao Cheng, Haoyu Song, Xiaodong Liu, Xifeng Yan, Jianfeng Gao, Furu Wei

With the visually-augmented context, VaLM uses a visual knowledge fusion layer to enable multimodal grounded language modeling by attending to both text context and visual knowledge in images.

Image Retrieval Language Modeling +2

CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment

no code implementations ACL 2022 Haoyu Song, Li Dong, Wei-Nan Zhang, Ting Liu, Furu Wei

We first evaluate CLIP's zero-shot performance on a typical visual question answering task and demonstrate a zero-shot cross-modality transfer capability of CLIP on the visual entailment task.

parameter-efficient fine-tuning Question Answering +2

Profile Consistency Identification for Open-domain Dialogue Agents

1 code implementation EMNLP 2020 Haoyu Song, Yan Wang, Wei-Nan Zhang, Zhengyu Zhao, Ting Liu, Xiaojiang Liu

Maintaining a consistent attribute profile is crucial for dialogue agents to naturally converse with humans.

Attribute

Exploiting Persona Information for Diverse Generation of Conversational Responses

1 code implementation29 May 2019 Haoyu Song, Wei-Nan Zhang, Yiming Cui, Dong Wang, Ting Liu

Giving conversational context with persona information to a chatbot, how to exploit the information to generate diverse and sustainable conversations is still a non-trivial task.

Chatbot

Cannot find the paper you are looking for? You can Submit a new open access paper.