1 code implementation • 8 Apr 2024 • Tianshuo Cong, Delong Ran, Zesen Liu, Xinlei He, JinYuan Liu, Yichen Gong, Qi Li, Anyu Wang, XiaoYun Wang
Model merging is a promising lightweight model empowerment technique that does not rely on expensive computing devices (e. g., GPUs) or require the collection of specific training data.
1 code implementation • 9 Nov 2023 • Yichen Gong, Delong Ran, JinYuan Liu, Conglei Wang, Tianshuo Cong, Anyu Wang, Sisi Duan, XiaoYun Wang
Ensuring the safety of artificial intelligence-generated content (AIGC) is a longstanding topic in the artificial intelligence (AI) community, and the safety concerns associated with Large Language Models (LLMs) have been widely investigated.
1 code implementation • ACL 2022 • Xichen Pan, Peiyu Chen, Yichen Gong, Helong Zhou, Xinbing Wang, Zhouhan Lin
In particular, audio and visual front-ends are trained on large-scale unimodal datasets, then we integrate components of both front-ends into a larger multimodal framework which learns to recognize parallel audio-visual data into characters through a combination of CTC and seq2seq decoding.
Ranked #2 on Automatic Speech Recognition (ASR) on LRS2
Audio-Visual Speech Recognition Automatic Speech Recognition (ASR) +7
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Ning Shi, Ziheng Zeng, Haotian Zhang, Yichen Gong
In neural text editing, prevalent sequence-to-sequence based approaches directly map the unedited text either to the edited text or the editing operations, in which the performance is degraded by the limited source text encoding and long, varying decoding steps.
2 code implementations • ICLR 2018 • Yichen Gong, Heng Luo, Jian Zhang
Natural Language Inference (NLI) task requires an agent to determine the logical relationship between a natural language premise and a natural language hypothesis.
Ranked #12 on Paraphrase Identification on Quora Question Pairs (Accuracy metric)
no code implementations • WS 2018 • Yichen Gong, Samuel R. Bowman
To answer the question in machine comprehension (MC) task, the models need to establish the interaction between the question and the context.
Ranked #36 on Question Answering on SQuAD1.1 dev