no code implementations • 20 Dec 2016 • Shijie Zhang, Lizhen Qu, ShaoDi You, Zhenglu Yang, Jiawan Zhang
In this paper, we propose the first model to be able to generate visually grounded questions with diverse types for a single image.
1 code implementation • COLING 2018 • Hongru Liang, Haozheng Wang, Jun Wang, ShaoDi You, Zhe Sun, Jin-Mao Wei, Zhenglu Yang
Learning social media content is the basis of many real-world applications, including information retrieval and recommendation systems, among others.
no code implementations • COLING 2018 • Qian Li, Ziwei Li, Jin-Mao Wei, Yanhui Gu, Adam Jatowt, Zhenglu Yang
Enabling a mechanism to understand a temporal story and predict its ending is an interesting issue that has attracted considerable attention, as in case of the ROC Story Cloze Task (SCT).
no code implementations • 17 Sep 2019 • Chengyi Wang, Yu Wu, Shujie Liu, Zhenglu Yang, Ming Zhou
End-to-end speech translation, a hot topic in recent years, aims to translate a segment of audio into a specific language with an end-to-end model.
no code implementations • IJCNLP 2019 • Min Gui, Junfeng Tian, Rui Wang, Zhenglu Yang
Attention plays a key role in the improvement of sequence-to-sequence-based document summarization models.
no code implementations • ACL 2020 • Chengyi Wang, Yu Wu, Shujie Liu, Ming Zhou, Zhenglu Yang
End-to-end speech translation poses a heavy burden on the encoder, because it has to transcribe, understand, and learn cross-lingual semantics simultaneously.
1 code implementation • 17 Sep 2020 • Xiyan Fu, Jun Wang, Zhenglu Yang
Summarization of multimedia data becomes increasingly significant as it is the basis for many real-world applications, such as question answering, Web search, and so forth.
no code implementations • EMNLP 2021 • Yao Zhang, Hongru Liang, Adam Jatowt, Wenqiang Lei, Xin Wei, Ning Jiang, Zhenglu Yang
To the best of our knowledge, there lacks a general framework that approaches multi-hop reasoning in mixed long-short distance reasoning scenarios.
no code implementations • 22 Dec 2020 • Yao Zhang, Xu Zhang, Jun Wang, Hongru Liang, Wenqiang Lei, Zhe Sun, Adam Jatowt, Zhenglu Yang
The current methods for the link prediction taskhavetwonaturalproblems:1)the relation distributions in KGs are usually unbalanced, and 2) there are many unseen relations that occur in practical situations.
no code implementations • ICCV 2021 • Lin Zhao, Shao-Ping Lu, Tao Chen, Zhenglu Yang, Ariel Shamir
Underexposed image enhancement is of importance in many research domains.
no code implementations • 23 Mar 2021 • Hongru Liang, Haozheng Wang, Qian Li, Jun Wang, Guandong Xu, Jiawei Chen, Jin-Mao Wei, Zhenglu Yang
Learning and analyzing rap lyrics is a significant basis for many web applications, such as music recommendation, automatic music categorization, and music information retrieval, due to the abundant source of digital music in the World Wide Web.
no code implementations • NAACL 2021 • Xiyan Fu, Jun Wang, Zhenglu Yang
Multimodal summarization becomes increasingly significant as it is the basis for question answering, Web search, and many other downstream tasks.
no code implementations • ACL 2021 • Xiyan Fu, Yating Zhang, Tianyi Wang, Xiaozhong Liu, Changlong Sun, Zhenglu Yang
In the field of dialogue summarization, due to the lack of training data, it is often difficult for supervised summary generation methods to learn vital information from dialogue context with limited data.
no code implementations • Findings (ACL) 2022 • Yao Zhang, Peiyao Li, Hongru Liang, Adam Jatowt, Zhenglu Yang
In the question answering(QA) task, multi-hop reasoning framework has been extensively studied in recent years to perform more efficient and interpretable answer reasoning on the Knowledge Graph(KG).
no code implementations • 13 Sep 2021 • Zhengkun Zhang, Xiaojun Meng, Yasheng Wang, Xin Jiang, Qun Liu, Zhenglu Yang
Specially, we adopt knowledge distillation from a vision-language pretrained model to improve image selection, which avoids any requirement on the existence and quality of image captions.
1 code implementation • 16 Dec 2021 • Chengyi Wang, Yu Wu, Sanyuan Chen, Shujie Liu, Jinyu Li, Yao Qian, Zhenglu Yang
Recently, pioneer work finds that speech pre-trained models can solve full-stack speech processing tasks, because the model utilizes bottom layers to learn speaker-related information and top layers to encode content-related information.
no code implementations • 8 Mar 2022 • Zhengkun Zhang, Wenya Guo, Xiaojun Meng, Yasheng Wang, Yadao Wang, Xin Jiang, Qun Liu, Zhenglu Yang
In this paper, we design a novel unified parameter-efficient transfer learning framework that works effectively on both pure language and V&L tasks.
no code implementations • ACL 2022 • Huibin Zhang, Zhengkun Zhang, Yao Zhang, Jun Wang, Yufan Li, Ning Jiang, Xin Wei, Zhenglu Yang
Procedural Multimodal Documents (PMDs) organize textual instructions and corresponding images step by step.
no code implementations • 7 Apr 2022 • Wenqiang Lei, Yao Zhang, Feifan Song, Hongru Liang, Jiaxin Mao, Jiancheng Lv, Zhenglu Yang, Tat-Seng Chua
To this end, we contribute to advance the study of the proactive dialogue policy to a more natural and challenging setting, i. e., interacting dynamically with users.
no code implementations • Proceedings of the 2021 International Conference on Management of Data 2021 • Yinan Liu, Wei Shen, Yuanfei Wang, Jianyong Wang, Zhenglu Yang, Xiaojie Yuan
However, noun phrases (NPs) and relation phrases (RPs) in OKBs are not canonicalized and often appear in different paraphrased textual variants, which leads to redundant and ambiguous facts.
1 code implementation • 11 Oct 2023 • Lang Qin, Yao Zhang, Hongru Liang, Jun Wang, Zhenglu Yang
Accurate knowledge selection is critical in knowledge-grounded dialogue systems.
no code implementations • 14 Dec 2023 • Zhaochen Li, Fengheng Li, Wei Feng, Honghe Zhu, An Liu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Jingping Shao, Zhenglu Yang
At the planning stage, we propose a PlanNet to generate the layout of the product and other visual components considering both the appearance features of the product and semantic features of the text, which improves the diversity and rationality of the layouts.
no code implementations • ACL 2022 • Ling.Yu Zhu, Zhengkun Zhang, Jun Wang, Hongbin Wang, Haiying Wu, Zhenglu Yang
Empathetic dialogue assembles emotion understanding, feeling projection, and appropriate response generation.