no code implementations • 16 Jan 2024 • Haoxin Liu, Wenli Zhang, Jiaheng Xie, Buomsoo Kim, Zhu Zhang, Yidong Chai
On the depression detection task, our method (F1 = 0. 975~0. 978) significantly outperforms traditional supervised learning paradigms, including feature engineering (F1 = 0. 760) and architecture engineering (F1 = 0. 756).
no code implementations • 24 May 2023 • Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
Fact-checking is an essential task in NLP that is commonly utilized for validating the factual accuracy of claims.
no code implementations • 6 Mar 2023 • Wenli Zhang, Jiaheng Xie, Zhu Zhang, Xiang Liu
Depression is a common disease worldwide.
no code implementations • 20 Dec 2022 • Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
To better mimic human-level conversations that usually fuse various dialog modes, it is essential to build a system that can effectively handle both TOD and ODD and access different knowledge sources.
1 code implementation • 24 Jun 2022 • Miaoran Li, Baolin Peng, Jianfeng Gao, Zhu Zhang
Existing studies in conversational AI mostly treat task-oriented dialog (TOD) and question answering (QA) as separate tasks.
no code implementations • 22 Apr 2022 • Xin Li, Hsinchun Chen, Jiexun Li, Zhu Zhang
Predicting gene functions is a challenge for biologists in the post genomic era.
no code implementations • 21 Oct 2021 • Baolin Peng, Chunyuan Li, Zhu Zhang, Jinchao Li, Chenguang Zhu, Jianfeng Gao
We propose SYNERGY, a hybrid learning framework where a task bot is developed in two steps: (i) Symbolic knowledge to neural networks: Large amounts of simulated dialog sessions are generated based on task-specific symbolic knowledge which is represented as a task schema consisting of dialog flows and task-oriented databases.
no code implementations • CVPR 2021 • Yang Zhao, Zhou Zhao, Zhu Zhang, Zhijie Lin
Temporal video grounding aims to localize the target segment which is semantically aligned with the given sentence in an untrimmed video.
no code implementations • 2 Jun 2021 • Zhu Zhang, Chang Zhou, Jianxin Ma, Zhijie Lin, Jingren Zhou, Hongxia Yang, Zhou Zhao
Further, we design a history sampler to select informative fragments for rehearsal training, making the memory focus on the crucial information.
1 code implementation • 31 May 2021 • Shuai Bai, Zhedong Zheng, Xiaohan Wang, Junyang Lin, Zhu Zhang, Chang Zhou, Yi Yang, Hongxia Yang
In this paper, we apply one new modality, i. e., the language description, to search the vehicle of interest and explore the potential of this task in the real-world scenario.
no code implementations • NeurIPS 2021 • Zhu Zhang, Jianxin Ma, Chang Zhou, Rui Men, Zhikang Li, Ming Ding, Jie Tang, Jingren Zhou, Hongxia Yang
Conditional image synthesis aims to create an image according to some multi-modal guidance in the forms of textual descriptions, reference images, and image blocks to preserve, as well as their combinations.
no code implementations • NeurIPS 2021 • Zhu Zhang, Jianxin Ma, Chang Zhou, Rui Men, Zhikang Li, Ming Ding, Jie Tang, Jingren Zhou, Hongxia Yang
Conditional image synthesis aims to create an image according to some multi-modal guidance in the forms of textual descriptions, reference images, and image blocks to preserve, as well as their combinations.
no code implementations • 1 Jan 2021 • Zhu Zhang, Chang Zhou, Zhou Zhao, Zhijie Lin, Jingren Zhou, Hongxia Yang
Existing reasoning tasks often follow the setting of "reasoning while experiencing", which has an important assumption that the raw contents can be always accessed while reasoning.
no code implementations • 1 Jan 2021 • Zhijie Lin, Zhou Zhao, Zhu Zhang, Huai Baoxing, Jing Yuan
Model Agnostic Meta-Learning~(MAML)~(\cite{finn2017model}) is one of the most well-known gradient-based meta learning algorithms, that learns the meta-initialization through the inner and outer optimization loop.
no code implementations • 1 Jan 2021 • Shen Kai, Lingfei Wu, Siliang Tang, Fangli Xu, Zhu Zhang, Yu Qiang, Yueting Zhuang
The task of visual question generation~(VQG) aims to generate human-like questions from an image and potentially other side information (e. g. answer type or the answer itself).
no code implementations • ACL 2021 • Baolin Peng, Chunyuan Li, Zhu Zhang, Chenguang Zhu, Jinchao Li, Jianfeng Gao
For task-oriented dialog systems to be maximally useful, it must be able to process conversations in a way that is (1) generalizable with a small number of training examples for new task domains, and (2) robust to user input in various styles, modalities or domains.
no code implementations • NeurIPS 2020 • Zhu Zhang, Zhou Zhao, Zhijie Lin, Jieming Zhu, Xiuqiang He
Weakly-supervised vision-language grounding aims to localize a target moment in a video or a specific region in an image according to the given sentence query, where only video-level or image-level sentence annotations are provided during training.
1 code implementation • 19 Aug 2020 • Zhu Zhang, Zhijie Lin, Zhou Zhao, Jieming Zhu, Xiuqiang He
Thus, these methods fail to distinguish the target moment from plausible negative moments.
no code implementations • 16 Aug 2020 • Zhu Zhang, Zhou Zhao, Zhijie Lin, Baoxing Huai, Nicholas Jing Yuan
Spatio-temporal video grounding aims to retrieve the spatio-temporal tube of a queried object according to the given sentence.
1 code implementation • CVPR 2020 • Zhu Zhang, Zhou Zhao, Yang Zhao, Qi. Wang, Huasheng Liu, Lianli Gao
In this paper, we consider a novel task, Spatio-Temporal Video Grounding for Multi-Form Sentences (STVG).
no code implementations • 19 Nov 2019 • Zhijie Lin, Zhou Zhao, Zhu Zhang, Qi. Wang, Huasheng Liu
Video moment retrieval is to search the moment that is most relevant to the given natural language query.
no code implementations • 28 Jun 2019 • Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Xiaofei He
Concretely, we first develop a hierarchical convolutional self-attention encoder to efficiently model long-form video contents, which builds the hierarchical structure for video sequences and captures question-aware long-range dependencies from video context.
no code implementations • 28 Jun 2019 • Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Deng Cai
Thus, we consider a new task to localize unseen activities in videos via image queries, named Image-Based Activity Localization.
1 code implementation • 6 Jun 2019 • Zhu Zhang, Zhijie Lin, Zhou Zhao, Zhenxin Xiao
Query-based moment retrieval aims to localize the most relevant moment in an untrimmed video according to the given natural language query.
1 code implementation • ACL 2018 • Amulya Gupta, Zhu Zhang
With the recent success of Recurrent Neural Networks (RNNs) in Machine Translation (MT), attention mechanisms have become increasingly popular.
no code implementations • 27 Sep 2013 • Ahmed Abbasi, Zhu Zhang, Hsinchun Chen
Existing fake website detection systems are unable to effectively detect fake websites.