no code implementations • EMNLP 2021 • Haolan Zhan, Lei Shen, Hongshen Chen, Hainan Zhang
Knowledge-grounded dialogue generation has achieved promising performance with the engagement of external knowledge sources.
no code implementations • 10 Apr 2025 • Qi Bi, Jingjun Yi, Hao Zheng, Haolan Zhan, Wei Ji, Yawen Huang, Yuexiang Li
By aligning these probability paths in the latent space, the state embeddings are able to represent the same content distribution regardless of the style differences.
1 code implementation • 10 Apr 2025 • Qi Bi, Jingjun Yi, Haolan Zhan, Wei Ji, Gui-Song Xia
Then, the pre- and post- style hallucinate state embeddings are projected into the hyperbolic manifold.
no code implementations • CVPR 2025 • Qi Bi, Jingjun Yi, Huimin Huang, Hao Zheng, Haolan Zhan, Yawen Huang, Yuexiang Li, Xian Wu, Yefeng Zheng
Night-time scene segmentation is a critical yet challenging task in the real-world applications, primarily due to the complicated lighting conditions.
no code implementations • 29 Dec 2024 • Yao Tong, Weijun Li, Xuanli He, Haolan Zhan, Qiongkai Xu
The success of DNNs often depends on training with large-scale datasets, but building such datasets is both expensive and challenging.
no code implementations • 4 Oct 2024 • Shilin Qu, Weiqing Wang, Xin Zhou, Haolan Zhan, Zhuang Li, Lizhen Qu, Linhao Luo, Yuan-Fang Li, Gholamreza Haffari
Our empirical results show: (i) the quality of the SCNs derived from synthetic data is comparable to that from real dialogues annotated with gold frames, and (ii) the quality of the SCNs extracted from real data, annotated with either silver (predicted) or gold frames, surpasses that without the frame annotations.
1 code implementation • 26 Jul 2024 • Jingjun Yi, Qi Bi, Hao Zheng, Haolan Zhan, Wei Ji, Yawen Huang, Yuexiang Li, Yefeng Zheng
In this paper, we present a novel Spectral-dEcomposed Token (SET) learning framework to advance the frontier.
4 code implementations • 22 Jun 2024 • Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, Thong Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu, Zijian Wang, Binyuan Hui, Niklas Muennighoff, David Lo, Daniel Fried, Xiaoning Du, Harm de Vries, Leandro von Werra
In addition, using multiple tools to solve a task needs compositional reasoning by accurately understanding complex instructions.
Ranked #1 on
Code Generation
on BigCodeBench-Instruct
1 code implementation • 16 Jun 2024 • Zhuang Li, Yuncheng Hua, Thuy-Trang Vu, Haolan Zhan, Lizhen Qu, Gholamreza Haffari
Recent studies emphasize that manually ensuring a consistent response style and maintaining high data quality in training sets can significantly improve the performance of fine-tuned Large Language Models (LLMs) while reducing the number of training examples needed.
1 code implementation • 21 Apr 2024 • Tao Feng, Lizhen Qu, Zhuang Li, Haolan Zhan, Yuncheng Hua, Gholamreza Haffari
Machine learning models have made incredible progress, but they still struggle when applied to examples from unseen domains.
no code implementations • 20 Feb 2024 • Qi Bi, Beichen Zhou, Jingjun Yi, Wei Ji, Haolan Zhan, Gui-Song Xia
In this paper, we propose the task of domain generalized oriented object detection, which intends to explore the generalization of oriented object detectors on arbitrary unseen target domains.
no code implementations • 17 Feb 2024 • Haolan Zhan, Zhuang Li, Xiaoxi Kang, Tao Feng, Yuncheng Hua, Lizhen Qu, Yi Ying, Mei Rianto Chandra, Kelly Rosalin, Jureynolds Jureynolds, Suraj Sharma, Shilin Qu, Linhao Luo, Lay-Ki Soon, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari
While collecting sufficient human-authored data is costly, synthetic conversations provide suitable amounts of data to help mitigate the scarcity of training data, as well as the chance to assess the alignment between LLMs and humans in the awareness of social norms.
no code implementations • 2 Feb 2024 • Haolan Zhan, YuFei Wang, Tao Feng, Yuncheng Hua, Suraj Sharma, Zhuang Li, Lizhen Qu, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari
Negotiation is a crucial ability in human communication.
1 code implementation • 29 Jan 2024 • Yuncheng Hua, Zhuang Li, Linhao Luo, Kadek Ananta Satriadi, Tao Feng, Haolan Zhan, Lizhen Qu, Suraj Sharma, Ingrid Zukerman, Zhaleh Semnani-Azad, Gholamreza Haffari
We have released our code and software at:~\url{https://github. com/AnonymousEACLDemo/SADAS}.
no code implementations • 16 Jan 2024 • Qi Bi, Wei Ji, Jingjun Yi, Haolan Zhan, Gui-Song Xia
To comprehensively learn the relation between informative patches and fine-grained semantics, the multi-instance knowledge distillation is implemented on both the region/image crop pairs from the teacher and student net, and the region-image crops inside the teacher / student net, which we term as intra-level multi-instance distillation and inter-level multi-instance distillation.
Fine-Grained Visual Categorization
Knowledge Distillation
+2
no code implementations • 11 Jan 2024 • Aditya Joshi, Raj Dabre, Diptesh Kanojia, Zhuang Li, Haolan Zhan, Gholamreza Haffari, Doris Dippold
Motivated by the performance degradation of NLP models for dialectal datasets and its implications for the equity of language technologies, we survey past research in NLP for dialects in terms of datasets, and approaches.
2 code implementations • NeurIPS 2023 • Han Hu, Haolan Zhan, Yujin Huang, Di Liu
There are currently several publicly accessible GUI page datasets for phones, but none for pairwise GUIs between phones and tablets.
no code implementations • 22 May 2023 • Haolan Zhan, Xuanli He, Qiongkai Xu, Yuxiang Wu, Pontus Stenetorp
The burgeoning progress in the field of Large Language Models (LLMs) heralds significant benefits due to their unparalleled capacities.
1 code implementation • 2 May 2023 • Haolan Zhan, Sameen Maruf, Lizhen Qu, YuFei Wang, Ingrid Zukerman, Gholamreza Haffari
Flowchart-grounded troubleshooting dialogue (FTD) systems, which follow the instructions of a flowchart to diagnose users' problems in specific domains (e. g., vehicle, laptop), have been gaining research interest in recent years.
1 code implementation • 24 Apr 2023 • Haolan Zhan, Zhuang Li, YuFei Wang, Linhao Luo, Tao Feng, Xiaoxi Kang, Yuncheng Hua, Lizhen Qu, Lay-Ki Soon, Suraj Sharma, Ingrid Zukerman, Zhaleh Semnani-Azad, Gholamreza Haffari
To the best of our knowledge, SocialDial is the first socially-aware dialogue dataset that covers multiple social factors and has fine-grained labels.
Cultural Vocal Bursts Intensity Prediction
Synthetic Data Generation
no code implementations • 18 Apr 2023 • Haolan Zhan, Xuming Lin, Shaobo Cui, Zhongzhou Zhao, Wei Zhou, Haiqing Chen
Specifically, tabular data and persona information are firstly represented as latent variables separately.
no code implementations • 18 Dec 2022 • Haolan Zhan, YuFei Wang, Tao Feng, Yuncheng Hua, Suraj Sharma, Zhuang Li, Lizhen Qu, Gholamreza Haffari
Negotiation is one of the crucial abilities in human communication, and there has been a resurgent research interest in negotiation dialogue systems recently, which goal is to empower intelligent agents with such ability that can efficiently help humans resolve conflicts or reach beneficial agreements.
no code implementations • 14 Sep 2021 • Lei Shen, Haolan Zhan, Xin Shen, Hongshen Chen, Xiaofang Zhao, Xiaodan Zhu
The training method updates parameters of a trained NCMs on two small sets with newly maintained and removed samples, respectively.
no code implementations • 13 Sep 2021 • Lei Shen, Haolan Zhan, Xin Shen, Yonghao Song, Xiaofang Zhao
Specifically, we obtain a group of images (PVIs) for each post based on a pre-trained word-image mapping model.
no code implementations • NAACL 2021 • Haolan Zhan, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Yongjun Bao, Yanyan Lan
In particular, a sequential knowledge transition model equipped with a pre-trained knowledge-aware response generator (SKT-KG) formulates the high-level knowledge transition and fully utilizes the limited knowledge data.
no code implementations • 2 Mar 2021 • Haolan Zhan, Hainan Zhang, Hongshen Chen, Lei Shen, Zhuoye Ding, Yongjun Bao, Weipeng Yan, Yanyan Lan
To tackle this problem, we propose an adaptive posterior network based on Transformer architecture that can utilize user-cared information from customer reviews.
no code implementations • 18 Feb 2021 • Lei Shen, Haolan Zhan, Xin Shen, Yang Feng
Open-domain multi-turn conversations mainly have three features, which are hierarchical semantic structure, redundant information, and long-term dependency.
no code implementations • 16 Feb 2021 • Haolan Zhan, Hainan Zhang, Hongshen Chen, Lei Shen, Yanyan Lan, Zhuoye Ding, Dawei Yin
A simple and effective way is to extract keywords directly from the knowledge-base of products, i. e., attributes or title, as the recommendation reason.
no code implementations • ACL 2019 • Lei Shen, Yang Feng, Haolan Zhan
Multi-turn conversations consist of complex semantic structures, and it is still a challenge to generate coherent and diverse responses given previous utterances.