1 code implementation • ACL 2022 • Yizhu Liu, Qi Jia, Kenny Zhu
In this paper, we propose a length-aware attention mechanism (LAAM) to adapt the encoding of the source based on the desired length.
1 code implementation • NAACL 2022 • Yizhu Liu, Qi Jia, Kenny Zhu
In this paper, we propose a new automatic reference-free evaluation metric that compares semantic distribution between source document and summary by pretrained language models and considers summary compression ratio.
no code implementations • 11 Sep 2024 • Qi Jia, Xiang Yue, Tianyu Zheng, Jie Huang, Bill Yuchen Lin
To facilitate automatic assessment on \DataName{}, GPT-4 is employed as the evaluator, tasked with reviewing the quality of the final response generated by the target LLMs given multi-turn dialogue scripts.
1 code implementation • 12 Aug 2024 • Yixin Guo, Yu Liu, Jianghao Li, Weimin WANG, Qi Jia
Then, we extract realistic features of seen samples and mix them with synthetic features together, allowing the model to train seen and unseen classes jointly.
Human-Object Interaction Detection Zero-Shot Human-Object Interaction Detection
1 code implementation • CVPR 2024 • Yu Liu, Yaqi Cai, Qi Jia, Binglin Qiu, Weimin WANG, Nan Pu
To tackle this problem, we devise a Region-Aligned Proxy Learning (RAPL) framework, which comprises a Channel-wise Region Alignment (CRA) module and a Semi-Supervised Proxy Learning (SemiPL) strategy.
2 code implementations • 25 Apr 2024 • Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu, Chengjian Zheng, Diankai Zhang, Ning Wang, Xintao Qiu, Yuanbo Zhou, Kongxian Wu, Xinwei Dai, Hui Tang, Wei Deng, Qingquan Gao, Tong Tong, Jae-Hyeon Lee, Ui-Jin Choi, Min Yan, Xin Liu, Qian Wang, Xiaoqian Ye, Zhan Du, Tiansen Zhang, Long Peng, Jiaming Guo, Xin Di, Bohao Liao, Zhibo Du, Peize Xia, Renjing Pei, Yang Wang, Yang Cao, ZhengJun Zha, Bingnan Han, Hongyuan Yu, Zhuoyuan Wu, Cheng Wan, Yuqing Liu, Haodong Yu, Jizhe Li, Zhijuan Huang, Yuan Huang, Yajun Zou, Xianyu Guan, Qi Jia, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Hyeon-Cheol Moon, Tae-hyun Jeong, Yoonmo Yang, Jae-Gon Kim, Jinwoo Jeong, Sunjei Kim
This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs.
no code implementations • 9 Mar 2024 • Yanyi Zhang, Qi Jia, Xin Fan, Yu Liu, Ran He
Inspired by this, we propose a novel A-O disentangled framework for CZSL, namely Class-specified Cascaded Network (CSCNet).
1 code implementation • 12 Jan 2024 • Tianyu Zheng, Shuyue Guo, Xingwei Qu, Jiawei Guo, Xinrun Du, Qi Jia, Chenghua Lin, Wenhao Huang, Jie Fu, Ge Zhang
In this paper, we introduce Kun, a novel approach for creating high-quality instruction-tuning datasets for large language models (LLMs) without relying on manual annotations.
1 code implementation • 18 Oct 2023 • Qi Jia, Siyu Ren, Yizhu Liu, Kenny Q. Zhu
Despite tremendous improvements in natural language generation, summarization models still suffer from the unfaithfulness issue.
1 code implementation • 12 Oct 2023 • Siyu Ren, Qi Jia, Kenny Q. Zhu
The quadratic complexity of the attention module makes it gradually become the bulk of compute in Transformer-based LLMs during generation.
1 code implementation • 23 May 2023 • Qi Jia, Haifeng Tang, Kenny Q. Zhu
Changing speaker names consistently throughout a dialogue should not affect its meaning and corresponding outputs for text generation from dialogues.
2 code implementations • CVPR 2023 • Fei Du, Peng Yang, Qi Jia, Fengtao Nan, Xiaoting Chen, Yun Yang
In this paper, our goal is to design a simple learning paradigm for long-tail visual recognition, which not only improves the robustness of the feature extractor but also alleviates the bias of the classifier towards head classes while reducing the training skills and overhead.
Ranked #1 on Long-tail Learning on CIFAR-10-LT (ρ=10)
no code implementations • 24 Feb 2023 • Cunjuan Zhu, Qi Jia, Wei Chen, Yanming Guo, Yu Liu
Video-Text Retrieval (VTR) aims to search for the most relevant video related to the semantics in a given sentence, and vice versa.
1 code implementation • 21 Nov 2022 • Qi Jia, Yizhu Liu, Haifeng Tang, Kenny Q. Zhu
Curriculum learning has shown promising improvements in multiple domains by training machine learning models from easy samples to hard ones.
no code implementations • 19 Nov 2022 • Xinwei Xue, Gaoyu Wang, Long Ma, Qi Jia, Yi Wang
In this paper, we design an adjacent slice feature fusion model to introduce information from adjacent slices.
no code implementations • 18 Oct 2022 • Qi Jia, Yizhu Liu, Siyu Ren, Kenny Q. Zhu
Abstractive dialogue summarization is to generate a concise and fluent summary covering the salient information in a dialogue among two or more interlocutors.
no code implementations • 7 Jun 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.
1 code implementation • 27 May 2022 • Yuqing Liu, Qi Jia, Shanshe Wang, Siwei Ma, Wen Gao
Image super-resolution (SR) has been widely investigated in recent years.
1 code implementation • Findings (NAACL) 2022 • Qi Jia, Yizhu Liu, Haifeng Tang, Kenny Q. Zhu
Previous dialogue summarization techniques adapt large language models pretrained on the narrative text by injecting dialogue-specific features into the models.
1 code implementation • 26 Apr 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
Existing BDE methods have no unified solution for various BDE situations, and directly learn a mapping for each pixel from LBD image to the desired value in HBD image, which may change the given high-order bits and lead to a huge deviation from the ground truth.
no code implementations • 5 Jan 2022 • Yuqing Liu, Qi Jia, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
It is challenging to restore low-resolution (LR) images to super-resolution (SR) images with correct and clear details.
1 code implementation • CVPR 2022 • Qi Jia, Shuilian Yao, Yu Liu, Xin Fan, Risheng Liu, Zhongxuan Luo
To tackle camouflaged object detection (COD), we are inspired by humans attention coupled with the coarse-to-fine detection strategy, and thereby propose an iterative refinement framework, coined SegMaR, which integrates Segment, Magnify and Reiterate in a multi-stage detection fashion.
1 code implementation • CVPR 2021 • Qi Jia, ZhengJun Li, Xin Fan, Haotian Zhao, Shiyu Teng, Xinchen Ye, Longin Jan Latecki
Generating high-quality stitched images with natural structures is a challenging task in computer vision.
1 code implementation • 4 Dec 2020 • Qi Jia, Hongru Huang, Kenny Q. Zhu
In this paper, we propose the task of relation classification of interlocutors based on their dialogues.
Ranked #1 on Dialog Relation Extraction on DDRel
1 code implementation • EMNLP 2020 • Qi Jia, Yizhu Liu, Siyu Ren, Kenny Q. Zhu, Haifeng Tang
In this paper, we propose a dialogue extraction algorithm to transform a dialogue history into threads based on their dependency relations.
1 code implementation • 19 May 2020 • Qi Jia, Mengxue Zhang, Shengyao Zhang, Kenny Q. Zhu
Matching question-answer relations between two turns in conversations is not only the first step in analyzing dialogue structures, but also valuable for training dialogue systems.
no code implementations • 9 Aug 2017 • Qi Jia, Meiyu Yu, Xin Fan, Haojie Li
We develop dual deep networks with memorable gated recurrent units (GRUs), and sequentially feed these two types of features into the dual networks, respectively.
2 code implementations • 26 Aug 2016 • Qi Jia, Xin Fan, Zhongxuan Luo, Lianbo Song, Tie Qiu
Detecting elliptical objects from an image is a central task in robot navigation and industrial diagnosis where the detection time is always a critical issue.