no code implementations • SemEval (NAACL) 2022 • Yue Zhou, Bowei Wei, Jianyu Liu, Yang Yang
Synonym and antonym practice are the most common practices in our early childhood.
1 code implementation • 11 Jun 2025 • Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Chunyu Miao, Dongyuan Li, Aiwei Liu, Yue Zhou, Yankai Chen, Weizhi Zhang, Yangning Li, Liancheng Fang, Renhe Jiang, Philip S. Yu
This paper argues that progress in AI should not be measured by how independent systems become, but by how well they can work with humans.
no code implementations • 26 May 2025 • Kaiqing Lin, Zhiyuan Yan, Ke-Yue Zhang, Li Hao, Yue Zhou, Yuzhen Lin, Weixiang Li, Taiping Yao, Shouhong Ding, Bin Li
Finally, we introduce user-specific customization, where we model the unique characteristics of the target face identity and perform semantic reasoning via MLLM to enable personalized and explainable deepfake detection.
no code implementations • 22 May 2025 • Yue Zhou, Barbara Di Eugenio
Despite LLMs' explicit alignment against demographic stereotypes, they have been shown to exhibit biases under various social contexts.
1 code implementation • 15 May 2025 • Bin-Bin Gao, Yue Zhou, Jiangtao Yan, Yuezhi Cai, Weixi Zhang, Meng Wang, Jun Liu, Yong liu, Lei Wang, Chengjie Wang
Universal visual anomaly detection aims to identify anomalies from novel or unseen vision domains without additional fine-tuning, which is critical in open scenarios.
2 code implementations • 1 May 2025 • Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Yankai Chen, Chunyu Miao, Hoang Nguyen, Yue Zhou, Weizhi Zhang, Liancheng Fang, Langzhou He, Yangning Li, Dongyuan Li, Renhe Jiang, Xue Liu, Philip S. Yu
Recent advances in large language models (LLMs) have sparked growing interest in building fully autonomous agents.
1 code implementation • 31 Mar 2025 • Hongwei Ren, Xiaopeng Lin, Hongxiang Huang, Yue Zhou, Bojun Cheng
Eye-tracking is a vital technology for human-computer interaction, especially in wearable devices such as AR, VR, and XR.
no code implementations • 8 Mar 2025 • Xinan He, Yue Zhou, Bing Fan, Bin Li, Guopu Zhu, Feng Ding
In this work, we integrate Multimodal Large Language Models (MLLMs) within DM-based face forensics, and propose a fine-grained analysis triad framework called VLForgery, that can 1) predict falsified facial images; 2) locate the falsified face regions subjected to partial synthesis; and 3) attribute the synthesis with specific generators.
1 code implementation • 26 Feb 2025 • Henry Peng Zou, Zhengyao Gu, Yue Zhou, Yankai Chen, Weizhi Zhang, Liancheng Fang, Yibo Wang, Yangning Li, Kay Liu, Philip S. Yu
Test-time computing approaches, which leverage additional computational resources during inference, have been proven effective in enhancing large language model performance.
1 code implementation • 21 Feb 2025 • Yue Zhou, Yi Chang, Yuan Wu
In conclusion, M$^3$ is a simple yet effective model merging method that significantly enhances the performance of the merged model by randomly generating contribution ratios for two fine-tuned LLMs.
no code implementations • 27 Jan 2025 • Xiaopeng Lin, Yulong Huang, Hongwei Ren, Zunchang Liu, Yue Zhou, Haotian Fu, Bojun Cheng
Motion deblurring addresses the challenge of image blur caused by camera or scene movement.
2 code implementations • 23 Jan 2025 • Peiyuan Zhang, Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Yue Zhou, Xiaosong Jia, Xudong Lu, Jingdong Chen, Xiang Li, Junchi Yan, Yansheng Li
Based on the views, a scale augmentation module and an angle acquisition module are constructed.
no code implementations • 30 Dec 2024 • Hongwei Ren, Fei Ma, Xiaopeng Lin, Yuetong Fang, Hongxiang Huang, Yulong Huang, Yue Zhou, Haotian Fu, ZiYi Yang, Fei Richard Yu, Bojun Cheng
Event cameras are biologically inspired sensors that emit events asynchronously with remarkable temporal resolution, garnering significant attention from both industry and academia.
no code implementations • 16 Dec 2024 • Xiaopeng Lin, Hongwei Ren, Yulong Huang, Zunchang Liu, Yue Zhou, Haotian Fu, Biao Pan, Bojun Cheng
Effectively utilizing the high-temporal-resolution event data is crucial for extracting precise motion information and enhancing deblurring performance.
no code implementations • 5 Dec 2024 • Chengwei Zhang, Yue Zhou, Rui Zhao, Yidong Chen, Xiaodong Shi
Speech-to-text translation (ST) is a cross-modal task that involves converting spoken language into text in a different language.
no code implementations • 30 Nov 2024 • Yue Zhou, Barbara Di Eugenio, Lu Cheng
This paper studies the performance of large language models (LLMs), particularly regarding demographic fairness, in solving real-world healthcare tasks.
1 code implementation • 16 Nov 2024 • Yue Zhou, Mengcheng Lan, Xiang Li, Litong Feng, Yiping Ke, Xue Jiang, Qingyun Li, Xue Yang, Wayne Zhang
Remote sensing (RS) visual grounding aims to use natural language expression to locate specific objects (in the form of the bounding box or segmentation mask) in RS images, enhancing human interaction with intelligent RS interpretation systems.
1 code implementation • 13 Oct 2024 • Mengcheng Lan, Chaofeng Chen, Yue Zhou, Jiaxing Xu, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang
Multimodal Large Language Models (MLLMs) have shown exceptional capabilities in vision-language tasks; however, effectively integrating image segmentation into these models remains a significant challenge.
no code implementations • 4 Oct 2024 • Yulong Huang, Zunchang Liu, Changchun Feng, Xiaopeng Lin, Hongwei Ren, Haotian Fu, Yue Zhou, Hong Xing, Bojun Cheng
The PRF enables efficient long sequence learning while maintaining parallel training.
1 code implementation • 16 Sep 2024 • Vinay Samuel, Yue Zhou, Henry Peng Zou
However, these approaches are often validated with traditional benchmarks and early-stage LLMs, leaving uncertainty about their effectiveness when evaluating state-of-the-art LLMs on the contamination of more challenging benchmarks.
1 code implementation • 25 Jul 2024 • Vinay Samuel, Henry Peng Zou, Yue Zhou, Shreyas Chaudhari, Ashwin Kalyan, Tanmay Rajpurohit, Ameet Deshpande, Karthik Narasimhan, Vishvak Murahari
Persona agents, which are LLM agents conditioned to act according to an assigned persona, enable contextually rich and user aligned interactions across domains like education and healthcare.
1 code implementation • 1 Jul 2024 • Yue Zhou, Henry Peng Zou, Barbara Di Eugenio, Yang Zhang
Specifically, we query the model to generate a fallacious yet deceptively real procedure for the harmful behavior.
1 code implementation • 13 Jun 2024 • Yue Zhou, Litong Feng, Yiping Ke, Xue Jiang, Junchi Yan, Xue Yang, Wayne Zhang
Vision-Language Foundation Models (VLFMs) have made remarkable progress on various multimodal tasks, such as image captioning, image-text retrieval, visual question answering, and visual grounding.
no code implementations • 24 May 2024 • Xinan He, Yue Zhou, Wei Ye, Feng Ding
The primary objective of the proposed method is to blend hybrid forgery semantics derived from high-frequency components into authentic imagery, named aberrations.
1 code implementation • 9 May 2024 • Hongwei Ren, Yue Zhou, Jiadong Zhu, Haotian Fu, Yulong Huang, Xiaopeng Lin, Yuetong Fang, Fei Ma, Hao Yu, Bojun Cheng
In contrast, Point Cloud is a popular representation for processing 3-dimensional data and serves as an alternative method to exploit local and global spatial features.
1 code implementation • 24 Apr 2024 • Henry Peng Zou, Vinay Samuel, Yue Zhou, Weizhi Zhang, Liancheng Fang, Zihe Song, Philip S. Yu, Cornelia Caragea
To address these limitations, we present ImplicitAVE, the first, publicly available multimodal dataset for implicit attribute value extraction.
1 code implementation • 17 Apr 2024 • Yue Zhou, Yada Zhu, Diego Antognini, Yoon Kim, Yang Zhang
This paper studies the relationship between the surface form of a mathematical problem and its solvability by large language models.
1 code implementation • 16 Apr 2024 • Yue Zhou, Barbara Di Eugenio, Brian Ziebart, Lisa Sharp, Bing Liu, Nikolaos Agadakos
Health coaching helps patients achieve personalized and lifestyle-related goals, effectively managing chronic conditions and alleviating mental health issues.
1 code implementation • COLING 2022 • Yue Zhou, Barbara Di Eugenio, Brian Ziebart, Lisa Sharp, Bing Liu, Ben Gerber, Nikolaos Agadakos, Shweta Yadav
In this paper, we propose to build a dialogue system that converses with the patients, helps them create and accomplish specific goals, and can address their emotions with empathy.
no code implementations • CVPR 2024 • Hongwei Ren, Jiadong Zhu, Yue Zhou, Haotian Fu, Yulong Huang, Bojun Cheng
These cameras implicitly capture movement and depth information in events, making them appealing sensors for Camera Pose Relocalization (CPR) tasks.
1 code implementation • 7 Feb 2024 • Yulong Huang, Xiaopeng Lin, Hongwei Ren, Haotian Fu, Yue Zhou, Zunchang Liu, Biao Pan, Bojun Cheng
Spiking neural networks (SNNs) are promising brain-inspired energy-efficient models.
1 code implementation • 27 Jan 2024 • Yue Zhou, Chenlu Guo, Xu Wang, Yi Chang, Yuan Wu
Leveraging large models, these data augmentation techniques have outperformed traditional approaches.
no code implementations • 31 Oct 2023 • Ruijun Shi, Yue Zhou, Tianyu Zhao, Zhoujian Cao, Zhixiang Ren
Space-based gravitational wave (GW) detection is one of the most anticipated GW detection projects in the next decade, which promises to detect abundant compact binary systems.
1 code implementation • 23 Oct 2023 • Henry Peng Zou, Yue Zhou, Weizhi Zhang, Cornelia Caragea
During crisis events, people often use social media platforms such as Twitter to disseminate information about the situation, warnings, advice, and support.
1 code implementation • 23 Oct 2023 • Henry Peng Zou, Yue Zhou, Cornelia Caragea, Doina Caragea
The shared real-time information about natural disasters on social media platforms like Twitter and Facebook plays a critical role in informing volunteers, emergency managers, and response organizations.
no code implementations • 11 Oct 2023 • Hongwei Ren, Yue Zhou, Yulong Huang, Haotian Fu, Xiaopeng Lin, Jie Song, Bojun Cheng
Moreover, it also achieves SOTA performance across all methods on three datasets, utilizing approximately 0. 3\% of the parameters and 0. 5\% of power consumption employed by artificial neural networks (ANNs).
1 code implementation • 13 Sep 2023 • Haoqin Hong, Yue Zhou, Xiangyu Shu, Xiaofang Hu
Traffic sign detection is an important research direction in intelligent driving.
Ranked #1 on
Traffic Sign Detection
on CCTSDB2021
no code implementations • 31 Aug 2023 • Tianyu Zhao, Yue Zhou, Ruijun Shi, Zhoujian Cao, Zhixiang Ren
The detection of Extreme Mass Ratio Inspirals (EMRIs) is intricate due to their complex waveforms, extended duration, and low signal-to-noise ratio (SNR), making them more challenging to be identified compared to compact binary coalescences.
no code implementations • 19 Aug 2023 • Hongwei Ren, Yue Zhou, Haotian Fu, Yulong Huang, Renjing Xu, Bojun Cheng
In the experiment, TTPOINT emerged as the SOTA method on three datasets while also attaining SOTA among point cloud methods on all five datasets.
1 code implementation • 7 Aug 2023 • Zhongliang Jiang, Yue Zhou, Dongliang Cao, Nassir Navab
The recovery of morphologically accurate anatomical images from deformed ones is challenging in ultrasound (US) image acquisition, but crucial to accurate and consistent diagnosis, particularly in the emerging field of computer-assisted diagnosis.
14 code implementations • 14 Dec 2022 • Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi Liu, Shilong Zhang, Kai Chen
In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection.
Ranked #1 on
Object Detection In Aerial Images
on DOTA 1.0
no code implementations • SemEval (NAACL) 2022 • Junyuan Shang, Shuohuan Wang, Yu Sun, Yanjun Yu, Yue Zhou, Li Xiang, Guixiu Yang
This paper describes our winning system on SemEval 2022 Task 7: Identifying Plausible Clarifications of Implicit and Underspecified Phrases in Instructional Texts.
3 code implementations • 13 Oct 2022 • Xue Yang, Gefan Zhang, Wentong Li, Xuehui Wang, Yue Zhou, Junchi Yan
Oriented object detection emerges in many applications from aerial images to autonomous driving, while many existing detection benchmarks are annotated with horizontal bounding box only which is also less costive than fine-grained rotated box, leading to a gap between the readily available training corpus and the rising demand for oriented object detection.
1 code implementation • 22 Sep 2022 • Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin Tang, Tao He, Junchi Yan
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects.
1 code implementation • 6 Jul 2022 • Yuanzhi Duan, Yue Zhou, Peng He, Qiang Liu, Shukai Duan, Xiaofang Hu
In this paper, we propose a novel Feature Shift Minimization (FSM) method to compress CNN models, which evaluates the feature shift by converging the information of both features and filters.
1 code implementation • 28 Apr 2022 • Yue Zhou, Xue Yang, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen
We present an open-source toolbox, named MMRotate, which provides a coherent algorithm framework of training, inferring, and evaluation for the popular rotated object detection algorithm based on deep learning.
3 code implementations • 29 Jan 2022 • Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian
This is in contrast to recent Gaussian modeling based rotation detectors e. g. GWD loss and KLD loss that involve a human-specified distribution distance metric which require additional hyperparameter tuning that vary across datasets and detectors.
no code implementations • 5 Jan 2022 • Yang Zhou, Jiuhong Xiao, Yue Zhou, Giuseppe Loianno
Multi-robot systems such as swarms of aerial robots are naturally suited to offer additional flexibility, resilience, and robustness in several tasks compared to a single robot by enabling cooperation among the agents.
1 code implementation • 10 Dec 2021 • Yuanzhi Duan, Xiaofang Hu, Yue Zhou, Qiang Liu, Shukai Duan
In this paper, by exploring the similarities between feature maps, we propose a novel filter pruning method, Central Filter (CF), which suggests that a filter is approximately equal to a set of other filters after appropriate adjustments.
1 code implementation • 12 Nov 2021 • Xue Yang, Yue Zhou, Junchi Yan
AlphaRotate is an open-source Tensorflow benchmark for performing scalable rotation detection on various datasets.
no code implementations • IEEE Transactions on Services Computing 2021 • Xin Luo, Yue Zhou, ZhiGang Liu, Lun Hu, Mengchu Zhou
A non-negative latent factor (NLF) model with a single latent factor-dependent, non-negative and multiplicative update (SLF-NMU) algorithm is frequently adopted to extract useful knowledge from non-negative data represented by high-dimensional and sparse (HiDS) matrices arising from various service applications.
no code implementations • 30 Mar 2021 • Viktorija Dudjak, Diana Neves, Tarek Alskaif, Shafi Khadem, Alejandro Pena-Bello, Pietro Saggese, Benjamin Bowler, Merlinda Andoni, Marina Bertolini, Yue Zhou, Blanche Lormeteau, Mustafa A. Mustafa, Yingjie Wang, Christina Francis, Fairouz Zobiri, David Parra, Antonios Papaemmanouil
In recent years extensive research has been conducted on the development of different models that enable energy trading between prosumers and consumers due to expected high integration of distributed energy resources.
no code implementations • 16 Feb 2021 • Giovanni Longobardi, Giuseppe Marino, Rocco Trombetti, Yue Zhou
In this paper, we provide a large family of new maximum scattered linear sets over $\mathrm{PG}(1, q^n)$ for any even $n\geq 6$ and odd $q$.
Combinatorics 94B05, 11T06, 15A04
3 code implementations • CVPR 2021 • Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan
Rotation detection serves as a fundamental building block in many visual applications involving aerial image, scene text, and face etc.
Ranked #35 on
Object Detection In Aerial Images
on DOTA
(using extra training data)
1 code implementation • 17 Aug 2020 • Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, WenGuang Chen
The de facto HPC benchmark LINPACK can not reflect AI computing power and I/O performance without representative workload.
no code implementations • 17 Aug 2020 • Yue Zhou, Kerstin Voigt
We adopt BERT with multitask learning which additionally predicts the worthiness of the news and propose a metric called Polarity-Over-Time to extract the word polarity among different event periods.
no code implementations • 21 Jul 2020 • Tianwen Zhang, Xiaoling Zhang, Jun Shi, Shunjun Wei, Jianguo Wang, Jianwei Li, Hao Su, Yue Zhou
Huge imbalance of different scenes' sample numbers seriously reduces Synthetic Aperture Radar (SAR) ship detection accuracy.
no code implementations • 26 May 2020 • Xinyue Cui, Zhaoyu Xu, Yue Zhou
In this essay, we have comprehensively evaluated the feasibility and suitability of adopting the Machine Learning Models on the forecast of corporation fundamentals (i. e. the earnings), where the prediction results of our method have been thoroughly compared with both analysts' consensus estimation and traditional statistical models.
no code implementations • 8 Apr 2020 • Yue Zhou, Yan Zhang, JingTao Yao
Moreover, the vagueness of satire and news parody determines that a news tweet can hardly be classified with a binary decision, that is, satirical or legitimate.