Search Results for author: Yue Zhou

Found 60 papers, 35 papers with code

A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy

1 code implementation11 Jun 2025 Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Chunyu Miao, Dongyuan Li, Aiwei Liu, Yue Zhou, Yankai Chen, Weizhi Zhang, Yangning Li, Liancheng Fang, Renhe Jiang, Philip S. Yu

This paper argues that progress in AI should not be measured by how independent systems become, but by how well they can work with humans.

Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes

no code implementations26 May 2025 Kaiqing Lin, Zhiyuan Yan, Ke-Yue Zhang, Li Hao, Yue Zhou, Yuzhen Lin, Weixiang Li, Taiping Yao, Shouhong Ding, Bin Li

Finally, we introduce user-specific customization, where we model the unique characteristics of the target face identity and perform semantic reasoning via MLLM to enable personalized and explainable deepfake detection.

DeepFake Detection Face Generation +3

Veracity Bias and Beyond: Uncovering LLMs' Hidden Beliefs in Problem-Solving Reasoning

no code implementations22 May 2025 Yue Zhou, Barbara Di Eugenio

Despite LLMs' explicit alignment against demographic stereotypes, they have been shown to exhibit biases under various social contexts.

Attribute Math

AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection

1 code implementation15 May 2025 Bin-Bin Gao, Yue Zhou, Jiangtao Yan, Yuezhi Cai, Weixi Zhang, Meng Wang, Jun Liu, Yong liu, Lei Wang, Chengjie Wang

Universal visual anomaly detection aims to identify anomalies from novel or unseen vision domains without additional fine-tuning, which is critical in open scenarios.

Anomaly Detection

Exploring Temporal Dynamics in Event-based Eye Tracker

1 code implementation31 Mar 2025 Hongwei Ren, Xiaopeng Lin, Hongxiang Huang, Yue Zhou, Bojun Cheng

Eye-tracking is a vital technology for human-computer interaction, especially in wearable devices such as AR, VR, and XR.

Mamba

VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models

no code implementations8 Mar 2025 Xinan He, Yue Zhou, Bing Fan, Bin Li, Guopu Zhu, Feng Ding

In this work, we integrate Multimodal Large Language Models (MLLMs) within DM-based face forensics, and propose a fine-grained analysis triad framework called VLForgery, that can 1) predict falsified facial images; 2) locate the falsified face regions subjected to partial synthesis; and 3) attribute the synthesis with specific generators.

Attribute DeepFake Detection +2

TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency

1 code implementation26 Feb 2025 Henry Peng Zou, Zhengyao Gu, Yue Zhou, Yankai Chen, Weizhi Zhang, Liancheng Fang, Yibo Wang, Yangning Li, Kay Liu, Philip S. Yu

Test-time computing approaches, which leverage additional computational resources during inference, have been proven effective in enhancing large language model performance.

intent-classification Intent Classification +3

Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation

1 code implementation21 Feb 2025 Yue Zhou, Yi Chang, Yuan Wu

In conclusion, M$^3$ is a simple yet effective model merging method that significantly enhances the performance of the merged model by randomly generating contribution ratios for two fine-tuned LLMs.

Adversarial Robustness Data Augmentation +1

Frequency-aware Event Cloud Network

no code implementations30 Dec 2024 Hongwei Ren, Fei Ma, Xiaopeng Lin, Yuetong Fang, Hongxiang Huang, Yulong Huang, Yue Zhou, Haotian Fu, ZiYi Yang, Fei Richard Yu, Bojun Cheng

Event cameras are biologically inspired sensors that emit events asynchronously with remarkable temporal resolution, garnering significant attention from both industry and academia.

Action Recognition Pose Estimation

Event-based Motion Deblurring via Multi-Temporal Granularity Fusion

no code implementations16 Dec 2024 Xiaopeng Lin, Hongwei Ren, Yulong Huang, Zunchang Liu, Yue Zhou, Haotian Fu, Biao Pan, Bojun Cheng

Effectively utilizing the high-temporal-resolution event data is crucial for extracting precise motion information and enhancing deblurring performance.

Deblurring Image Deblurring

Representation Purification for End-to-End Speech Translation

no code implementations5 Dec 2024 Chengwei Zhang, Yue Zhou, Rui Zhao, Yidong Chen, Xiaodong Shi

Speech-to-text translation (ST) is a cross-modal task that involves converting spoken language into text in a different language.

Machine Translation Rhythm +4

Unveiling Performance Challenges of Large Language Models in Low-Resource Healthcare: A Demographic Fairness Perspective

no code implementations30 Nov 2024 Yue Zhou, Barbara Di Eugenio, Lu Cheng

This paper studies the performance of large language models (LLMs), particularly regarding demographic fairness, in solving real-world healthcare tasks.

Fairness

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding

1 code implementation16 Nov 2024 Yue Zhou, Mengcheng Lan, Xiang Li, Litong Feng, Yiping Ke, Xue Jiang, Qingyun Li, Xue Yang, Wayne Zhang

Remote sensing (RS) visual grounding aims to use natural language expression to locate specific objects (in the form of the bounding box or segmentation mask) in RS images, enhancing human interaction with intelligent RS interpretation systems.

Language Modeling Language Modelling +3

Text4Seg: Reimagining Image Segmentation as Text Generation

1 code implementation13 Oct 2024 Mengcheng Lan, Chaofeng Chen, Yue Zhou, Jiaxing Xu, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang

Multimodal Large Language Models (MLLMs) have shown exceptional capabilities in vision-language tasks; however, effectively integrating image segmentation into these models remains a significant challenge.

Image Segmentation Referring Expression +4

Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges

1 code implementation16 Sep 2024 Vinay Samuel, Yue Zhou, Henry Peng Zou

However, these approaches are often validated with traditional benchmarks and early-stage LLMs, leaving uncertainty about their effectiveness when evaluating state-of-the-art LLMs on the contamination of more challenging benchmarks.

Memorization

PersonaGym: Evaluating Persona Agents and LLMs

1 code implementation25 Jul 2024 Vinay Samuel, Henry Peng Zou, Yue Zhou, Shreyas Chaudhari, Ashwin Kalyan, Tanmay Rajpurohit, Ameet Deshpande, Karthik Narasimhan, Vishvak Murahari

Persona agents, which are LLM agents conditioned to act according to an assigned persona, enable contextually rich and user aligned interactions across domains like education and healthcare.

Towards Vision-Language Geo-Foundation Model: A Survey

1 code implementation13 Jun 2024 Yue Zhou, Litong Feng, Yiping Ke, Xue Jiang, Junchi Yan, Xue Yang, Wayne Zhang

Vision-Language Foundation Models (VLFMs) have made remarkable progress on various multimodal tasks, such as image captioning, image-text retrieval, visual question answering, and visual grounding.

Earth Observation Image Captioning +7

A3:Ambiguous Aberrations Captured via Astray-Learning for Facial Forgery Semantic Sublimation

no code implementations24 May 2024 Xinan He, Yue Zhou, Wei Ye, Feng Ding

The primary objective of the proposed method is to blend hybrid forgery semantics derived from high-frequency components into authentic imagery, named aberrations.

DeepFake Detection Face Swapping +1

Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba

1 code implementation9 May 2024 Hongwei Ren, Yue Zhou, Jiadong Zhu, Haotian Fu, Yulong Huang, Xiaopeng Lin, Yuetong Fang, Fei Ma, Hao Yu, Bojun Cheng

In contrast, Point Cloud is a popular representation for processing 3-dimensional data and serves as an alternative method to exploit local and global spatial features.

Action Recognition Mamba +1

Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation

1 code implementation16 Apr 2024 Yue Zhou, Barbara Di Eugenio, Brian Ziebart, Lisa Sharp, Bing Liu, Nikolaos Agadakos

Health coaching helps patients achieve personalized and lifestyle-related goals, effectively managing chronic conditions and alleviating mental health issues.

Dialogue Generation

Towards Enhancing Health Coaching Dialogue in Low-Resource Settings

1 code implementation COLING 2022 Yue Zhou, Barbara Di Eugenio, Brian Ziebart, Lisa Sharp, Bing Liu, Ben Gerber, Nikolaos Agadakos, Shweta Yadav

In this paper, we propose to build a dialogue system that converses with the patients, helps them create and accomplish specific goals, and can address their emotions with empathy.

Empathetic Response Generation Response Generation

A Simple and Effective Point-based Network for Event Camera 6-DOFs Pose Relocalization

no code implementations CVPR 2024 Hongwei Ren, Jiadong Zhu, Yue Zhou, Haotian Fu, Yulong Huang, Bojun Cheng

These cameras implicitly capture movement and depth information in events, making them appealing sensors for Camera Pose Relocalization (CPR) tasks.

A Survey on Data Augmentation in Large Model Era

1 code implementation27 Jan 2024 Yue Zhou, Chenlu Guo, Xu Wang, Yi Chang, Yuan Wu

Leveraging large models, these data augmentation techniques have outperformed traditional approaches.

Audio Signal Processing Image Augmentation +2

Compact Binary Systems Waveform Generation with Generative Pre-trained Transformer

no code implementations31 Oct 2023 Ruijun Shi, Yue Zhou, Tianyu Zhao, Zhoujian Cao, Zhixiang Ren

Space-based gravitational wave (GW) detection is one of the most anticipated GW detection projects in the next decade, which promises to detect abundant compact binary systems.

Gravitational Wave Detection

DeCrisisMB: Debiased Semi-Supervised Learning for Crisis Tweet Classification via Memory Bank

1 code implementation23 Oct 2023 Henry Peng Zou, Yue Zhou, Weizhi Zhang, Cornelia Caragea

During crisis events, people often use social media platforms such as Twitter to disseminate information about the situation, warnings, advice, and support.

Semi-Supervised Text Classification

CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster Tweet Classification

1 code implementation23 Oct 2023 Henry Peng Zou, Yue Zhou, Cornelia Caragea, Doina Caragea

The shared real-time information about natural disasters on social media platforms like Twitter and Facebook plays a critical role in informing volunteers, emergency managers, and response organizations.

Few-Shot Learning

SpikePoint: An Efficient Point-based Spiking Neural Network for Event Cameras Action Recognition

no code implementations11 Oct 2023 Hongwei Ren, Yue Zhou, Yulong Huang, Haotian Fu, Xiaopeng Lin, Jie Song, Bojun Cheng

Moreover, it also achieves SOTA performance across all methods on three datasets, utilizing approximately 0. 3\% of the parameters and 0. 5\% of power consumption employed by artificial neural networks (ANNs).

Action Recognition

Dilated convolutional neural network for detecting extreme-mass-ratio inspirals

no code implementations31 Aug 2023 Tianyu Zhao, Yue Zhou, Ruijun Shi, Zhoujian Cao, Zhixiang Ren

The detection of Extreme Mass Ratio Inspirals (EMRIs) is intricate due to their complex waveforms, extended duration, and low signal-to-noise ratio (SNR), making them more challenging to be identified compared to compact binary coalescences.

TTPOINT: A Tensorized Point Cloud Network for Lightweight Action Recognition with Event Cameras

no code implementations19 Aug 2023 Hongwei Ren, Yue Zhou, Haotian Fu, Yulong Huang, Renjing Xu, Bojun Cheng

In the experiment, TTPOINT emerged as the SOTA method on three datasets while also attaining SOTA among point cloud methods on all five datasets.

Action Recognition

DefCor-Net: Physics-Aware Ultrasound Deformation Correction

1 code implementation7 Aug 2023 Zhongliang Jiang, Yue Zhou, Dongliang Cao, Nassir Navab

The recovery of morphologically accurate anatomical images from deformed ones is challenging in ultrasound (US) image acquisition, but crucial to accurate and consistent diagnosis, particularly in the emerging field of computer-assisted diagnosis.

Anatomy

RTMDet: An Empirical Study of Designing Real-Time Object Detectors

14 code implementations14 Dec 2022 Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi Liu, Shilong Zhang, Kai Chen

In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection.

Object object-detection +7

H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection

3 code implementations13 Oct 2022 Xue Yang, Gefan Zhang, Wentong Li, Xuehui Wang, Yue Zhou, Junchi Yan

Oriented object detection emerges in many applications from aerial images to autonomous driving, while many existing detection benchmarks are annotated with horizontal bounding box only which is also less costive than fine-grained rotated box, leading to a gap between the readily available training corpus and the rising demand for oriented object detection.

All Autonomous Driving +7

Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization

1 code implementation22 Sep 2022 Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin Tang, Tao He, Junchi Yan

Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects.

regression

Network Pruning via Feature Shift Minimization

1 code implementation6 Jul 2022 Yuanzhi Duan, Yue Zhou, Peng He, Qiang Liu, Shukai Duan, Xiaofang Hu

In this paper, we propose a novel Feature Shift Minimization (FSM) method to compress CNN models, which evaluates the feature shift by converging the information of both features and filters.

Network Pruning

MMRotate: A Rotated Object Detection Benchmark using PyTorch

1 code implementation28 Apr 2022 Yue Zhou, Xue Yang, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen

We present an open-source toolbox, named MMRotate, which provides a coherent algorithm framework of training, inferring, and evaluation for the popular rotated object detection algorithm based on deep learning.

Object object-detection +1

The KFIoU Loss for Rotated Object Detection

3 code implementations29 Jan 2022 Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian

This is in contrast to recent Gaussian modeling based rotation detectors e. g. GWD loss and KLD loss that involve a human-specified distribution distance metric which require additional hyperparameter tuning that vary across datasets and detectors.

Object object-detection +1

Multi-Robot Collaborative Perception with Graph Neural Networks

no code implementations5 Jan 2022 Yang Zhou, Jiuhong Xiao, Yue Zhou, Giuseppe Loianno

Multi-robot systems such as swarms of aerial robots are naturally suited to offer additional flexibility, resilience, and robustness in several tasks compared to a single robot by enabling cooperation among the agents.

Decision Making Graph Neural Network +2

Network Compression via Central Filter

1 code implementation10 Dec 2021 Yuanzhi Duan, Xiaofang Hu, Yue Zhou, Qiang Liu, Shukai Duan

In this paper, by exploring the similarities between feature maps, we propose a novel filter pruning method, Central Filter (CF), which suggests that a filter is approximately equal to a set of other filters after appropriate adjustments.

Network Pruning

AlphaRotate: A Rotation Detection Benchmark using TensorFlow

1 code implementation12 Nov 2021 Xue Yang, Yue Zhou, Junchi Yan

AlphaRotate is an open-source Tensorflow benchmark for performing scalable rotation detection on various datasets.

Generalized Nesterov's Acceleration-incorporated Non-negative and Adaptive Latent Factor Analysis

no code implementations IEEE Transactions on Services Computing 2021 Xin Luo, Yue Zhou, ZhiGang Liu, Lun Hu, Mengchu Zhou

A non-negative latent factor (NLF) model with a single latent factor-dependent, non-negative and multiplicative update (SLF-NMU) algorithm is frequently adopted to extract useful knowledge from non-negative data represented by high-dimensional and sparse (HiDS) matrices arising from various service applications.

Computational Efficiency

Impact of Local Energy Markets on the Distribution Systems: A Comprehensive Review

no code implementations30 Mar 2021 Viktorija Dudjak, Diana Neves, Tarek Alskaif, Shafi Khadem, Alejandro Pena-Bello, Pietro Saggese, Benjamin Bowler, Merlinda Andoni, Marina Bertolini, Yue Zhou, Blanche Lormeteau, Mustafa A. Mustafa, Yingjie Wang, Christina Francis, Fairouz Zobiri, David Parra, Antonios Papaemmanouil

In recent years extensive research has been conducted on the development of different models that enable energy trading between prosumers and consumers due to expected high integration of distributed energy resources.

energy trading

A large family of maximum scattered linear sets of $\mathrm{PG}(1,q^n)$ and their associated MRD codes

no code implementations16 Feb 2021 Giovanni Longobardi, Giuseppe Marino, Rocco Trombetti, Yue Zhou

In this paper, we provide a large family of new maximum scattered linear sets over $\mathrm{PG}(1, q^n)$ for any even $n\geq 6$ and odd $q$.

Combinatorics 94B05, 11T06, 15A04

Dense Label Encoding for Boundary Discontinuity Free Rotation Detection

3 code implementations CVPR 2021 Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan

Rotation detection serves as a fundamental building block in many visual applications involving aerial image, scene text, and face etc.

Ranked #35 on Object Detection In Aerial Images on DOTA (using extra training data)

Classification General Classification +2

AIPerf: Automated machine learning as an AI-HPC benchmark

1 code implementation17 Aug 2020 Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, WenGuang Chen

The de facto HPC benchmark LINPACK can not reflect AI computing power and I/O performance without representative workload.

AutoML Benchmarking +1

Stock Index Prediction with Multi-task Learning and Word Polarity Over Time

no code implementations17 Aug 2020 Yue Zhou, Kerstin Voigt

We adopt BERT with multitask learning which additionally predicts the worthiness of the news and propose a metric called Polarity-Over-Time to extract the word polarity among different event periods.

Multi-Task Learning Stock Prediction

Balance Scene Learning Mechanism for Offshore and Inshore Ship Detection in SAR Images

no code implementations21 Jul 2020 Tianwen Zhang, Xiaoling Zhang, Jun Shi, Shunjun Wei, Jianguo Wang, Jianwei Li, Hao Su, Yue Zhou

Huge imbalance of different scenes' sample numbers seriously reduces Synthetic Aperture Radar (SAR) ship detection accuracy.

SAR Ship Detection

Using Machine Learning to Forecast Future Earnings

no code implementations26 May 2020 Xinyue Cui, Zhaoyu Xu, Yue Zhou

In this essay, we have comprehensively evaluated the feasibility and suitability of adopting the Machine Learning Models on the forecast of corporation fundamentals (i. e. the earnings), where the prediction results of our method have been thoroughly compared with both analysts' consensus estimation and traditional statistical models.

BIG-bench Machine Learning

Satirical News Detection with Semantic Feature Extraction and Game-theoretic Rough Sets

no code implementations8 Apr 2020 Yue Zhou, Yan Zhang, JingTao Yao

Moreover, the vagueness of satire and news parody determines that a news tweet can hardly be classified with a binary decision, that is, satirical or legitimate.

Misinformation

Cannot find the paper you are looking for? You can Submit a new open access paper.