Search Results for author: YuHan Wang

Found 33 papers, 12 papers with code

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

no code implementations2 Apr 2025 Zijun Wang, Haoqin Tu, YuHan Wang, Juncheng Wu, Jieru Mei, Brian R. Bartoldson, Bhavya Kailkhura, Cihang Xie

This paper introduces STAR-1, a high-quality, just-1k-scale safety dataset specifically designed for large reasoning models (LRMs) like DeepSeek-R1.

Diversity Safety Alignment

A Data Balancing and Ensemble Learning Approach for Credit Card Fraud Detection

no code implementations27 Mar 2025 YuHan Wang

This research introduces an innovative method for identifying credit card fraud by combining the SMOTE-KMEANS technique with an ensemble machine learning model.

Ensemble Learning Fraud Detection

Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model

no code implementations26 Mar 2025 YuHan Wang, Suzhi Bi, Ying-Jun Angela Zhang, Xiaojun Yuan

The distortion-perception (DP) tradeoff reveals a fundamental conflict between distortion metrics (e. g., MSE and PSNR) and perceptual quality.

Denoising

Causal invariant geographic network representations with feature and structural distribution shifts

no code implementations25 Mar 2025 YuHan Wang, Silu He, Qinyao Luo, Hongyuan Yuan, Ling Zhao, Jiawei Zhu, Haifeng Li

We propose a feature-structure mixed invariant representation learning (FSM-IRL) model that accounts for both feature distribution shifts and structural distribution shifts.

Hoi2Anomaly: An Explainable Anomaly Detection Approach Guided by Human-Object Interaction

no code implementations13 Mar 2025 YuHan Wang, Cheng Liu, Daou Zhang, Weichao Wu

This deficiency often leads to the detection of anomalous entities or actions that are susceptible to machine illusions and lack sufficient explanation.

Anomaly Detection Human-Object Interaction Detection

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

1 code implementation11 Mar 2025 YuHan Wang, Fangzhou Hong, Shuai Yang, Liming Jiang, Wayne Wu, Chen Change Loy

In this paper, we explore human multiview diffusion models at the megapixel level and introduce a solution called mesh attention to enable training at 1024x1024 resolution.

3D Generation Image to 3D

Retrieval Models Aren't Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models

no code implementations3 Mar 2025 Zhengliang Shi, YuHan Wang, Lingyong Yan, Pengjie Ren, Shuaiqiang Wang, Dawei Yin, Zhaochun Ren

Tool learning aims to augment large language models (LLMs) with diverse tools, enabling them to act as agents for solving practical tasks.

Benchmarking Information Retrieval +1

IMM-MOT: A Novel 3D Multi-object Tracking Framework with Interacting Multiple Model Filter

no code implementations13 Feb 2025 Xiaohong Liu, Xulong Zhao, Gang Liu, Zili Wu, Tao Wang, Lei Meng, YuHan Wang

3D Multi-Object Tracking (MOT) provides the trajectories of surrounding objects, assisting robots or vehicles in smarter path planning and obstacle avoidance.

3D Multi-Object Tracking

A Deep Learning Framework Integrating CNN and BiLSTM for Financial Systemic Risk Analysis and Prediction

no code implementations7 Feb 2025 Yu Cheng, Zhen Xu, Yuan Chen, YuHan Wang, Zhenghao Lin, Jinsong Liu

This study proposes a deep learning model based on the combination of convolutional neural network (CNN) and bidirectional long short-term memory network (BiLSTM) for discriminant analysis of financial systemic risk.

Time Series

Simultaneous Beamforming and Anti-Jamming With Intelligent Omni-Surfaces

no code implementations4 Feb 2025 YuHan Wang, Shuhao Zeng, Qingyu Liu, Boya Di, Hongliang Zhang

In this paper, we consider an IOS-aided multi-user anti-jamming communication system, aiming to improve desired signals and nullify jamming by optimizing IOS phase shifts and transmit beamforming.

Leveraging Convolutional Neural Network-Transformer Synergy for Predictive Modeling in Risk-Based Applications

no code implementations24 Dec 2024 YuHan Wang, Zhen Xu, Yue Yao, Jinsong Liu, Jiating Lin

To this end, this paper proposes a deep learning model based on the combination of convolutional neural networks (CNN) and Transformer for credit user default prediction.

Prediction

RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding

1 code implementation18 Jun 2024 Linrui Xu, Ling Zhao, Wang Guo, Qiujun Li, Kewang Long, Kaiqi Zou, YuHan Wang, Haifeng Li

Under the new LaGD paradigm, the old datasets, which have led to advances in RSI intelligence understanding in the last decade, are no longer suitable for fire-new tasks.

Attribute Instruction Following +3

Unveiling Incomplete Modality Brain Tumor Segmentation: Leveraging Masked Predicted Auto-Encoder and Divergence Learning

no code implementations12 Jun 2024 Zhongao Sun, Jiameng Li, YuHan Wang, Jiarong Cheng, Qing Zhou, Chun Li

Brain tumor segmentation remains a significant challenge, particularly in the context of multi-modal magnetic resonance imaging (MRI) where missing modality images are common in clinical settings, leading to reduced segmentation accuracy.

Brain Tumor Segmentation Knowledge Distillation +2

G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

1 code implementation13 May 2024 Zeyu Wang, Yuanchun Shi, Yuntao Wang, Yuchen Yao, Kun Yan, YuHan Wang, Lei Ji, Xuhai Xu, Chun Yu

Modern information querying systems are progressively incorporating multimodal inputs like vision and audio.

Natural Language Queries

Lasso Ridge based XGBoost and Deep_LSTM Help Tennis Players Perform better

no code implementations11 May 2024 Wankang Zhai, YuHan Wang

Understanding the dynamics of momentum and game fluctuation in tennis matches is cru-cial for predicting match outcomes and enhancing player performance.

Sports Analytics

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

3 code implementations16 Apr 2024 Bin Ren, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, ZhengJun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Barsoum Emad, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, YuHan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Qian Wang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi

In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking.

Image Super-Resolution

LSTTN: A Long-Short Term Transformer-based Spatio-temporal Neural Network for Traffic Flow Forecasting

1 code implementation25 Mar 2024 Qinyao Luo, Silu He, Xing Han, YuHan Wang, Haifeng Li

Accurate traffic forecasting is a fundamental problem in intelligent transportation systems and learning long-range traffic representations with key information through spatiotemporal graph neural networks (STGNNs) is a basic assumption of current traffic flow prediction models.

Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model

1 code implementation13 Mar 2024 Ruibin Zhang, Donglai Xue, YuHan Wang, Ruixu Geng, Fei Gao

Millimeter wave (mmWave) radars have attracted significant attention from both academia and industry due to their capability to operate in extreme weather conditions.

Autonomous Navigation

Symbol as Points: Panoptic Symbol Spotting via Point-based Representation

1 code implementation19 Jan 2024 Wenlong Liu, Tianyu Yang, YuHan Wang, QiZhi Yu, Lei Zhang

Finally, we propose a KNN interpolation mechanism for the mask attention module of the spotting head to better handle primitive mask downsampling, which is primitive-level in contrast to pixel-level for the image.

Point Cloud Segmentation Vector Graphics

Neural Video Fields Editing

no code implementations12 Dec 2023 Shuzhou Yang, Chong Mou, Jiwen Yu, YuHan Wang, Xiandong Meng, Jian Zhang

Specifically, we construct a neural video field, powered by tri-plane and sparse grid, to enable encoding long videos with hundreds of frames in a memory-efficient manner.

Video Editing

MindShift: Leveraging Large Language Models for Mental-States-Based Problematic Smartphone Use Intervention

no code implementations28 Sep 2023 Ruolan Wu, Chun Yu, Xiaole Pan, Yujia Liu, Ningning Zhang, Yue Fu, YuHan Wang, Zhi Zheng, Li Chen, Qiaolei Jiang, Xuhai Xu, Yuanchun Shi

We first conducted a Wizard-of-Oz study (N=12) and an interview study (N=10) to summarize the mental states behind problematic smartphone use: boredom, stress, and inertia.

Persuasion Strategies

Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction

1 code implementation20 Aug 2023 Zeyu Han, YuHan Wang, Luping Zhou, Peng Wang, Binyu Yan, Jiliu Zhou, Yan Wang, Dinggang Shen

To obtain high-quality positron emission tomography (PET) scans while reducing radiation exposure to the human body, various approaches have been proposed to reconstruct standard-dose PET (SPET) images from low-dose PET (LPET) images.

DiffLLE: Diffusion-guided Domain Calibration for Unsupervised Low-light Image Enhancement

no code implementations18 Aug 2023 Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, YuHan Wang, Jian Zhang

Specifically, we adopt a naive unsupervised enhancement algorithm to realize preliminary restoration and design two zero-shot plug-and-play modules based on diffusion model to improve generalization and effectiveness.

Denoising Low-Light Image Enhancement

3D RegNet: Deep Learning Model for COVID-19 Diagnosis on Chest CT Image

no code implementations8 Jul 2021 Haibo Qi, YuHan Wang, Xinyu Liu

In this paper, a 3D-RegNet-based neural network is proposed for diagnosing the physical condition of patients with coronavirus (Covid-19) infection.

COVID-19 Diagnosis Diagnostic

HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping

1 code implementation18 Jun 2021 YuHan Wang, Xu Chen, Junwei Zhu, Wenqing Chu, Ying Tai, Chengjie Wang, Jilin Li, Yongjian Wu, Feiyue Huang, Rongrong Ji

In this work, we propose a high fidelity face swapping method, called HifiFace, which can well preserve the face shape of the source face and generate photo-realistic results.

3D Face Reconstruction Decoder +3

In situ Performance of the Low Frequency Arrayfor Advanced ACTPol

no code implementations7 Jan 2021 Yaqiong Li, Jason E. Austermann, James A. Beall, Sarah Marie Bruno, Steve K. Choi, Nicholas F. Cothard, Kevin T. Crowley, Shannon M. Duff, Shuay-Pwu Patty Ho, Joseph E. Golec, Gene C. Hilton, Matthew Hasselfield, Johannes Hubmay, Brian J. Koopman, Marius Lungu, Jeff McMahon, Michael D. Niemack, LymanA. Page, Maria Salatino, Sara M. Simon, Suzanne T. Staggs, Jason R. Stevens, Joel N. Ullom, Eve M. Vavagiakis, YuHan Wang, Edward J. Wollack, Zhilei Xu

The Advanced Atacama Cosmology Telescope Polarimeter (AdvACT) \cite{thornton} is an upgrade for the Atacama Cosmology Telescope using Transition Edge Sensor (TES) detector arrays to measure cosmic microwave background (CMB) temperature and polarization anisotropies in multiple frequencies.

Instrumentation and Methods for Astrophysics

Effective and Sparse Count-Sketch via k-means clustering

no code implementations24 Nov 2020 YuHan Wang, Zijian Lei, Liang Lan

This data-oblivious matrix sketching method could produce a bad sketched matrix which will result in low accuracy for subsequent machine learning tasks (e. g. classification); (2) For highly sparse input data, count-sketch could produce a dense sketched data matrix.

BIG-bench Machine Learning Clustering

CurricularFace: Adaptive Curriculum Learning Loss for Deep Face Recognition

2 code implementations CVPR 2020 Yuge Huang, YuHan Wang, Ying Tai, Xiaoming Liu, Pengcheng Shen, Shaoxin Li, Jilin Li, Feiyue Huang

As an emerging topic in face recognition, designing margin-based loss functions can increase the feature margin between different classes for enhanced discriminability.

Ranked #13 on Face Verification on IJB-C (TAR @ FAR=1e-4 metric)

Face Recognition Face Verification

Cannot find the paper you are looking for? You can Submit a new open access paper.