no code implementations • 4 Apr 2025 • Xianyuan Liu, Jiayang Zhang, Shuo Zhou, Thijs L. van der Plas, Avish Vijayaraghavan, Anastasiia Grishina, Mengdie Zhuang, Daniel Schofield, Christopher Tomlinson, YuHan Wang, Ruizhe Li, Louisa van Zeeland, Sina Tabakhi, Cyndie Demeocq, Xiang Li, Arunav Das, Orlando Timmerman, Thomas Baldwin-McDonald, Jinge Wu, Peizhen Bai, Zahraa Al Sahili, Omnia Alwazzan, Thao N. Do, Mohammod N. I. Suvon, Angeline Wang, Lucia Cipolina-Kun, Luigi A. Moretti, Lucas Farndale, Nitisha Jain, Natalia Efremova, Yan Ge, Marta Varela, Hak-Keung Lam, Oya Celiktutan, Ben R. Evans, Alejandro Coca-Castro, Honghan Wu, Zahraa S. Abdallah, Chen Chen, Valentin Danchev, Nataliya Tkachenko, Lei Lu, Tingting Zhu, Gregory G. Slabaugh, Roger K. Moore, William K. Cheung, Peter H. Charlton, Haiping Lu
Multimodal artificial intelligence (AI) integrates diverse types of data via machine learning to improve understanding, prediction, and decision-making across disciplines such as healthcare, science, and engineering.
no code implementations • 2 Apr 2025 • Lingzhi Shen, Yunfei Long, Xiaohao Cai, Guanming Chen, YuHan Wang, Imran Razzak, Shoaib Jameel
Graph-based personality detection constructs graph structures from textual data, particularly social media posts.
no code implementations • 2 Apr 2025 • Zijun Wang, Haoqin Tu, YuHan Wang, Juncheng Wu, Jieru Mei, Brian R. Bartoldson, Bhavya Kailkhura, Cihang Xie
This paper introduces STAR-1, a high-quality, just-1k-scale safety dataset specifically designed for large reasoning models (LRMs) like DeepSeek-R1.
no code implementations • 27 Mar 2025 • YuHan Wang
This research introduces an innovative method for identifying credit card fraud by combining the SMOTE-KMEANS technique with an ensemble machine learning model.
no code implementations • 26 Mar 2025 • YuHan Wang, Suzhi Bi, Ying-Jun Angela Zhang, Xiaojun Yuan
The distortion-perception (DP) tradeoff reveals a fundamental conflict between distortion metrics (e.g., MSE and PSNR) and perceptual quality.
no code implementations • 25 Mar 2025 • YuHan Wang, Silu He, Qinyao Luo, Hongyuan Yuan, Ling Zhao, Jiawei Zhu, Haifeng Li
We propose a feature-structure mixed invariant representation learning (FSM-IRL) model that accounts for both feature distribution shifts and structural distribution shifts.
no code implementations • 13 Mar 2025 • YuHan Wang, Cheng Liu, Daou Zhang, Weichao Wu
This deficiency often leads to the detection of anomalous entities or actions that are susceptible to machine illusions and lack sufficient explanation.
1 code implementation • 11 Mar 2025 • YuHan Wang, Fangzhou Hong, Shuai Yang, Liming Jiang, Wayne Wu, Chen Change Loy
In this paper, we explore human multiview diffusion models at the megapixel level and introduce a solution called mesh attention to enable training at 1024x1024 resolution.
no code implementations • 3 Mar 2025 • Zhengliang Shi, YuHan Wang, Lingyong Yan, Pengjie Ren, Shuaiqiang Wang, Dawei Yin, Zhaochun Ren
Tool learning aims to augment large language models (LLMs) with diverse tools, enabling them to act as agents for solving practical tasks.
no code implementations • 13 Feb 2025 • Xiaohong Liu, Xulong Zhao, Gang Liu, Zili Wu, Tao Wang, Lei Meng, YuHan Wang
3D Multi-Object Tracking (MOT) provides the trajectories of surrounding objects, assisting robots or vehicles in smarter path planning and obstacle avoidance.
no code implementations • 7 Feb 2025 • Yu Cheng, Zhen Xu, Yuan Chen, YuHan Wang, Zhenghao Lin, Jinsong Liu
This study proposes a deep learning model based on the combination of convolutional neural network (CNN) and bidirectional long short-term memory network (BiLSTM) for discriminant analysis of financial systemic risk.
no code implementations • 4 Feb 2025 • YuHan Wang, Shuhao Zeng, Qingyu Liu, Boya Di, Hongliang Zhang
In this paper, we consider an IOS-aided multi-user anti-jamming communication system, aiming to improve desired signals and nullify jamming by optimizing IOS phase shifts and transmit beamforming.
no code implementations • 24 Dec 2024 • YuHan Wang, Zhen Xu, Yue Yao, Jinsong Liu, Jiating Lin
To this end, this paper proposes a deep learning model based on the combination of convolutional neural networks (CNN) and Transformer for credit user default prediction.
1 code implementation • 18 Jun 2024 • Linrui Xu, Ling Zhao, Wang Guo, Qiujun Li, Kewang Long, Kaiqi Zou, YuHan Wang, Haifeng Li
Under the new LaGD paradigm, the old datasets, which have led to advances in RSI intelligence understanding in the last decade, are no longer suitable for fire-new tasks.
no code implementations • 12 Jun 2024 • Zhongao Sun, Jiameng Li, YuHan Wang, Jiarong Cheng, Qing Zhou, Chun Li
Brain tumor segmentation remains a significant challenge, particularly in the context of multi-modal magnetic resonance imaging (MRI) where missing modality images are common in clinical settings, leading to reduced segmentation accuracy.
1 code implementation • 13 May 2024 • Zeyu Wang, Yuanchun Shi, Yuntao Wang, Yuchen Yao, Kun Yan, YuHan Wang, Lei Ji, Xuhai Xu, Chun Yu
Modern information querying systems are progressively incorporating multimodal inputs like vision and audio.
no code implementations • 11 May 2024 • Wankang Zhai, YuHan Wang
Understanding the dynamics of momentum and game fluctuation in tennis matches is crucial for predicting match outcomes and enhancing player performance.
3 code implementations • 16 Apr 2024 • Bin Ren, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, ZhengJun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Barsoum Emad, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, YuHan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Qian Wang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi
In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking.
1 code implementation • 25 Mar 2024 • Qinyao Luo, Silu He, Xing Han, YuHan Wang, Haifeng Li
Accurate traffic forecasting is a fundamental problem in intelligent transportation systems, and learning long-range traffic representations with key information through spatiotemporal graph neural networks (STGNNs) is a basic assumption of current traffic flow prediction models.
1 code implementation • 13 Mar 2024 • Ruibin Zhang, Donglai Xue, YuHan Wang, Ruixu Geng, Fei Gao
Millimeter wave (mmWave) radars have attracted significant attention from both academia and industry due to their capability to operate in extreme weather conditions.
no code implementations • 11 Mar 2024 • Bambang Parmanto, Bayu Aryoyudanta, Wilbert Soekinto, I Made Agus Setiawan, YuHan Wang, Haomin Hu, Andi Saptono, Yong K. Choi
The RAG framework improved the performance of all FMs used in this study across all measures.
1 code implementation • 19 Jan 2024 • Wenlong Liu, Tianyu Yang, YuHan Wang, QiZhi Yu, Lei Zhang
Finally, we propose a KNN interpolation mechanism for the mask attention module of the spotting head to better handle primitive mask downsampling, which is primitive-level in contrast to pixel-level for the image.
no code implementations • 12 Dec 2023 • Shuzhou Yang, Chong Mou, Jiwen Yu, YuHan Wang, Xiandong Meng, Jian Zhang
Specifically, we construct a neural video field, powered by tri-plane and sparse grid, to enable encoding long videos with hundreds of frames in a memory-efficient manner.
no code implementations • 28 Sep 2023 • Ruolan Wu, Chun Yu, Xiaole Pan, Yujia Liu, Ningning Zhang, Yue Fu, YuHan Wang, Zhi Zheng, Li Chen, Qiaolei Jiang, Xuhai Xu, Yuanchun Shi
We first conducted a Wizard-of-Oz study (N=12) and an interview study (N=10) to summarize the mental states behind problematic smartphone use: boredom, stress, and inertia.
1 code implementation • ICCV 2023 • YuHan Wang, Liming Jiang, Chen Change Loy
In this paper, we introduce a novel motion generator design that uses a learning-based inversion network for GAN.
1 code implementation • 20 Aug 2023 • Zeyu Han, YuHan Wang, Luping Zhou, Peng Wang, Binyu Yan, Jiliu Zhou, Yan Wang, Dinggang Shen
To obtain high-quality positron emission tomography (PET) scans while reducing radiation exposure to the human body, various approaches have been proposed to reconstruct standard-dose PET (SPET) images from low-dose PET (LPET) images.
no code implementations • 18 Aug 2023 • Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, YuHan Wang, Jian Zhang
Specifically, we adopt a naive unsupervised enhancement algorithm to realize preliminary restoration and design two zero-shot plug-and-play modules based on diffusion model to improve generalization and effectiveness.
no code implementations • 8 Jul 2021 • Haibo Qi, YuHan Wang, Xinyu Liu
In this paper, a 3D-RegNet-based neural network is proposed for diagnosing the physical condition of patients with coronavirus (Covid-19) infection.
1 code implementation • 18 Jun 2021 • YuHan Wang, Xu Chen, Junwei Zhu, Wenqing Chu, Ying Tai, Chengjie Wang, Jilin Li, Yongjian Wu, Feiyue Huang, Rongrong Ji
In this work, we propose a high fidelity face swapping method, called HifiFace, which can well preserve the face shape of the source face and generate photo-realistic results.
Ranked #7 on Face Swapping on FaceForensics++
1 code implementation • Neurocomputing 2021 • Yunheng Li, Zhuben Dong, Kaiyuan Liu, Lin Feng, Lianyu Hu, Jie Zhu, Li Xu, YuHan Wang, Shenglan Liu
Due to boundary ambiguity and over-segmentation issues, identifying all the frames in long untrimmed videos is still challenging.
Ranked #14 on Action Segmentation on GTEA
no code implementations • 7 Jan 2021 • Yaqiong Li, Jason E. Austermann, James A. Beall, Sarah Marie Bruno, Steve K. Choi, Nicholas F. Cothard, Kevin T. Crowley, Shannon M. Duff, Shuay-Pwu Patty Ho, Joseph E. Golec, Gene C. Hilton, Matthew Hasselfield, Johannes Hubmay, Brian J. Koopman, Marius Lungu, Jeff McMahon, Michael D. Niemack, Lyman A. Page, Maria Salatino, Sara M. Simon, Suzanne T. Staggs, Jason R. Stevens, Joel N. Ullom, Eve M. Vavagiakis, YuHan Wang, Edward J. Wollack, Zhilei Xu
The Advanced Atacama Cosmology Telescope Polarimeter (AdvACT) \cite{thornton} is an upgrade for the Atacama Cosmology Telescope using Transition Edge Sensor (TES) detector arrays to measure cosmic microwave background (CMB) temperature and polarization anisotropies in multiple frequencies.
Instrumentation and Methods for Astrophysics
no code implementations • 24 Nov 2020 • YuHan Wang, Zijian Lei, Liang Lan
This data-oblivious matrix sketching method could produce a bad sketched matrix which will result in low accuracy for subsequent machine learning tasks (e.g., classification); (2) For highly sparse input data, count-sketch could produce a dense sketched data matrix.
2 code implementations • CVPR 2020 • Yuge Huang, YuHan Wang, Ying Tai, Xiaoming Liu, Pengcheng Shen, Shaoxin Li, Jilin Li, Feiyue Huang
As an emerging topic in face recognition, designing margin-based loss functions can increase the feature margin between different classes for enhanced discriminability.
Ranked #13 on Face Verification on IJB-C (TAR @ FAR=1e-4 metric)