Search Results for author: YuBo Wang

Found 33 papers, 14 papers with code

C-TLSAN: Content-Enhanced Time-Aware Long- and Short-Term Attention Network for Personalized Recommendation

1 code implementation16 Jun 2025 Siqi Liang, Yudi Zhang, YuBo Wang

Sequential recommender systems aim to model users' evolving preferences by capturing patterns in their historical interactions.

Benchmarking Sequential Recommendation

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

no code implementations3 Jun 2025 YuBo Wang, Ping Nie, Kai Zou, Lijun Wu, Wenhu Chen

Recent studies have shown that even RL on a single problem can unleash these models' reasoning capabilities.

GPU Math +1

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

1 code implementation25 May 2025 Xuyang Liu, Zichen Wen, Shaobo Wang, Junjie Chen, Zhishan Tao, YuBo Wang, Xiangqi Jin, Chang Zou, Yiyu Wang, Chenfei Liao, Xu Zheng, Honggang Chen, Weijia Li, Xuming Hu, Conghui He, Linfeng Zhang

The rapid advancement of large language models (LLMs) and multi-modal LLMs (MLLMs) has historically relied on model-centric scaling through increasing parameter counts from millions to hundreds of billions to drive performance gains.

Position

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

1 code implementation17 Apr 2025 Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, YuFei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, YuTing Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, Qirui Yang, Fangpu Zhang, Yunlong Lin, Sixiang Chen, Guoxi Huang, Ruirui Lin, Yan Zhang, Jingyu Yang, Huanjing Yue, Jiyuan Chen, Qiaosi Yi, Hongjun Wang, Chenxi Xie, Shuai Li, Yuhui Wu, Kaiyi Ma, Jiakui Hu, Juncheng Li, Liwen Pan, Guangwei Gao, Wenjie Li, Zhenyu Jin, Heng Guo, Zhanyu Ma, YuBo Wang, Jinghua Wang, Wangzhi Xing, Anjusree Karnavar, Diqi Chen, Mohammad Aminul Islam, Hao Yang, Ruikun Zhang, Liyuan Pan, Qianhao Luo, XinCao, Han Zhou, Yan Min, Wei Dong, Jun Chen, Taoyi Wu, Weijia Dou, Yu Wang, Shengjie Zhao, Yongcheng Huang, Xingyu Han, Anyan Huang, Hongtao Wu, Hong Wang, Yefeng Zheng, Abhijeet Kumar, Aman Kumar, Marcos V. Conde, Paula Garrido, Daniel Feijoo, Juan C. Benito, Guanglu Dong, Xin Lin, Siyuan Liu, Tianheng Zheng, Jiayu Zhong, Shouyi Wang, Xiangtai Li, Lanqing Guo, Lu Qi, Chao Ren, Shuaibo Wang, Shilong Zhang, Wanyu Zhou, Yunze Wu, Qinzhong Tan, Jieyuan Pei, Zhuoxuan Li, Jiayu Wang, Haoyu Bian, Haoran Sun, Subhajit Paul, Ni Tang, Junhao Huang, Zihan Cheng, Hongyun Zhu, Yuehan Wu, Kaixin Deng, Hang Ouyang, Tianxin Xiao, Fan Yang, Zhizun Luo, Zeyu Xiao, Zhuoyuan Li, Nguyen Pham Hoang Le, An Dinh Thien, Son T. Luu, Kiet Van Nguyen, Ronghua Xu, Xianmin Tian, Weijian Zhou, Jiacheng Zhang, Yuqian Chen, Yihang Duan, Yujie Wu, Suresh Raikwar, Arsh Garg, Kritika, Jianhua Zheng, Xiaoshan Ma, Ruolin Zhao, Yongyu Yang, Yongsheng Liang, Guiming Huang, Qiang Li, Hongbin Zhang, Xiangyu Zheng, A. N. Rajagopalan

This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images.

Raindrop Removal Rain Removal +1

A Survey of Large Language Models in Mental Health Disorder Detection on Social Media

no code implementations3 Apr 2025 Zhuohan Ge, Nicole Hu, Darian Li, YuBo Wang, Shihao Qi, Yuming Xu, Han Shi, Jason Zhang

The detection and intervention of mental health issues represent a critical global research focus, and social media data has been recognized as an important resource for mental health research.

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

no code implementations23 Feb 2025 YuBo Wang, Jianting Tang, Chaohu Liu, Linli Xu

In this paper, we propose a novel method called Parameter Learning Attack (PLA) for tracking the copyright of LVLMs without modifying the original model.

Adversarial Attack Question Answering +1

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

no code implementations20 Feb 2025 M-A-P Team, Xinrun Du, Yifan Yao, Kaijing Ma, Bingli Wang, Tianyu Zheng, King Zhu, Minghao Liu, Yiming Liang, Xiaolong Jin, Zhenlin Wei, Chujie Zheng, Kaixin Deng, Shawn Gavin, Shian Jia, Sichao Jiang, Yiyan Liao, Rui Li, Qinrui Li, Sirun Li, Yizhi Li, Yunwen Li, David Ma, Yuansheng Ni, Haoran Que, Qiyao Wang, Zhoufutu Wen, Siwei Wu, Tyshawn Hsing, Ming Xu, Zhenzhu Yang, Zekun Moore Wang, Junting Zhou, Yuelin Bai, Xingyuan Bu, Chenglin Cai, Liang Chen, Yifan Chen, Chengtuo Cheng, Tianhao Cheng, Keyi Ding, Siming Huang, Yun Huang, Yaoru Li, Yizhe Li, Zhaoqun Li, Tianhao Liang, Chengdong Lin, Hongquan Lin, Yinghao Ma, Tianyang Pang, Zhongyuan Peng, Zifan Peng, Qige Qi, Shi Qiu, Xingwei Qu, Shanghaoran Quan, Yizhou Tan, Zili Wang, Chenqing Wang, Hao Wang, Yiya Wang, YuBo Wang, Jiajun Xu, Kexin Yang, Ruibin Yuan, Yuanhao Yue, Tianyang Zhan, Chun Zhang, Jinyang Zhang, Xiyue Zhang, Xingjian Zhang, Yue Zhang, Yongchi Zhao, Xiangyu Zheng, Chenghua Zhong, Yang Gao, Zhoujun Li, Dayiheng Liu, Qian Liu, Tianyu Liu, Shiwen Ni, Junran Peng, Yujia Qin, Wenbo Su, Guoyin Wang, Shi Wang, Jian Yang, Min Yang, Meng Cao, Xiang Yue, Zhaoxiang Zhang, Wangchunshu Zhou, Jiaheng Liu, Qunshu Lin, Wenhao Huang, Ge Zhang

To address this gap, we present SuperGPQA, a comprehensive benchmark that evaluates graduate-level knowledge and reasoning capabilities across 285 disciplines.

Collaborative Filtering

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

1 code implementation29 Jan 2025 YuBo Wang, Xiang Yue, Wenhu Chen

To validate the effectiveness of CFT, we construct multiple critique datasets (e. g., WebInstruct, MetaMath, NuminaMath), where GPT-4o serves as the teacher to generate critiques in the form of ([query; noisy response], critique).

Instruction Following Math +1

Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification

no code implementations6 Jan 2025 YuBo Wang, Haoyang Li, Fei Teng, Lei Chen

While neural network-based models, such as CNN and BERT, have demonstrated remarkable performance in text classification, their effectiveness heavily relies on abundant labeled training data.

Data Integration Few-Shot Text Classification +3

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

1 code implementation6 Dec 2024 Jarvis Guo, Tuney Zheng, Yuelin Bai, Bo Li, YuBo Wang, King Zhu, Yizhi Li, Graham Neubig, Wenhu Chen, Xiang Yue

To address these challenges, we introduce a scalable and cost-effective method to construct a large-scale multimodal instruction-tuning dataset with rich intermediate rationales designed to elicit CoT reasoning.

Multimodal Reasoning Visual Question Answering

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

1 code implementation14 Oct 2024 Jiacheng Chen, Tianhao Liang, Sherman Siu, Zhengqing Wang, Kai Wang, YuBo Wang, Yuansheng Ni, Wang Zhu, Ziyan Jiang, Bohan Lyu, Dongfu Jiang, Xuan He, YuAn Liu, Hexiang Hu, Xiang Yue, Wenhu Chen

We evaluate a wide variety of frontier vision-language models on MEGA-Bench to understand their capabilities across these dimensions.

Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models

no code implementations9 Oct 2024 YuBo Wang, Chaohu Liu, Yanqiu Qu, Haoyu Cao, Deqiang Jiang, Linli Xu

Large vision-language models (LVLMs) integrate visual information into large language models, showcasing remarkable multi-modal conversational capabilities.

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

2 code implementations4 Sep 2024 Xiang Yue, Tianyu Zheng, Yuansheng Ni, YuBo Wang, Kai Zhang, Shengbang Tong, Yuxuan Sun, Botao Yu, Ge Zhang, Huan Sun, Yu Su, Wenhu Chen, Graham Neubig

This paper introduces MMMU-Pro, a robust version of the Massive Multi-discipline Multimodal Understanding and Reasoning (MMMU) benchmark.

Optical Character Recognition (OCR)

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

no code implementations20 Jun 2024 Junjie Wang, Yin Zhang, Yatai Ji, Yuxiang Zhang, Chunyang Jiang, YuBo Wang, Kang Zhu, Zekun Wang, Tiezhen Wang, Wenhao Huang, Jie Fu, Bei Chen, Qunshu Lin, Minghao Liu, Ge Zhang, Wenhu Chen

Recent advancements in Large Multimodal Models (LMMs) have leveraged extensive multimodal datasets to enhance capabilities in complex knowledge-driven tasks.

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

2 code implementations3 Jun 2024 YuBo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue, Wenhu Chen

In the age of large-scale language models, benchmarks like the Massive Multitask Language Understanding (MMLU) have been pivotal in pushing the boundaries of what AI can achieve in language comprehension and reasoning across diverse domains.

MMLU Multi-task Language Understanding

KGLink: A column type annotation method that combines knowledge graph and pre-trained language model

1 code implementation1 Jun 2024 YuBo Wang, Hao Xin, Lei Chen

By leveraging the strengths of KGLink, we successfully surmount challenges related to type granularity and valuable context issues, establishing it as a robust solution for the semantic annotation of tabular data.

Column Type Annotation Deep Learning +2

UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence

no code implementations CVPR 2024 Ruihai Wu, Haoran Lu, Yiyan Wang, YuBo Wang, Hao Dong

Garment manipulation (e. g., unfolding, folding and hanging clothes) is essential for future robots to accomplish home-assistant tasks, while highly challenging due to the diversity of garment configurations, geometries and deformations.

Diversity

SRGS: Super-Resolution 3D Gaussian Splatting

1 code implementation16 Apr 2024 Xiang Feng, Yongbo He, YuBo Wang, Yan Yang, Wen Li, Yifei Chen, Zhenzhong Kuang, Jiajun Ding, Jianping Fan, Yu Jun

This approach relies on the representation power of Gaussian primitives to provide a high-quality rendering.

3DGS NeRF +2

Simulating Nighttime Visible Satellite Imagery of Tropical Cyclones Using Conditional Generative Adversarial Networks

no code implementations22 Jan 2024 Jinghuai Yao, Puyuan Du, Yucheng Zhao, YuBo Wang

The model was trained and validated using data from the Advanced Himawari Imager (AHI) in the daytime, achieving statistical results of SSIM = 0. 923 and Root Mean Square Error (RMSE) = 0. 0299, which significantly surpasses existing models.

SSIM

ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields

no code implementations19 Dec 2023 Xiang Feng, Yongbo He, YuBo Wang, Chengkai Wang, Zhenzhong Kuang, Jiajun Ding, Feiwei Qin, Jun Yu, Jianping Fan

This framework aims to guide the NeRF model to synthesize high-resolution novel views via single-scene internal learning rather than requiring any external high-resolution training data.

Inverse Rendering NeRF +1

FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration

1 code implementation26 Nov 2023 Zihao Zou, Jiaming Liu, Shirin Shoushtari, YuBo Wang, Weijie Gan, Ulugbek S. Kamilov

Face video restoration (FVR) is a challenging but important problem where one seeks to recover a perceptually realistic face videos from a low-quality input.

Deblurring Image Enhancement +3

Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering (Published in Findings of EMNLP 2024)

1 code implementation5 Sep 2023 YuBo Wang, Xueguang Ma, Wenhu Chen

In this study, we present a system called LLMs Augmented with Medical Textbooks (LLM-AMT) designed to enhance the proficiency of LLMs in specialized domains.

Question Answering Retrieval

YZR-net : Self-supervised Hidden representations Invariant to Transformations for profanity detection

no code implementations22 Nov 2022 Vedant Sandeep Joshi, Sivanagaraja Tatinati, YuBo Wang

Some miscreants use this framework to send profane messages which can have a negative impact on other students as well as the teacher of the class.

Looking For A Match: Self-supervised Clustering For Automatic Doubt Matching In e-learning Platforms

no code implementations20 Aug 2022 Vedant Sandeep Joshi, Sivanagaraja Tatinati, YuBo Wang

Results highlighted that, custom BYOL improves the top-1 matching accuracy by approximately 6\% and 5\% as compared to both BYOL and supervised learning instances, respectively.

A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems

no code implementations24 Sep 2021 Xian Yeow Lee, Soumik Sarkar, YuBo Wang

We conduct further analysis on the impact of both observations and actions: on the observation end, we examine the robustness of graph-based policy on two typical data acquisition errors in power systems, namely sensor communication failure and measurement misalignment.

Deep Reinforcement Learning Reinforcement Learning (RL)

PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems

1 code implementation8 Sep 2021 Ting-Han Fan, Xian Yeow Lee, YuBo Wang

We introduce PowerGym, an open-source reinforcement learning environment for Volt-Var control in power distribution systems.

OpenAI Gym reinforcement-learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.