no code implementations • 13 Mar 2025 • Hao He, Ceyuan Yang, Shanchuan Lin, Yinghao Xu, Meng Wei, Liangke Gui, Qi Zhao, Gordon Wetzstein, Lu Jiang, Hongsheng Li
This paper introduces CameraCtrl II, a framework that enables large-scale dynamic scene exploration through a camera-controlled video diffusion model.
no code implementations • 2 Jan 2025 • Jianyi Wang, Zhijie Lin, Meng Wei, Yang Zhao, Ceyuan Yang, Chen Change Loy, Lu Jiang
Video restoration poses non-trivial challenges in maintaining fidelity while recovering temporally consistent details from unknown degradations in the wild.
1 code implementation • 3 Dec 2024 • Zhongnian Li, Meng Wei, Peng Ying, Tongfeng Sun, Xinzheng Xu
Annotating data for sensitive labels (e.g., disease, smoking) poses potential threats to individual privacy in many real-world scenarios.
no code implementations • 3 Dec 2024 • Zhongnian Li, Meng Wei, Peng Ying, Xinzheng Xu
Learning from Multi-Positive and Unlabeled (MPU) data has gradually attracted significant attention from practical applications.
1 code implementation • 26 Nov 2024 • Meng Wei, Zhongnian Li, Peng Ying, Xinzheng Xu
These VLMs leverage a predefined set of categories to construct text prompts for zero-shot reasoning.
no code implementations • 27 Oct 2024 • Meng Wei, Qianyi Wu, Jianmin Zheng, Hamid Rezatofighi, Jianfei Cai
Previous attempts to regularize 3D Gaussian normals often degrade rendering quality due to the fundamental disconnect between normal vectors and the rendering pipeline in 3DGS-based methods.
no code implementations • 11 Oct 2024 • Xiaoyu Yue, Zidong Wang, Zeyu Lu, Shuyang Sun, Meng Wei, Wanli Ouyang, Lei Bai, Luping Zhou
Conventional class-guided diffusion models generally succeed in generating images with correct semantic content, but often struggle with texture details.
Ranked #28 on Image Generation on ImageNet 256x256
1 code implementation • 24 May 2024 • Zhongnian Li, Jinghao Xu, Peng Ying, Meng Wei, Tongfeng Sun, Xinzheng Xu
Weakly supervised learning has recently achieved considerable success in reducing annotation costs and label noise.
1 code implementation • 25 Mar 2024 • Meng Wei, Zhongnian Li, Yong Zhou, Xinzheng Xu
Long-tailed data is prevalent in real-world classification tasks and heavily relies on supervised information, which makes the annotation process exceptionally labor-intensive and time-consuming.
no code implementations • 25 Mar 2024 • Meng Wei, Zhongnian Li, Peng Ying, Yong Zhou, Xinzheng Xu
In this novel labeling setting, each training instance is associated with a determined label (either "Yes" or "No"), which indicates whether the training instance contains the provided class label.
no code implementations • 11 Mar 2024 • Zijian Zhou, Miaojing Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren
Finally, to better reflect the clinically significant and insignificant errors that radiologists would normally assign in the report, we introduce a novel clinical quality reinforcement learning strategy.
1 code implementation • NeurIPS 2023 • Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang
Secondly, part segmentation introduces an open granularity challenge due to the diverse and often ambiguous definitions of parts in the open world.
Open Vocabulary Semantic Segmentation
no code implementations • 3 Oct 2023 • Xiaoyu Yue, Lei Bai, Meng Wei, Jiangmiao Pang, Xihui Liu, Luping Zhou, Wanli Ouyang
Masked AutoEncoder (MAE) has revolutionized the field of self-supervised learning with its simple yet effective masking and reconstruction strategies.
no code implementations • 9 Aug 2023 • Meng Wei, Charlie Budd, Luis C. Garcia-Peraza-Herrera, Reuben Dorent, Miaojing Shi, Tom Vercauteren
Surgical instrument segmentation is recognised as a key enabler to provide advanced surgical assistance and improve computer assisted interventions.
1 code implementation • 22 Jul 2023 • Yuncheng Yang, Meng Wei, Junjun He, Jie Yang, Jin Ye, Yun Gu
To make up for its deficiency when applying transfer learning to medical image segmentation, we propose a new Transferability Estimation (TE) method.
no code implementations • 18 Jul 2023 • Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Roger Zimmermann
While recent video-based methods utilizing video tubelets have shown promising results, we argue that the effective modeling of spatial and temporal context plays a more significant role than the choice between clip tubelets and video tubelets.
no code implementations • 1 Feb 2023 • Meng Wei, Zhongnian Li, Yong Zhou, Qiaoyu Guo, Xinzheng Xu
Annotating multi-class instances is a crucial task in the field of machine learning.
1 code implementation • 14 Oct 2022 • Jin Ye, Haoyu Wang, Ziyan Huang, Zhongying Deng, Yanzhou Su, Can Tu, Qian Wu, Yuncheng Yang, Meng Wei, Jingqi Niu, Junjun He
The combination of PET-based metabolic and CT-based anatomic information can contribute to better tumor segmentation results.
no code implementations • 28 Sep 2022 • Meng Wei, Yong Zhou, Zhongnian Li, Xinzheng Xu
In such scenarios, the number of samples in one class is considerably lower than in other classes, which consequently leads to a decline in the accuracy of predictions.
1 code implementation • 8 Mar 2022 • Yanda Meng, Joshua Bridge, Meng Wei, Yitian Zhao, Yihong Qiao, Xiaoyun Yang, Xiaowei Huang, Yalin Zheng
This paper proposes an adaptive auxiliary task learning based approach for object counting problems.
1 code implementation • 10 Dec 2021 • Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua
Since each verb is associated with a specific set of semantic roles, all existing GSR methods resort to a two-stage framework: predicting the verb in the first stage and detecting the semantic roles in the second stage.
Ranked #4 on Situation Recognition on imSitu
1 code implementation • ICCV 2021 • Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip Torr, Wayne Zhang, Dahua Lin
As a typical example, the Vision Transformer (ViT) directly applies a pure transformer architecture on image classification, by simply splitting images into tokens with a fixed length, and employing transformers to learn relations between these tokens.
no code implementations • 12 Aug 2020 • Meng Wei, Chun Yuan, Xiaoyu Yue, Kuo Zhong
Second, since learning too many context-specific classification subspaces can suffer from data sparsity issues, we propose a hierarchical semantic aggregation (HSA) module to reduce the number of subspaces by introducing higher-order structural information.
no code implementations • 17 Oct 2018 • Zhenghang Zhong, Zhe Tang, Xiangxing Li, Tiancheng Yuan, Yang Yang, Meng Wei, Yuanyuan Zhang, Renzhi Sheng, Naomi Grant, Chongfeng Ling, Xintao Huan, Kyeong Soo Kim, Sanghyuk Lee
In this paper, we present a new location fingerprinting database comprised of Wi-Fi received signal strength (RSS) and geomagnetic field intensity measured with multiple devices at a multi-floor building in Xi'an Jiaotong-Liverpool University, Suzhou, China.