Search Results for author: Meng Wei

Found 24 papers, 10 papers with code

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

no code implementations13 Mar 2025 Hao He, Ceyuan Yang, Shanchuan Lin, Yinghao Xu, Meng Wei, Liangke Gui, Qi Zhao, Gordon Wetzstein, Lu Jiang, Hongsheng Li

This paper introduces CameraCtrl II, a framework that enables large-scale dynamic scene exploration through a camera-controlled video diffusion model.

SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration

no code implementations2 Jan 2025 Jianyi Wang, Zhijie Lin, Meng Wei, Yang Zhao, Ceyuan Yang, Chen Change Loy, Lu Jiang

Video restoration poses non-trivial challenges in maintaining fidelity while recovering temporally consistent details from unknown degradations in the wild.

Video Restoration

Learning from Concealed Labels

1 code implementation3 Dec 2024 Zhongnian Li, Meng Wei, Peng Ying, Tongfeng Sun, Xinzheng Xu

Annotating data for sensitive labels (e. g., disease, smoking) poses a potential threats to individual privacy in many real-world scenarios.

Multi-class Classification

ESA: Example Sieve Approach for Multi-Positive and Unlabeled Learning

no code implementations3 Dec 2024 Zhongnian Li, Meng Wei, Peng Ying, Xinzheng Xu

Learning from Multi-Positive and Unlabeled (MPU) data has gradually attracted significant attention from practical applications.

CoA: Chain-of-Action for Generative Semantic Labels

1 code implementation26 Nov 2024 Meng Wei, Zhongnian Li, Peng Ying, Xinzheng Xu

These VLMs leverage a predefined set of categories to construct text prompts for zero-shot reasoning.

Autonomous Driving Image Classification

Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering

no code implementations27 Oct 2024 Meng Wei, Qianyi Wu, Jianmin Zheng, Hamid Rezatofighi, Jianfei Cai

Previous attempts to regularize 3D Gaussian normals often degrade rendering quality due to the fundamental disconnect between normal vectors and the rendering pipeline in 3DGS-based methods.

3DGS Novel View Synthesis

Diffusion Models Need Visual Priors for Image Generation

no code implementations11 Oct 2024 Xiaoyu Yue, Zidong Wang, Zeyu Lu, Shuyang Sun, Meng Wei, Wanli Ouyang, Lei Bai, Luping Zhou

Conventional class-guided diffusion models generally succeed in generating images with correct semantic content, but often struggle with texture details.

Image Generation

Learning from True-False Labels via Multi-modal Prompt Retrieving

1 code implementation24 May 2024 Zhongnian Li, Jinghao Xu, Peng Ying, Meng Wei, Tongfeng Sun, Xinzheng Xu

Weakly supervised learning has recently achieved considerable success in reducing annotation costs and label noise.

Weakly-supervised Learning

Learning from Reduced Labels for Long-Tailed Data

1 code implementation25 Mar 2024 Meng Wei, Zhongnian Li, Yong Zhou, Xinzheng Xu

Long-tailed data is prevalent in real-world classification tasks and heavily relies on supervised information, which makes the annotation process exceptionally labor-intensive and time-consuming.

Weakly-supervised Learning

Determined Multi-Label Learning via Similarity-Based Prompt

no code implementations25 Mar 2024 Meng Wei, Zhongnian Li, Peng Ying, Yong Zhou, Xinzheng Xu

In this novel labeling setting, each training instance is associated with a \textit{determined label} (either "Yes" or "No"), which indicates whether the training instance contains the provided class label.

Multi-Label Classification MUlTI-LABEL-ClASSIFICATION +1

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

no code implementations11 Mar 2024 Zijian Zhou, Miaojing Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren

Finally, to better reflect the clinical significant and insignificant errors that radiologists would normally assign in the report, we introduce a novel clinical quality reinforcement learning strategy.

Decoder Language Modeling +4

OV-PARTS: Towards Open-Vocabulary Part Segmentation

1 code implementation NeurIPS 2023 Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang

Secondly, part segmentation introduces an open granularity challenge due to the diverse and often ambiguous definitions of parts in the open world.

Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation +1

Understanding Masked Autoencoders From a Local Contrastive Perspective

no code implementations3 Oct 2023 Xiaoyu Yue, Lei Bai, Meng Wei, Jiangmiao Pang, Xihui Liu, Luping Zhou, Wanli Ouyang

Masked AutoEncoder (MAE) has revolutionized the field of self-supervised learning with its simple yet effective masking and reconstruction strategies.

Contrastive Learning Data Augmentation +2

SegMatch: A semi-supervised learning method for surgical instrument segmentation

no code implementations9 Aug 2023 Meng Wei, Charlie Budd, Luis C. Garcia-Peraza-Herrera, Reuben Dorent, Miaojing Shi, Tom Vercauteren

Surgical instrument segmentation is recognised as a key enabler to provide advanced surgical assistance and improve computer assisted interventions.

Pseudo Label Segmentation +1

Pick the Best Pre-trained Model: Towards Transferability Estimation for Medical Image Segmentation

1 code implementation22 Jul 2023 Yuncheng Yang, Meng Wei, Junjun He, Jie Yang, Jin Ye, Yun Gu

To make up for its deficiency when applying transfer learning to medical image segmentation, in this paper, we therefore propose a new Transferability Estimation (TE) method.

Image Segmentation Medical Image Segmentation +3

In Defense of Clip-based Video Relation Detection

no code implementations18 Jul 2023 Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Roger Zimmermann

While recent video-based methods utilizing video tubelets have shown promising results, we argue that the effective modeling of spatial and temporal context plays a more significant role than the choice between clip tubelets and video tubelets.

Feature Compression Object Tracking +2

Learning from Stochastic Labels

no code implementations1 Feb 2023 Meng Wei, Zhongnian Li, Yong Zhou, Qiaoyu Guo, Xinzheng Xu

Annotating multi-class instances is a crucial task in the field of machine learning.

Class-Imbalanced Complementary-Label Learning via Weighted Loss

no code implementations28 Sep 2022 Meng Wei, Yong Zhou, Zhongnian Li, Xinzheng Xu

In such scenarios, the number of samples in one class is considerably lower than in other classes, which consequently leads to a decline in the accuracy of predictions.

Multi-class Classification Weakly Supervised Classification

Counting with Adaptive Auxiliary Learning

1 code implementation8 Mar 2022 Yanda Meng, Joshua Bridge, Meng Wei, Yitian Zhao, Yihong Qiao, Xiaoyun Yang, Xiaowei Huang, Yalin Zheng

This paper proposes an adaptive auxiliary task learning based approach for object counting problems.

Auxiliary Learning Object Counting

Rethinking the Two-Stage Framework for Grounded Situation Recognition

1 code implementation10 Dec 2021 Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua

Since each verb is associated with a specific set of semantic roles, all existing GSR methods resort to a two-stage framework: predicting the verb in the first stage and detecting the semantic roles in the second stage.

Grounded Situation Recognition Object Recognition +2

Vision Transformer with Progressive Sampling

1 code implementation ICCV 2021 Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip Torr, Wayne Zhang, Dahua Lin

As a typical example, the Vision Transformer (ViT) directly applies a pure transformer architecture on image classification, by simply splitting images into tokens with a fixed length, and employing transformers to learn relations between these tokens.

Image Classification

HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation

no code implementations12 Aug 2020 Meng Wei, Chun Yuan, Xiaoyu Yue, Kuo Zhong

Second, since learning too many context-specific classification subspaces can suffer from data sparsity issues, we propose a hierarchical semantic aggregation(HSA) module to reduces the number of subspaces by introducing higher order structural information.

General Classification Graph Generation +5

XJTLUIndoorLoc: A New Fingerprinting Database for Indoor Localization and Trajectory Estimation Based on Wi-Fi RSS and Geomagnetic Field

no code implementations17 Oct 2018 Zhenghang Zhong, Zhe Tang, Xiangxing Li, Tiancheng Yuan, Yang Yang, Meng Wei, Yuanyuan Zhang, Renzhi Sheng, Naomi Grant, Chongfeng Ling, Xintao Huan, Kyeong Soo Kim, Sanghyuk Lee

In this paper, we present a new location fingerprinting database comprised of Wi-Fi received signal strength (RSS) and geomagnetic field intensity measured with multiple devices at a multi-floor building in Xi'an Jiatong-Liverpool University, Suzhou, China.

Indoor Localization

Cannot find the paper you are looking for? You can Submit a new open access paper.