Search Results for author: Yiming Wu

Found 20 papers, 9 papers with code

PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis

1 code implementation • 24 May 2024 • Zicheng Wang, Zhenghao Chen, Yiming Wu, Zhen Zhao, Luping Zhou, Dong Xu

In this study, we introduce PoinTramba, a pioneering hybrid framework that synergies the analytical power of Transformer with the remarkable computational efficiency of Mamba for enhanced point cloud analysis.

Paper
Code

Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval

no code implementations • 24 May 2024 • Yiming Wu, Hangfei Li, Fangfang Wang, Yilong Zhang, Ronghua Liang

In response, we propose a Self-distilled Dynamic Fusion Network to compose the multi-granularity features dynamically by considering the consistency of routing path and modality-specific information simultaneously.

Paper
Add Code

SOEDiff: Efficient Distillation for Small Object Editing

no code implementations • 15 May 2024 • Qihe Pan, Zicheng Wang, Zhen Zhao, Yiming Wu, Sifan Long, Haoran Liang, Ronghua Liang

In this paper, we delve into a new task known as small object editing (SOE), which focuses on text-based image inpainting within a constrained, small-sized area.

Image Inpainting Object

Paper
Add Code

Training-Free Unsupervised Prompt for Vision-Language Models

1 code implementation • 25 Apr 2024 • Sifan Long, Linbin Wang, Zhen Zhao, Zichang Tan, Yiming Wu, Shengsheng Wang, Jingdong Wang

In light of this, we propose Training-Free Unsupervised Prompts (TFUP), which maximally preserves the inherent representation capabilities and enhances them with a residual connection to similarity-based prediction probabilities in a training-free and labeling-free manner.

Paper
Code

Progressive Target-Styled Feature Augmentation for Unsupervised Domain Adaptation on Point Clouds

1 code implementation • 27 Nov 2023 • Zicheng Wang, Zhen Zhao, Yiming Wu, Luping Zhou, Dong Xu

Unlike previous works that focus on feature extractor adaptation, our PTSFA approach focuses on classifier adaptation.

Self-Supervised Learning Unsupervised Domain Adaptation

Paper
Code

Panoptic Scene Graph Generation with Semantics-Prototype Learning

1 code implementation • 28 Jul 2023 • Li Li, Wei Ji, Yiming Wu, Mengze Li, You Qin, Lina Wei, Roger Zimmermann

To promise consistency and accuracy during the transfer process, we propose to measure the invariance of representations in each predicate class, and learn unbiased prototypes of predicates with different intensities.

Ranked #3 on Panoptic Scene Graph Generation on PSG Dataset

Graph Generation Panoptic Scene Graph Generation

Paper
Code

HeightFormer: Explicit Height Modeling without Extra Data for Camera-only 3D Object Detection in Bird's Eye View

no code implementations • 25 Jul 2023 • Yiming Wu, Ruixiang Li, Zequn Qin, Xinhai Zhao, Xi Li

In this work, we propose to explicitly model heights in the BEV space, which needs no extra data like LiDAR and can fit arbitrary camera rigs and types compared to modeling depths.

3D Object Detection Autonomous Driving +1

Paper
Add Code

MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding

no code implementations • 26 Dec 2022 • Wei Ji, Long Chen, Yinwei Wei, Yiming Wu, Tat-Seng Chua

In this work, we propose a novel multi-resolution temporal video sentence grounding network: MRTNet, which consists of a multi-modal feature encoder, a Multi-Resolution Temporal (MRT) module, and a predictor module.

Decoder Descriptive +1

Paper
Add Code

Improving Long Tailed Document-Level Relation Extraction via Easy Relation Augmentation and Contrastive Learning

no code implementations • 21 May 2022 • Yangkai Du, Tengfei Ma, Lingfei Wu, Yiming Wu, Xuhong Zhang, Bo Long, Shouling Ji

Towards real-world information extraction scenario, research of relation extraction is advancing to document-level relation extraction(DocRE).

Ranked #25 on Relation Extraction on DocRED

Contrastive Learning Document-level Relation Extraction +1

Paper
Add Code

D3T-GAN: Data-Dependent Domain Transfer GANs for Few-shot Image Generation

no code implementations • 12 May 2022 • Xintian Wu, Huanyu Wang, Yiming Wu, Xi Li

To transfer knowledge between discriminators, we design a multi-level discriminant knowledge distillation from the source discriminator to the target discriminator on both the real and fake samples.

Image Generation Knowledge Distillation +1

Paper
Add Code

F3A-GAN: Facial Flow for Face Animation with Generative Adversarial Networks

no code implementations • 12 May 2022 • Xintian Wu, Qihang Zhang, Yiming Wu, Huanyu Wang, Songyuan Li, Lingyun Sun, Xi Li

Formulated as a conditional generation problem, face animation aims at synthesizing continuous face images from a single source image driven by a set of conditional face motion.

Paper
Add Code

MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification

1 code implementation • 12 Oct 2021 • Yiming Wu, Xintian Wu, Xi Li, Jian Tian

As a challenging task, unsupervised person ReID aims to match the same identity with query images which does not require any labeled information.

Unsupervised Person Re-Identification

Paper
Code

Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Latent Chord Labels and Features

1 code implementation • 14 May 2020 • Yiming Wu, Tristan Carsault, Eita Nakamura, Kazuyoshi Yoshii

In contrast, we propose a unified generative and discriminative approach in the framework of amortized variational inference.

General Classification Variational Inference

Paper
Code

BANet: Bidirectional Aggregation Network with Occlusion Handling for Panoptic Segmentation

no code implementations • CVPR 2020 • Yifeng Chen, Guangchen Lin, Songyuan Li, Bourahla Omar, Yiming Wu, Fangfang Wang, Junyi Feng, Mingliang Xu, Xi Li

Panoptic segmentation aims to perform instance segmentation for foreground instances and semantic segmentation for background stuff simultaneously.

Instance Segmentation Occlusion Handling +2

Paper
Add Code

Adaptive Graph Representation Learning for Video Person Re-identification

1 code implementation • 5 Sep 2019 • Yiming Wu, Omar El Farouk Bourahla, Xi Li, Fei Wu, Qi Tian, Xue Zhou

While correlations between parts are ignored in the previous methods, to leverage the relations of different parts, we propose an innovative adaptive graph representation learning scheme for video person Re-ID, which enables the contextual interactions between relevant regional features.

Ranked #3 on Person Re-Identification on PRID2011

Graph Representation Learning Video-Based Person Re-Identification

Paper
Code

An Enhanced Ad Event-Prediction Method Based on Feature Engineering

no code implementations • 3 Jul 2019 • Saeid Soheily Khah, Yiming Wu

In digital advertising, Click-Through Rate (CTR) and Conversion Rate (CVR) are very important metrics for evaluating ad performance.

Feature Engineering Marketing

Paper
Add Code

ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation

1 code implementation • CVPR 2019 • Xiaoliang Dai, Peizhao Zhang, Bichen Wu, Hongxu Yin, Fei Sun, Yanghan Wang, Marat Dukhan, Yunqing Hu, Yiming Wu, Yangqing Jia, Peter Vajda, Matt Uyttendaele, Niraj K. Jha

We formulate platform-aware NN architecture search in an optimization framework and propose a novel algorithm to search for optimal architectures aided by efficient accuracy and resource (latency and/or energy) predictors.

Bayesian Optimization Efficient Neural Network +1