Search Results for author: Peike Li

Found 13 papers, 4 papers with code

Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment

no code implementations17 Mar 2025 Chen Liu, Peike Li, Liying Yang, Dadong Wang, Lincheng Li, Xin Yu

Accurately localizing audible objects based on audio-visual cues is the core objective of audio-visual segmentation.

Contrastive Learning

Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics

no code implementations17 Mar 2025 Chen Liu, Liying Yang, Peike Li, Dadong Wang, Lincheng Li, Xin Yu

Considering that not all derived audio representations directly correspond to visual features (e. g., off-screen sounds), we propose a dynamic elimination module to filter out non-matching elements.

Semantic Segmentation

JEN-1 DreamStyler: Customized Musical Concept Learning via Pivotal Parameters Tuning

no code implementations18 Jun 2024 BoYu Chen, Peike Li, Yao Yao, Alex Wang

In this paper, we propose a novel method for customized text-to-music generation, which can capture the concept from a two-minute reference music and generate a new piece of music conforming to the concept.

Music Generation Text-to-Music Generation

JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation

1 code implementation29 Oct 2023 Yao Yao, Peike Li, BoYu Chen, Alex Wang

With rapid advances in generative artificial intelligence, the text-to-music synthesis task has emerged as a promising direction for music generation.

Music Generation

BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge

no code implementations20 Aug 2023 Chen Liu, Peike Li, Hu Zhang, Lincheng Li, Zi Huang, Dadong Wang, Xin Yu

In a nutshell, our BAVS is designed to eliminate the interference of background noise or off-screen sounds in segmentation by establishing the audio-visual correspondences in an explicit manner.

Audio Classification Segmentation

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

2 code implementations9 Aug 2023 Peike Li, BoYu Chen, Yao Yao, Yikai Wang, Allen Wang, Alex Wang

Despite the task's significance, prevailing generative models exhibit limitations in music quality, computational efficiency, and generalization.

Computational Efficiency In-Context Learning +2

Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics

no code implementations31 Jul 2023 Chen Liu, Peike Li, Xingqun Qi, Hu Zhang, Lincheng Li, Dadong Wang, Xin Yu

However, we observed that prior arts are prone to segment a certain salient object in a video regardless of the audio information.

Object Segmentation +1

Dynamic Gradient Reactivation for Backward Compatible Person Re-identification

no code implementations12 Jul 2022 Xiao Pan, Hao Luo, Weihua Chen, Fan Wang, Hao Li, Wei Jiang, Jianming Zhang, Jianyang Gu, Peike Li

To address this issue, we propose the Ranking-based Backward Compatible Learning (RBCL), which directly optimizes the ranking metric between new features and old features.

Person Re-Identification Retrieval

In-N-Out Generative Learning for Dense Unsupervised Video Segmentation

1 code implementation29 Mar 2022 Xiao Pan, Peike Li, Zongxin Yang, Huiling Zhou, Chang Zhou, Hongxia Yang, Jingren Zhou, Yi Yang

By contrast, pixel-level optimization is more explicit, however, it is sensitive to the visual quality of training data and is not robust to object deformation.

Contrastive Learning Semantic Segmentation +3

Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar

no code implementations ICCV 2021 Peike Li, Xin Yu, Yi Yang

By iteratively updating the latent representations and our decoder, our DAP-FSR will be adapted to the target domain, thus achieving authentic and high-quality upsampled HR faces.

Decoder Super-Resolution

Consistent Structural Relation Learning for Zero-Shot Segmentation

no code implementations NeurIPS 2020 Peike Li, Yunchao Wei, Yi Yang

Concretely, by exploring the pair-wise and list-wise structures, we impose the relations of generated visual features to be consistent with their counterparts in the semantic word embedding space.

Relation Semantic Segmentation +3

Self-Correction for Human Parsing

2 code implementations22 Oct 2019 Peike Li, Yunqiu Xu, Yunchao Wei, Yi Yang

To tackle the problem of learning with label noises, this work introduces a purification strategy, called Self-Correction for Human Parsing (SCHP), to progressively promote the reliability of the supervised labels as well as the learned models.

Human Parsing Human Part Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.