Search Results for author: Yun Zheng

Found 19 papers, 6 papers with code

CoReS: Orchestrating the Dance of Reasoning and Segmentation

no code implementations • 8 Apr 2024 • Xiaoyi Bao, Siyang Sun, Shuailei Ma, Kecheng Zheng, Yuxin Guo, Guosheng Zhao, Yun Zheng, Xingang Wang

We believe that the act of reasoning segmentation should mirror the cognitive stages of human visual search, where each step is a progressive refinement of thought toward the final object.

Segmentation

Automated Identification and Segmentation of HI Sources in CRAFTS Using Deep Learning Method

1 code implementation • 29 Mar 2024 • Zihao Song, Huaxi Chen, Donghui Quan, Di Li, Yinghui Zheng, Shulei Ni, Yunchuan Chen, Yun Zheng

We introduce a machine learning-based method for extracting HI sources from 3D spectral data, and construct a dedicated dataset of HI sources from CRAFTS.

UNET Segmentation

Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization

1 code implementation • NeurIPS 2023 • Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng

Audio-Visual Source Localization (AVSL) aims to locate sounding objects within video frames given the paired audio clips.

Contrastive Learning

Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model

no code implementations • 18 Dec 2023 • Shuailei Ma, Chen-Wei Xie, Ying Wei, Siyang Sun, Jiaqi Fan, Xiaoyi Bao, Yuxin Guo, Yun Zheng

In this paper, we conduct a direct analysis of the multi-modal prompts by asking the following questions: $(i)$ How do the learned multi-modal prompts improve the recognition performance?

Language Modelling

Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation

no code implementations • 11 Dec 2023 • Xiaoyi Bao, Jie Qin, Siyang Sun, Yun Zheng, Xingang Wang

To improve the semantic consistency of foreground instances, we propose an unlabeled branch as an efficient data utilization method, which teaches the model how to extract intrinsic features robust to intra-class differences.

Few-Shot Semantic Segmentation, Semantic Segmentation

MomentDiff: Generative Video Moment Retrieval from Random to Real

1 code implementation • NeurIPS 2023 • Pandeng Li, Chen-Wei Xie, Hongtao Xie, Liming Zhao, Lei Zhang, Yun Zheng, Deli Zhao, Yongdong Zhang

Video moment retrieval pursues an efficient and generalized solution to identify the specific temporal segments within an untrimmed video that correspond to a given language description.

Moment Retrieval, Retrieval

RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training

no code implementations • CVPR 2023 • Chen-Wei Xie, Siyang Sun, Xiong Xiong, Yun Zheng, Deli Zhao, Jingren Zhou

This process can be considered as an open-book exam: with the reference set as a cheat sheet, the proposed method doesn't need to memorize all visual concepts in the training data.

Classification, Image Classification, +5

Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval

1 code implementation • ICCV 2023 • Pandeng Li, Chen-Wei Xie, Liming Zhao, Hongtao Xie, Jiannan Ge, Yun Zheng, Deli Zhao, Yongdong Zhang

In the event-sentence prototype matching phase, we design a temporal prototype generation mechanism to associate intra-frame objects and interact inter-frame temporal relations.

Object Retrieval +2

RCL: Recurrent Continuous Localization for Temporal Action Detection

no code implementations • CVPR 2022 • Qiang Wang, Yanhao Zhang, Yun Zheng, Pan Pan

Temporal representation is the cornerstone of modern action detection techniques.

Action Detection

Disentangled Representation Learning for Text-Video Retrieval

2 code implementations • 14 Mar 2022 • Qiang Wang, Yanhao Zhang, Yun Zheng, Pan Pan, Xian-Sheng Hua

Cross-modality interaction is a critical component in Text-Video Retrieval (TVR), yet there has been little examination of how different influencing factors for computing interaction affect performance.

Ranked #9 on Video Retrieval on MSR-VTT-1kA (using extra training data)

Representation Learning, Retrieval, +1

Multiple Object Tracking with Correlation Learning

no code implementations • CVPR 2021 • Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu

Recent works have shown that convolutional networks have substantially improved the performance of multiple object tracking by simultaneously learning detection and appearance features.

Multiple Object Tracking, Object, +1

Few-Shot Incremental Learning with Continually Evolved Classifiers

1 code implementation • CVPR 2021 • Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu

First, we adopt a simple but effective decoupled learning strategy for representations and classifiers, in which only the classifiers are updated in each incremental session, which avoids knowledge forgetting in the representations.

Ranked #7 on Few-Shot Class-Incremental Learning on CIFAR-100

Few-Shot Class-Incremental Learning, Incremental Learning

Large-Scale Visual Search with Binary Distributed Graph at Alibaba

no code implementations • 9 Feb 2021 • Kang Zhao, Pan Pan, Yun Zheng, Yanhao Zhang, Changxu Wang, Yingya Zhang, Yinghui Xu, Rong Jin

For a deployed visual search system with several billions of online images in total, building a billion-scale offline graph in hours is essential, which is almost unachievable by most existing methods.

graph construction

Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce

no code implementations • 9 Feb 2021 • Yanhao Zhang, Qiang Wang, Pan Pan, Yun Zheng, Cheng Da, Siyang Sun, Yinghui Xu

Nowadays, live-stream and short video shopping in E-commerce have grown exponentially.

Retrieval, Video-to-Shop

Visual Search at Alibaba

no code implementations • 9 Feb 2021 • Yanhao Zhang, Pan Pan, Yun Zheng, Kang Zhao, Yingya Zhang, Xiaofeng Ren, Rong Jin

We hope visual search at Alibaba becomes more widely incorporated into today's commercial applications.

Image Retrieval

Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting Richness of User Click Behavior for Visual Search Relevance

no code implementations • 9 Feb 2021 • Yanhao Zhang, Pan Pan, Yun Zheng, Kang Zhao, Jianmin Wu, Yinghui Xu, Rong Jin

Benefiting from exploration of user click data, our networks are more effective to encode richer supervision and better distinguish real-shot images in terms of category and feature.

Large Scale Long-tailed Product Recognition System at Alibaba

no code implementations • 9 Feb 2021 • Xiangzeng Zhou, Pan Pan, Yun Zheng, Yinghui Xu, Rong Jin

In this paper, we present a novel side-information-based large-scale visual recognition co-training (SICoT) system to deal with the long-tail problem by leveraging image-related side information.

Object Recognition

Weakly Supervised Learning with Side Information for Noisy Labeled Images

no code implementations • ECCV 2020 • Lele Cheng, Xiangzeng Zhou, Liming Zhao, Dangwei Li, Hong Shang, Yun Zheng, Pan Pan, Yinghui Xu

In many real-world datasets, like WebVision, the performance of DNN-based classifiers is often limited by noisy labeled data.

Weakly-supervised Learning

Intrinsic radiation background of LaBr$_3$(Ce) detector via coincidence measurements and simulations

no code implementations • 15 Jul 2020 • Hao Cheng, Bao-Hua Sun, Li-Hua Zhu, Tian-Xiao Li, Guang-Shuai Li, Cong-Bo Li, Xiao-Guang Wu, Yun Zheng

The LaBr$_3$(Ce) detector has attracted much attention in recent years for its superior characteristics to other scintillating materials in terms of resolution and efficiency.

Instrumentation and Detectors, Nuclear Experiment
