Search Results for author: Chenchen Zhu

Found 24 papers, 12 papers with code

SqueezeSAM: User friendly mobile interactive segmentation

no code implementations • 11 Dec 2023 • Balakrishnan Varadarajan, Bilge Soran, Forrest Iandola, Xiaoyu Xiang, Yunyang Xiong, Chenchen Zhu, Raghuraman Krishnamoorthi, Vikas Chandra

Finally, when a user clicks on an object, they typically expect all related pieces of the object to be segmented.

Data Augmentation Interactive Segmentation +4

Paper
Add Code

Gen2Det: Generate to Detect

no code implementations • 7 Dec 2023 • Saksham Suri, Fanyi Xiao, Animesh Sinha, Sean Chang Culatana, Raghuraman Krishnamoorthi, Chenchen Zhu, Abhinav Shrivastava

In the long-tailed detection setting on LVIS, Gen2Det improves the performance on rare categories by a large margin while also significantly improving the performance on other categories, e. g. we see an improvement of 2. 13 Box AP and 1. 84 Mask AP over just training on real data on LVIS with Mask R-CNN.

Image Generation Object +2

Paper
Add Code

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

no code implementations • 4 Dec 2023 • Zhuoran Yu, Chenchen Zhu, Sean Culatana, Raghuraman Krishnamoorthi, Fanyi Xiao, Yong Jae Lee

We present a new framework leveraging off-the-shelf generative models to generate synthetic training images, addressing multiple challenges: class name ambiguity, lack of diversity in naive prompts, and domain shifts.

Domain Generalization Text-to-Image Generation

Paper
Add Code

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

1 code implementation • 1 Dec 2023 • Yunyang Xiong, Bala Varadarajan, Lemeng Wu, Xiaoyu Xiang, Fanyi Xiao, Chenchen Zhu, Xiaoliang Dai, Dilin Wang, Fei Sun, Forrest Iandola, Raghuraman Krishnamoorthi, Vikas Chandra

On segment anything task such as zero-shot instance segmentation, our EfficientSAMs with SAMI-pretrained lightweight image encoders perform favorably with a significant gain (e. g., ~4 AP on COCO/LVIS) over other fast SAM models.

Ranked #3 on Zero-Shot Instance Segmentation on LVIS v1.0 val

Image Classification Instance Segmentation +5

1,737

Paper
Code

EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding

1 code implementation • ICCV 2023 • Chenchen Zhu, Fanyi Xiao, Andres Alvarado, Yasmine Babaei, Jiabo Hu, Hichem El-Mohri, Sean Chang Culatana, Roshan Sumbaly, Zhicheng Yan

To bootstrap the research on EgoObjects, we present a suite of 4 benchmark tasks around the egocentric object understanding, including a novel instance level- and the classical category level object detection.

Continual Learning Object +2

Paper
Code

Exploring Open-Vocabulary Semantic Segmentation without Human Labels

no code implementations • 1 Jun 2023 • Jun Chen, Deyao Zhu, Guocheng Qian, Bernard Ghanem, Zhicheng Yan, Chenchen Zhu, Fanyi Xiao, Mohamed Elhoseiny, Sean Chang Culatana

Although acquired extensive knowledge of visual concepts, it is non-trivial to exploit knowledge from these VL models to the task of semantic segmentation, as they are usually trained at an image level.

Open Vocabulary Semantic Segmentation Segmentation +3

Paper
Add Code

Going Denser with Open-Vocabulary Part Segmentation

2 code implementations • ICCV 2023 • Peize Sun, Shoufa Chen, Chenchen Zhu, Fanyi Xiao, Ping Luo, Saining Xie, Zhicheng Yan

In this paper, we propose a detector with the ability to predict both open-vocabulary objects and their part segmentation.

Object object-detection +3

361

Paper
Code

Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only

no code implementations • ICCV 2023 • Jun Chen, Deyao Zhu, Guocheng Qian, Bernard Ghanem, Zhicheng Yan, Chenchen Zhu, Fanyi Xiao, Sean Chang Culatana, Mohamed Elhoseiny

Semantic segmentation is a crucial task in computer vision that involves segmenting images into semantically meaningful regions at the pixel level.

Open Vocabulary Semantic Segmentation Segmentation +3

Paper
Add Code

Enhanced Training of Query-Based Object Detection via Selective Query Recollection

2 code implementations • CVPR 2023 • Fangyi Chen, Han Zhang, Kai Hu, Yu-Kai Huang, Chenchen Zhu, Marios Savvides

This paper investigates a phenomenon where query-based object detectors mispredict at the last decoding stage while predicting correctly at an intermediate stage.

Ranked #10 on Object Detection on COCO 2017 val

Attribute Object +2

1,816

Paper
Code

3rd Continual Learning Workshop Challenge on Egocentric Category and Instance Level Object Understanding

1 code implementation • 13 Dec 2022 • Lorenzo Pellegrini, Chenchen Zhu, Fanyi Xiao, Zhicheng Yan, Antonio Carta, Matthias De Lange, Vincenzo Lomonaco, Roshan Sumbaly, Pau Rodriguez, David Vazquez

Continual Learning, also known as Lifelong or Incremental Learning, has recently gained renewed interest among the Artificial Intelligence research community.

Continual Learning Incremental Learning +3

Paper
Code

Unitail: Detecting, Reading, and Matching in Retail Scene

no code implementations • 1 Apr 2022 • Fangyi Chen, Han Zhang, Zaiwang Li, Jiachen Dou, Shentong Mo, Hao Chen, Yongxin Zhang, Uzair Ahmed, Chenchen Zhu, Marios Savvides

To make full use of computer vision technology in stores, it is required to consider the actual needs that fit the characteristics of the retail scene.

Ranked #1 on Dense Object Detection on SKU-110K

Benchmarking Dense Object Detection +2

Paper
Add Code

Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection

no code implementations • CVPR 2021 • Chenchen Zhu, Fangyi Chen, Uzair Ahmed, Zhiqiang Shen, Marios Savvides

In this work, we investigate utilizing this semantic relation together with the visual information and introduce explicit relation reasoning into the learning of novel object detection.

Ranked #14 on Few-Shot Object Detection on MS-COCO (30-shot)

Few-Shot Object Detection Novel Object Detection +2

Paper
Add Code

Solving Missing-Annotation Object Detection with Background Recalibration Loss

2 code implementations • 12 Feb 2020 • Han Zhang, Fangyi Chen, Zhiqiang Shen, Qiqi Hao, Chenchen Zhu, Marios Savvides

In this paper, we introduce a superior solution called Background Recalibration Loss (BRL) that can automatically re-calibrate the loss signals according to the pre-defined IoU threshold and input image.

Object object-detection +1

Paper
Code

Soft Anchor-Point Object Detection

2 code implementations • ECCV 2020 • Chenchen Zhu, Fangyi Chen, Zhiqiang Shen, Marios Savvides

In this work, we boost the performance of the anchor-point detector over the key-point counterparts while maintaining the speed advantage.

Ranked #3 on Dense Object Detection on SKU-110K

Dense Object Detection feature selection +2

118

Paper
Code

SAPD：Soft Anchor-Point Detector

1 code implementation • arXiv 2019 • Chenchen Zhu, Fangyi Chen, Zhiqiang Shen, Marios Savvides

In this work, we aim at finding a new balance of speed and accuracy for anchor-free detectors.

feature selection Object Detection

Paper
Code

Adversarial-Based Knowledge Distillation for Multi-Model Ensemble and Noisy Data Refinement

no code implementations • 22 Aug 2019 • Zhiqiang Shen, Zhankui He, Wanyun Cui, Jiahui Yu, Yutong Zheng, Chenchen Zhu, Marios Savvides

In order to distill diverse knowledge from different trained (teacher) models, we propose to use adversarial-based learning strategy where we define a block-wise training loss to guide and optimize the predefined student network to recover the knowledge in teacher models, and to promote the discriminator network to distinguish teacher vs. student features simultaneously.

Knowledge Distillation Missing Labels

Paper
Add Code

MMDetection: Open MMLab Detection Toolbox and Benchmark

144 code implementations • 17 Jun 2019 • Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin

In this paper, we introduce the various features of this toolbox.

Benchmarking Instance Segmentation +2

27,729

Paper
Code

Feature Selective Anchor-Free Module for Single-Shot Object Detection

4 code implementations • CVPR 2019 • Chenchen Zhu, Yihui He, Marios Savvides

The general concept of the FSAF module is online feature selection applied to the training of multi-level anchor-free branches.

Ranked #128 on Object Detection on COCO test-dev

feature selection object-detection +1

27,729

Paper
Code

Bounding Box Regression with Uncertainty for Accurate Object Detection

4 code implementations • CVPR 2019 • Yihui He, Chenchen Zhu, Jianren Wang, Marios Savvides, Xiangyu Zhang

Large-scale object detection datasets (e. g., MS-COCO) try to define the ground truth bounding boxes as clear as possible.

Ranked #22 on Object Detection on PASCAL VOC 2007

Object object-detection +3

704

Paper
Code

Seeing Small Faces from Robust Anchor's Perspective

no code implementations • CVPR 2018 • Chenchen Zhu, Ran Tao, Khoa Luu, Marios Savvides

This paper introduces a novel anchor design to support anchor-based face detection for superior scale-invariant performance, especially on tiny faces.

Face Detection

Paper
Add Code

Faster Than Real-time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses

no code implementations • ICCV 2017 • Chandrasekhar Bhagavatula, Chenchen Zhu, Khoa Luu, Marios Savvides

We present our novel approach to simultaneously extract the 3D shape of the face and the semantically consistent 2D alignment through a 3D Spatial Transformer Network (3DSTN) to model both the camera projection matrix and the warping parameters of a 3D model.

Ranked #10 on Face Alignment on AFLW2000-3D

Face Alignment

Paper
Add Code

Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation

1 code implementation • 12 Apr 2017 • Ngan Le, Kha Gia Quach, Khoa Luu, Marios Savvides, Chenchen Zhu

To address these issues and boost the classic variational LS methods to a new level of the learnable deep learning approaches, we propose a novel definition of contour evolution named Recurrent Level Set (RLS)} to employ Gated Recurrent Unit under the energy minimization of a variational LS functional.

Segmentation Semantic Segmentation

Paper
Code

Towards a Deep Learning Framework for Unconstrained Face Detection

no code implementations • 16 Dec 2016 • Yutong Zheng, Chenchen Zhu, Khoa Luu, Chandrasekhar Bhagavatula, T. Hoang Ngan Le, Marios Savvides

Robust face detection is one of the most important pre-processing steps to support facial expression analysis, facial landmarking, face recognition, pose estimation, building of 3D facial models, etc.

Face Detection Face Recognition +2

Paper
Add Code

CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection

no code implementations • 17 Jun 2016 • Chenchen Zhu, Yutong Zheng, Khoa Luu, Marios Savvides

Robust face detection in the wild is one of the ultimate components to support various facial related problems, i. e. unconstrained face recognition, facial periocular recognition, facial landmarking and pose estimation, facial expression recognition, 3D facial model construction, etc.

Ranked #30 on Face Detection on WIDER Face (Medium)

Face Detection Face Recognition +6

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.