Search Results for author: Shilong Zhang

Found 10 papers, 9 papers with code

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

no code implementations • 25 Mar 2024 • Shilong Zhang, Lianghua Huang, Xi Chen, Yifei Zhang, Zhi-Fan Wu, Yutong Feng, Wei Wang, Yujun Shen, Yu Liu, Ping Luo

This work presents FlashFace, a practical tool with which users can easily personalize their own photos on the fly by providing one or a few reference face images and a text prompt.

Face Swapping Instruction Following +1

Paper
Add Code

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

2 code implementations • 7 Jul 2023 • Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Yu Liu, Kai Chen, Ping Luo

Before sending to LLM, the reference is replaced by RoI features and interleaved with language embeddings as a sequence.

Ranked #1 on Visual Question Answering (VQA) on VCR (Q-AR) test

Attribute Common Sense Reasoning +4

450

Paper
Code

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

1 code implementation • 8 May 2023 • Tao Gong, Chengqi Lyu, Shilong Zhang, Yudong Wang, Miao Zheng, Qian Zhao, Kuikun Liu, Wenwei Zhang, Ping Luo, Kai Chen

To further enhance the ability to chat with humans of the MultiModal-GPT, we utilize language-only instruction-following data to train the MultiModal-GPT jointly.

Instruction Following Language Modelling

1,397

Paper
Code

Dense Distinct Query for End-to-End Object Detection

1 code implementation • CVPR 2023 • Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen

Concretely, we first lay dense queries like traditional detectors and then select distinct ones for one-to-one assignments.

Ranked #3 on Object Detection on CrowdHuman (full body)

Object object-detection +1

236

Paper
Code

RTMDet: An Empirical Study of Designing Real-Time Object Detectors

9 code implementations • 14 Dec 2022 • Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi Liu, Shilong Zhang, Kai Chen

In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection.

Ranked #1 on Oriented Object Detection on DOTA 1.5

Object object-detection +7

27,709

Paper
Code

Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection

1 code implementation • CVPR 2023 • Xinjiang Wang, Xingyi Yang, Shilong Zhang, Yijiang Li, Litong Feng, Shijie Fang, Chengqi Lyu, Kai Chen, Wayne Zhang

In this study, we dive deep into the inconsistency of pseudo targets in semi-supervised object detection (SSOD).

Ranked #2 on Semi-Supervised Object Detection on COCO 2% labeled data

object-detection Object Detection +1

270

Paper
Code

What Are Expected Queries in End-to-End Object Detection?

1 code implementation • 2 Jun 2022 • Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Kai Chen

As both sparse and dense queries are imperfect, then \emph{what are expected queries in end-to-end object detection}?

Instance Segmentation object-detection +2

236

Paper
Code

Group R-CNN for Weakly Semi-supervised Object Detection with Points

1 code implementation • CVPR 2022 • Shilong Zhang, Zhuoran Yu, Liyang Liu, Xinjiang Wang, Aojun Zhou, Kai Chen

The core of this task is to train a point-to-box regressor on well-labeled images that can be used to predict credible bounding boxes for each point annotation.

Object Detection Representation Learning +1

137

Paper
Code

Group Fisher Pruning for Practical Network Compression

2 code implementations • 2 Aug 2021 • Liyang Liu, Shilong Zhang, Zhanghui Kuang, Aojun Zhou, Jing-Hao Xue, Xinjiang Wang, Yimin Chen, Wenming Yang, Qingmin Liao, Wayne Zhang

Our method can be used to prune any structures including those with coupled channels.

Image Classification object-detection +1

149

Paper
Code

Scale-Equalizing Pyramid Convolution for Object Detection

2 code implementations • CVPR 2020 • Xinjiang Wang, Shilong Zhang, Zhuoran Yu, Litong Feng, Wayne Zhang

Inspired by this, a convolution across the pyramid level is proposed in this study, which is termed pyramid convolution and is a modified 3-D convolution.

Ranked #87 on Object Detection on COCO test-dev

Object object-detection +1

416

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.