Search Results for author: Jialian Wu

Found 7 papers, 4 papers with code

GRiT: A Generative Region-to-text Transformer for Object Understanding

1 code implementation • 1 Dec 2022 • Jialian Wu, JianFeng Wang, Zhengyuan Yang, Zhe Gan, Zicheng Liu, Junsong Yuan, Lijuan Wang

Specifically, GRiT consists of a visual encoder to extract image features, a foreground object extractor to localize objects, and a text decoder to generate open-set object descriptions.

Ranked #2 on Dense Captioning on Visual Genome

Dense Captioning Descriptive +3

271

Paper
Code

Deformable VisTR: Spatio temporal deformable attention for video instance segmentation

1 code implementation • 12 Mar 2022 • Sudhir Yarram, Jialian Wu, Pan Ji, Yi Xu, Junsong Yuan

To improve the training efficiency, we propose Deformable VisTR, leveraging spatio-temporal deformable attention module that only attends to a small fixed set of key spatio-temporal sampling points around a reference point.

Instance Segmentation Semantic Segmentation +1

Paper
Code

Efficient Video Instance Segmentation via Tracklet Query and Proposal

no code implementations • CVPR 2022 • Jialian Wu, Sudhir Yarram, Hui Liang, Tian Lan, Junsong Yuan, Jayan Eledath, Gerard Medioni

In addition, VisTR is not fully end-to-end learnable in multiple video clips as it requires a hand-crafted data association to link instance tracklets between successive clips.

Instance Segmentation Segmentation +2

Paper
Add Code

Track to Detect and Segment: An Online Multi-Object Tracker

1 code implementation • CVPR 2021 • Jialian Wu, Jiale Cao, Liangchen Song, Yu Wang, Ming Yang, Junsong Yuan

Most online multi-object trackers perform object detection stand-alone in a neural net without any input from tracking.

Ranked #1 on Instance Segmentation on nuScenes

3D Multi-Object Tracking Instance Segmentation +7

544

Paper
Code

Stacked Homography Transformations for Multi-View Pedestrian Detection

no code implementations • ICCV 2021 • Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan

This task is confronted with two challenges: how to establish the 3D correspondences from views to the BEV map and how to assemble occupancy information across views.

Ranked #7 on Multiview Detection on MultiviewX

Multiview Detection Pedestrian Detection

Paper
Add Code

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation

1 code implementation • 13 Aug 2020 • Jialian Wu, Liangchen Song, Tiancai Wang, Qian Zhang, Junsong Yuan

In the classification tree, as the number of parent class nodes are significantly less, their logits are less noisy and can be utilized to suppress the wrong/noisy logits existed in the fine-grained class nodes.

Ranked #5 on Few-Shot Object Detection on LVIS v1.0 val

Classification Few-Shot Object Detection +7

Paper
Code

Temporal-Context Enhanced Detection of Heavily Occluded Pedestrians

no code implementations • CVPR 2020 • Jialian Wu, Chunluan Zhou, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan

State-of-the-art pedestrian detectors have performed promisingly on non-occluded pedestrians, yet they are still confronted by heavy occlusions.

Ranked #18 on Pedestrian Detection on Caltech

Pedestrian Detection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.