Search Results for author: Peixian Chen

Found 18 papers, 11 papers with code

SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation

1 code implementation • 4 Apr 2024 • Sichen Chen, Yingyi Zhang, Siming Huang, Ran Yi, Ke Fan, Ruixin Zhang, Peixian Chen, Jun Wang, Shouhong Ding, Lizhuang Ma

To mitigate the problem of under-fitting, we design a transformer module named Multi-Cycled Transformer(MCT) based on multiple-cycled forwards to more fully exploit the potential of small model parameters.

Edge-computing Pose Estimation

Paper
Code

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

2 code implementations • 19 Dec 2023 • Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

They endow Large Language Models (LLMs) with powerful capabilities in visual understanding, enabling them to tackle diverse multi-modal tasks.

Visual Reasoning

8,783

Paper
Code

Aligning and Prompting Everything All at Once for Universal Visual Perception

2 code implementations • 4 Dec 2023 • Yunhang Shen, Chaoyou Fu, Peixian Chen, Mengdan Zhang, Ke Li, Xing Sun, Yunsheng Wu, Shaohui Lin, Rongrong Ji

However, predominant paradigms, driven by casting instance-level tasks as an object-word alignment, bring heavy cross-modality interaction, which is not effective in prompting object detection and visual grounding.

Object object-detection +6

411

Paper
Code

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

3 code implementations • 23 Jun 2023 • Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Yunsheng Wu, Rongrong Ji

Multimodal Large Language Model (MLLM) relies on the powerful LLM to perform multimodal tasks, showing amazing emergent abilities in recent studies, such as writing poems based on an image.

Benchmarking Language Modelling +3

8,783

Paper
Code

Multi-modal Queried Object Detection in the Wild

1 code implementation • NeurIPS 2023 • Yifan Xu, Mengdan Zhang, Chaoyou Fu, Peixian Chen, Xiaoshan Yang, Ke Li, Changsheng Xu

To address the learning inertia problem brought by the frozen detector, a vision conditioned masked language prediction strategy is proposed.

Ranked #1 on Few-Shot Object Detection on ODinW-35

Few-Shot Object Detection Object +2

223

Paper
Code

Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization

2 code implementations • 22 Jun 2022 • Peixian Chen, Kekai Sheng, Mengdan Zhang, Mingbao Lin, Yunhang Shen, Shaohui Lin, Bo Ren, Ke Li

Open-vocabulary object detection (OVD) aims to scale up vocabulary size to detect objects of novel categories beyond the training vocabulary.

Ranked #12 on Open Vocabulary Object Detection on LVIS v1.0

Causal Inference object-detection +1

Paper
Code

Efficient Decoder-free Object Detection with Transformers

2 code implementations • 14 Jun 2022 • Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen

A natural usage of ViTs in detection is to replace the CNN-based backbone with a transformer-based backbone, which is straightforward and effective, with the price of bringing considerable computation burden for inference.

Object Object Detection

Paper
Code

ARM: Any-Time Super-Resolution Method

1 code implementation • 21 Mar 2022 • Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji

To that effect, we construct an Edge-to-PSNR lookup table that maps the edge score of an image patch to the PSNR performance for each subnet, together with a set of computation costs for the subnets.

Image Super-Resolution

Paper
Code

Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID

no code implementations • ICCV 2021 • Peixian Chen, Wenfeng Liu, Pingyang Dai, Jianzhuang Liu, Qixiang Ye, Mingliang Xu, Qi'an Chen, Rongrong Ji

To avoid such problematic models in occluded person ReID, we propose the Occlusion-Aware Mask Network (OAMN).

Person Re-Identification

Paper
Add Code

Aha! Adaptive History-Driven Attack for Decision-Based Black-Box Models

1 code implementation • ICCV 2021 • Jie Li, Rongrong Ji, Peixian Chen, Baochang Zhang, Xiaopeng Hong, Ruixin Zhang, Shaoxin Li, Jilin Li, Feiyue Huang, Yongjian Wu

A common practice is to start from a large perturbation and then iteratively reduce it with a deterministic direction and a random one while keeping it adversarial.

Dimensionality Reduction

Paper
Code

Learning Task-oriented Disentangled Representations for Unsupervised Domain Adaptation

no code implementations • 27 Jul 2020 • Pingyang Dai, Peixian Chen, Qiong Wu, Xiaopeng Hong, Qixiang Ye, Qi Tian, Rongrong Ji

This drawback limits the flexibility of UDA in complicated open-set tasks where no labels are shared between domains.

Retrieval Unsupervised Domain Adaptation

Paper
Add Code

Dual Distribution Alignment Network for Generalizable Person Re-Identification

1 code implementation • 27 Jul 2020 • Peixian Chen, Pingyang Dai, Jianzhuang Liu, Feng Zheng, Qi Tian, Rongrong Ji

Domain generalization (DG) serves as a promising solution to handle person Re-Identification (Re-ID), which trains the model using labels from the source domain alone, and then directly adopts the trained model to the target domain without model updating.

Domain Generalization Generalizable Person Re-identification

Paper
Code

Video-based Person Re-identification with Two-stream Convolutional Network and Co-attentive Snippet Embedding

no code implementations • 28 May 2019 • Peixian Chen, Pingyang Dai, Qiong Wu, Yuyu Huang

Recently, the applications of person re-identification in visual surveillance and human-computer interaction are sharply increasing, which signifies the critical role of such a problem.

Optical Flow Estimation Video-Based Person Re-Identification

Paper
Add Code

A Novel Document Generation Process for Topic Detection based on Hierarchical Latent Tree Models

no code implementations • 12 Dec 2017 • Peixian Chen, Zhourong Chen, Nevin L. Zhang

It is the new state-of-the-art for the hierarchical topic detection.

Topic Models

Paper
Add Code

Sparse Boltzmann Machines with Structure Learning as Applied to Text Analysis

no code implementations • 17 Sep 2016 • Zhourong Chen, Nevin L. Zhang, Dit-yan Yeung, Peixian Chen

We are interested in exploring the possibility and benefits of structure learning for deep models.

Paper
Add Code

Latent Tree Models for Hierarchical Topic Detection

1 code implementation • 21 May 2016 • Peixian Chen, Nevin L. Zhang, Tengfei Liu, Leonard K. M. Poon, Zhourong Chen, Farhan Khawar

The variables at other levels are binary latent variables, with those at the lowest latent level representing word co-occurrence patterns and those at higher levels representing co-occurrence of patterns at the level below.

Clustering Topic Models

Paper
Code

Progressive EM for Latent Tree Models and Hierarchical Topic Detection

no code implementations • 5 Aug 2015 • Peixian Chen, Nevin L. Zhang, Leonard K. M. Poon, Zhourong Chen

It is as efficient as the state-of-the-art LDA-based method for hierarchical topic detection and finds substantially better topics and topic hierarchies.

Paper
Add Code

Bayesian Adaptive Matrix Factorization With Automatic Model Selection

no code implementations • CVPR 2015 • Peixian Chen, Naiyan Wang, Nevin L. Zhang, Dit-yan Yeung

Low-rank matrix factorization has long been recognized as a fundamental problem in many computer vision applications.

Model Selection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.