Search Results for author: Zhihui Wang

Found 38 papers, 12 papers with code

ConsistencyDet: A Robust Object Detector with a Denoising Paradigm of Consistency Model

1 code implementation • 11 Apr 2024 • Lifan Jiang, Zhihui Wang, Changmiao Wang, Ming Li, Jiaxu Leng, Xindong Wu

In the present study, we introduce a novel framework designed to articulate object detection as a denoising diffusion process, which operates on the perturbed bounding boxes of annotated entities.

Attribute Denoising +2

Paper
Code

A self-attention-based differentially private tabular GAN with high data utility

no code implementations • 20 Dec 2023 • Zijian Li, Zhihui Wang

Generative Adversarial Networks (GANs) have become a ubiquitous technology for data generation, with their prowess in image generation being well-established.

Generative Adversarial Network Image Generation

Paper
Add Code

3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V

no code implementations • 15 Dec 2023 • Dingning Liu, Xiaomeng Dong, Renrui Zhang, Xu Luo, Peng Gao, Xiaoshui Huang, Yongshun Gong, Zhihui Wang

In this work, we present a new visual prompting method called 3DAxiesPrompts (3DAP) to unleash the capabilities of GPT-4V in performing 3D spatial tasks.

3D Object Detection object-detection +1

Paper
Add Code

Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection

no code implementations • ICCV 2023 • Xinzhu Ma, Yongtao Wang, Yinmin Zhang, Zhiyi Xia, Yuan Meng, Zhihui Wang, Haojie Li, Wanli Ouyang

In this work, we build a modular-designed codebase, formulate strong training recipes, design an error diagnosis toolbox, and discuss current methods for image-based 3D object detection.

3D Object Detection Object +1

Paper
Add Code

Dynamic Dual-Graph Fusion Convolutional Network For Alzheimer's Disease Diagnosis

no code implementations • 5 Aug 2023 • Fanshi Li, Zhihui Wang, Yifan Guo, Congcong Liu, Yanjie Zhu, Yihang Zhou, Jun Li, Dong Liang, Haifeng Wang

In this paper, a dynamic dual-graph fusion convolutional network is proposed to improve Alzheimer's disease (AD) diagnosis performance.

Graph Learning

Paper
Add Code

Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator

no code implementations • CVPR 2023 • Shijie Wang, Jianlong Chang, Haojie Li, Zhihui Wang, Wanli Ouyang, Qi Tian

PLEor could leverage pre-trained CLIP model to infer the discrepancies encompassing both pre-defined and unknown subcategories, called category-specific discrepancies, and transfer them to the backbone network trained in the close-set scenarios.

Knowledge Distillation Retrieval +1

Paper
Add Code

Deep Learning-Derived Optimal Aviation Strategies to Control Pandemics

no code implementations • 12 Oct 2022 • Syed Rizvi, Akash Awasthi, Maria J. Peláez, Zhihui Wang, Vittorio Cristini, Hien Van Nguyen, Prashant Dogra

The COVID-19 pandemic has affected countries across the world, demanding drastic public health policies to mitigate the spread of infection, leading to economic crisis as a collateral damage.

Paper
Add Code

TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers

no code implementations • 31 Aug 2022 • Zengyuan Guo, Yuechen Yu, Pengyuan Lv, Chengquan Zhang, Haojie Li, Zhihui Wang, Kun Yao, Jingtuo Liu, Jingdong Wang

The Vertex-based Merging Module is capable of aggregating local contextual information between adjacent basic grids, providing the ability to merge basic girds that belong to the same spanning cell accurately.

Ranked #5 on Table Recognition on PubTabNet

Table Recognition

Paper
Add Code

Semantic decomposition Network with Contrastive and Structural Constraints for Dental Plaque Segmentation

no code implementations • 12 Aug 2022 • Jian Shi, Baoli Sun, Xinchen Ye, Zhihui Wang, Xiaolong Luo, Jin Liu, Heli Gao, Haojie Li

Therefore, we propose a semantic decomposition network (SDNet) that introduces two single-task branches to separately address the segmentation of teeth and dental plaque and designs additional constraints to learn category-specific features for each branch, thus facilitating the semantic decomposition and improving the performance of dental plaque segmentation.

Segmentation

Paper
Add Code

Fine-grained Retrieval Prompt Tuning

no code implementations • 29 Jul 2022 • Shijie Wang, Jianlong Chang, Zhihui Wang, Haojie Li, Wanli Ouyang, Qi Tian

In this paper, we develop Fine-grained Retrieval Prompt Tuning (FRPT), which steers a frozen pre-trained model to perform the fine-grained retrieval task from the perspectives of sample prompting and feature adaptation.

Retrieval

Paper
Add Code

MonoDistill: Learning Spatial Features for Monocular 3D Object Detection

1 code implementation • ICLR 2022 • Zhiyu Chong, Xinzhu Ma, Hong Zhang, Yuxin Yue, Haojie Li, Zhihui Wang, Wanli Ouyang

Finally, this LiDAR Net can serve as the teacher to transfer the learned knowledge to the baseline model.

Monocular 3D Object Detection Object +2

Paper
Code

An Underwater Image Semantic Segmentation Method Focusing on Boundaries and a Real Underwater Scene Semantic Segmentation Dataset

2 code implementations • 26 Aug 2021 • Zhiwei Ma, Haojie Li, Zhihui Wang, Dan Yu, Tianyi Wang, Yingshuang Gu, Xin Fan, Zhongxuan Luo

Based on this dataset, we propose a semi-supervised underwater semantic segmentation network focusing on the boundaries(US-Net: Underwater Segmentation Network).

Boundary Detection Instance Segmentation +7

Paper
Code

Learning Geometry-Guided Depth via Projective Modeling for Monocular 3D Object Detection

1 code implementation • 29 Jul 2021 • Yinmin Zhang, Xinzhu Ma, Shuai Yi, Jun Hou, Zhihui Wang, Wanli Ouyang, Dan Xu

In this paper, we propose to learn geometry-guided depth estimation with projective modeling to advance monocular 3D object detection.

Ranked #10 on Monocular 3D Object Detection on KITTI Cars Moderate

Autonomous Driving Depth Estimation +4

Paper
Code

2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object Detection

1 code implementation • 16 Jun 2021 • Yueming Zhang, Xiaolin Song, Bing Bai, Tengfei Xing, Chao Liu, Xin Gao, Zhihui Wang, Yawei Wen, Haojin Liao, Guoshan Zhang, Pengfei Xu

In an autonomous driving system, it is essential to recognize vehicles, pedestrians and cyclists from images.

Ranked #1 on Object Detection on Waymo Open Dataset

Autonomous Driving object-detection +1

1,971

Paper
Code

2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object Detection

1 code implementation • The CVPR 2021 Workshop on Autonomous Driving (WAD) 2021 • Yueming Zhang, Xiaolin Song, Bing Bai, Tengfei Xing, Chao Liu, Xin Gao, Zhihui Wang, Yawei Wen, Haojin Liao, Guoshan Zhang, Pengfei Xu

In an autonomous driving system, it is essential to recognize vehicles, pedestrians and cyclists from images.

Autonomous Driving object-detection +1

1,971

Paper
Code

A Dataset And Benchmark Of Underwater Object Detection For Robot Picking

no code implementations • 10 Jun 2021 • Chongwei Liu, Haojie Li, Shuchang Wang, Ming Zhu, Dong Wang, Xin Fan, Zhihui Wang

Towards these challenges we introduce a dataset, Detecting Underwater Objects (DUO), and a corresponding benchmark, based on the collection and re-annotation of all relevant datasets.

object-detection Object Detection

Paper
Add Code

Learning Scene Structure Guidance via Cross-Task Knowledge Transfer for Single Depth Super-Resolution

no code implementations • CVPR 2021 • Baoli Sun, Xinchen Ye, Baopu Li, Haojie Li, Zhihui Wang, Rui Xu

First, we design a cross-task distillation scheme that encourages DSR and DE networks to learn from each other in a teacher-student role-exchanging fashion.

Depth Estimation Super-Resolution +1

Paper
Add Code

Quality-Aware Network for Human Parsing

1 code implementation • 10 Mar 2021 • Lu Yang, Qing Song, Zhihui Wang, Zhiwei Liu, Songcen Xu, Zhihao LI

How to estimate the quality of the network output is an important issue, and currently there is no effective solution in the field of human parsing.

Human Parsing Instance Segmentation +1

Paper
Code

A Unified Joint Maximum Mean Discrepancy for Domain Adaptation

no code implementations • 25 Jan 2021 • Wei Wang, Baopu Li, Shuhui Yang, Jing Sun, Zhengming Ding, Junyang Chen, Xiao Dong, Zhihui Wang, Haojie Li

From the revealed unified JMMD, we illustrate that JMMD degrades the feature-label dependence (discriminability) that benefits to classification, and it is sensitive to the label distribution shift when the label kernel is the weighted class conditional one.

Domain Adaptation

Paper
Add Code

Direct Depth Learning Network for Stereo Matching

no code implementations • 10 Dec 2020 • Hong Zhang, Haojie Li, Shenglun Chen, Tiantian Yan, Zhihui Wang, Guo Lu, Wanli Ouyang

To make the Adaptive-Grained Depth Refinement stage robust to the coarse depth and adaptive to the depth range of the points, the Granularity Uncertainty is introduced to Adaptive-Grained Depth Refinement stage.

Autonomous Driving Depth Estimation +1

Paper
Add Code

Full Matching on Low Resolution for Disparity Estimation

no code implementations • 10 Dec 2020 • Hong Zhang, Shenglun Chen, Zhihui Wang, Haojie Li, Wanli Ouyang

To this end, we first propose to decompose the full matching task into multiple stages of the cost aggregation module.

Disparity Estimation

Paper
Add Code

Character randomized benchmarking for non-multiplicity-free groups with applications to subspace, leakage, and matchgate randomized benchmarking

1 code implementation • 30 Oct 2020 • Jahan Claes, Eleanor Rieffel, Zhihui Wang

Finally, we derive a scalable RB protocol for the matchgate group, a group that like the Clifford group is non-universal but becomes universal with the addition of one additional gate.

Quantum Physics

Paper
Code

Category-specific Semantic Coherency Learning for Fine-grained Image Recognition

no code implementations • 12 Oct 2020 • Shijie Wang, Zhihui Wang, Haojie Li, Wanli Ouyang

Existing deep learning based weakly supervised fine-grained image recognition (WFGIR) methods usually pick out the discriminative regions from the high-level feature (HLF) maps directly.

Attribute Fine-Grained Image Recognition

Paper
Add Code

Renovating Parsing R-CNN for Accurate Multiple Human Parsing

1 code implementation • ECCV 2020 • Lu Yang, Qing Song, Zhihui Wang, Mengjie Hu, Chun Liu, Xueshi Xin, Wenhe Jia, Songcen Xu

Multiple human parsing aims to segment various human parts and associate each part with the corresponding instance simultaneously.

Human Parsing

Paper
Code

Rethink Maximum Mean Discrepancy for Domain Adaptation

no code implementations • 1 Jul 2020 • Wei Wang, Haojie Li, Zhengming Ding, Zhihui Wang

On the other hand, we design two different strategies to boost the feature discriminability: 1) we directly impose a trade-off parameter on the implicit intra-class distance in MMD to regulate its change; 2) we impose the similar weights revealed in MMD on inter-class distance and maximize it, then a balanced factor could be introduced to quantitatively leverage the relative importance between the feature transferability and its discriminability.

Domain Adaptation

Paper
Add Code

Weakly Supervised Fine-Grained Image Classification via Guassian Mixture Model Oriented Discriminative Learning

no code implementations • CVPR 2020 • Zhihui Wang, Shijie Wang, Shuhui Yang, Haojie Li, Jianjun Li, Zezhou Li

Existing weakly supervised fine-grained image recognition (WFGIR) methods usually pick out the discriminative regions from the high-level feature maps directly.

Ranked #14 on Fine-Grained Image Classification on FGVC Aircraft

Fine-Grained Image Classification Fine-Grained Image Recognition +1

Paper
Add Code

Sparsely-Labeled Source Assisted Domain Adaptation

no code implementations • 8 May 2020 • Wei Wang, Zhihui Wang, Yuankai Xiang, Jing Sun, Haojie Li, Fuming Sun, Zhengming Ding

However, there are usually a large number of unlabeled data but only a few labeled data in the source domain, and how to transfer knowledge from this sparsely-labeled source domain to the target domain is still a challenge, which greatly limits their application in the wild.

Clustering Domain Adaptation

Paper
Add Code

Location-Aware Feature Selection Text Detection Network

no code implementations • 23 Apr 2020 • Zengyuan Guo, Zilin Wang, Zhihui Wang, Wanli Ouyang, Haojie Li, Wen Gao

However, they are behind in accuracy comparing with recent segmentation-based text detectors.

feature selection regression +2

Paper
Add Code

CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection

1 code implementation • 7 Mar 2020 • Bin Zhu, Qing Song, Lu Yang, Zhihui Wang, Chun Liu, Mengjie Hu

In object detection, offset-guided and point-guided regression dominate anchor-based and anchor-free method separately.

object-detection Object Detection

Paper
Code

A New Dataset, Poisson GAN and AquaNet for Underwater Object Grabbing

no code implementations • 3 Mar 2020 • Chongwei Liu, Zhihui Wang, Shijie Wang, Tao Tang, Yulong Tao, Caifei Yang, Haojie Li, Xing Liu, Xin Fan

We also propose a novel Poisson-blending Generative Adversarial Network (Poisson GAN) and an efficient object detection network (AquaNet) to address two common issues within related datasets: the class-imbalance problem and the problem of mass small object, respectively.

4k Generative Adversarial Network +2

Paper
Add Code

Planning for Compilation of a Quantum Algorithm for Graph Coloring

no code implementations • 23 Feb 2020 • Minh Do, Zhihui Wang, Bryan O'Gorman, Davide Venturelli, Eleanor Rieffel, Jeremy Frank

Previous work demonstrated that temporal planning is an attractive approach for part of this compilationtask, specifically, the routing of circuits that implement the Quantum Alternating Operator Ansatz (QAOA) applied to the MaxCut problem on a quantum processor architecture.

Paper
Add Code

Graph-propagation based Correlation Learning for Weakly Supervised Fine-grained Image Classification

no code implementations • AAAI-2020 2020 • Zhihui Wang, Shijie Wang, Haojie Li, Zhi Dou, Jianjun Li

The key of Weakly Supervised Fine-grained Image Classification (WFGIC) is how to pick out the discriminative regions and learn the discriminative features from them.

Ranked #25 on Fine-Grained Image Classification on FGVC Aircraft

Fine-Grained Image Classification General Classification

Paper
Add Code

Importance Filtered Cross-Domain Adaptation

no code implementations • 24 Dec 2019 • Wei Wang, Haojie Li, Zhihui Wang, Jing Sun, Zhengming Ding, Fuming Sun

Firstly, an importance filtered mechanism is devised to generate filtered soft labels to mitigate negative transfer desirably.

Domain Adaptation Object Recognition

Paper
Add Code

Accurate Monocular Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving

no code implementations • 27 Mar 2019 • Xinzhu Ma, Zhihui Wang, Haojie Li, Peng-Bo Zhang, Xin Fan, Wanli Ouyang

To this end, we first leverage a stand-alone module to transform the input data from 2D image plane to 3D point clouds space for a better input representation, then we perform the 3D detection using PointNet backbone net to obtain objects 3D locations, dimensions and orientations.

3D Reconstruction Autonomous Driving +2

Paper
Add Code

Parsing R-CNN for Instance-Level Human Analysis

2 code implementations • CVPR 2019 • Lu Yang, Qing Song, Zhihui Wang, Ming Jiang

Models need to distinguish different human instances in the image panel and learn rich features to represent the details of each instance.

Ranked #1 on Pose Estimation on DensePose-COCO

Human Part Segmentation Multi-Human Parsing +1

297

Paper
Code

User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks

2 code implementations • 9 Aug 2018 • Yuanzheng Ci, Xinzhu Ma, Zhihui Wang, Haojie Li, Zhongxuan Luo

Scribble colors based line art colorization is a challenging computer vision problem since neither greyscale values nor semantic information is presented in line arts, and the lack of authentic illustration-line art training pairs also increases difficulty of model generalization.

Benchmarking Line Art Colorization

Paper
Code

A Single Shot Text Detector with Scale-adaptive Anchors

no code implementations • 5 Jul 2018 • Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo

Currently, most top-performing text detection networks tend to employ fixed-size anchor boxes to guide the search for text instances.

Computational Efficiency Text Detection

Paper
Add Code

From the Quantum Approximate Optimization Algorithm to a Quantum Alternating Operator Ansatz

no code implementations • 11 Sep 2017 • Stuart Hadfield, Zhihui Wang, Bryan O'Gorman, Eleanor G. Rieffel, Davide Venturelli, Rupak Biswas

The essence of this extension, the Quantum Alternating Operator Ansatz, is the consideration of general parametrized families of unitaries rather than only those corresponding to the time-evolution under a fixed local Hamiltonian for a time specified by the parameter.

Quantum Physics

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.