Search Results for author: Zhongang Qi

Found 16 papers, 4 papers with code

Weakly-supervised Action Localization via Hierarchical Mining

no code implementations • 22 Jun 2022 • Jia-Chang Feng, Fa-Ting Hong, Jia-Run Du, Zhongang Qi, Ying Shan, XiaoHu Qie, Wei-Shi Zheng, Jianping Wu

In this work, we propose a hierarchical mining strategy at the video level and the snippet level, i.e., hierarchical supervision and hierarchical consistency mining, to maximize the usage of the given annotations and prediction-wise consistency.

Action Localization, Multiple Instance Learning, +2

Efficient U-Transformer with Boundary-Aware Loss for Action Segmentation

no code implementations • 26 May 2022 • Dazhao Du, Bing Su, Yu Li, Zhongang Qi, Lingyu Si, Ying Shan

Most state-of-the-art methods focus on designing temporal convolution-based models, but the difficulty of modeling long-term temporal dependencies and the inflexibility of temporal convolutions limit the potential of these models.

Action Classification, Action Segmentation, +1

Accelerating the Training of Video Super-Resolution Models

no code implementations • 10 May 2022 • Lijian Lin, Xintao Wang, Zhongang Qi, Ying Shan

In this work, we show that it is possible to gradually train video models from small to large spatial/temporal sizes, i.e., in an easy-to-hard manner.

Video Super-Resolution

CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation

no code implementations • 31 Mar 2022 • Ziqi Zhang, Yuxin Chen, Zongyang Ma, Zhongang Qi, Chunfeng Yuan, Bing Li, Ying Shan, Weiming Hu

In this paper, we propose CREATE, the first large-scale Chinese shoRt vidEo retrievAl and Title gEneration benchmark, to facilitate research and application in video titling and video retrieval in Chinese.

Video Captioning, Video Retrieval

BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild

no code implementations • CVPR 2022 • Xixi Xu, Zhongang Qi, Jianqi Ma, Honglun Zhang, Ying Shan, XiaoHu Qie

Current research mainly focuses on English characters and digits, while few works study Chinese characters due to the lack of public, large-scale, high-quality Chinese datasets, which limits the practical application scenarios of text segmentation.

Style Transfer, Text Segmentation, +1

From Heatmaps to Structural Explanations of Image Classifiers

no code implementations • 13 Sep 2021 • Li Fuxin, Zhongang Qi, Saeed Khorram, Vivswan Shitole, Prasad Tadepalli, Minsuk Kahng, Alan Fern

This paper summarizes our endeavors in the past few years in terms of explaining image classifiers, with the aim of including negative results and insights we have gained.

Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution

1 code implementation • NeurIPS 2021 • Liangbin Xie, Xintao Wang, Chao Dong, Zhongang Qi, Ying Shan

Unlike previous integral gradient methods, our FAIG aims at finding the most discriminative filters instead of input pixels/features for degradation removal in blind SR networks.

Blind Super-Resolution, Super-Resolution

Stochastic Block-ADMM for Training Deep Networks

no code implementations • 1 May 2021 • Saeed Khorram, Xiao Fu, Mohamad H. Danesh, Zhongang Qi, Li Fuxin

We prove the convergence of our proposed method and justify its capabilities through experiments in supervised and weakly-supervised settings.

Open-book Video Captioning with Retrieve-Copy-Generate Network

no code implementations • CVPR 2021 • Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Ying Deng, Weiming Hu

Due to the rapid emergence of short videos and the requirement for content understanding and creation, the video captioning task has received increasing attention in recent years.

Video Captioning

Visualizing Point Cloud Classifiers by Curvature Smoothing

1 code implementation • 23 Nov 2019 • Chen Ziwen, Wenxuan Wu, Zhongang Qi, Li Fuxin

In this paper, we propose a novel approach to visualize the features that are important to point cloud classifiers.

Data Augmentation, General Classification

Visualizing Deep Networks by Optimizing with Integrated Gradients

1 code implementation • 2 May 2019 • Zhongang Qi, Saeed Khorram, Li Fuxin

Understanding and interpreting the decisions made by deep learning models is valuable in many domains.

Interactive Naming for Explaining Deep Neural Networks: A Formative Study

no code implementations • 18 Dec 2018 • Mandana Hamidi-Haines, Zhongang Qi, Alan Fern, Fuxin Li, Prasad Tadepalli

For this purpose, we developed a user interface for "interactive naming," which allows a human annotator to manually cluster significant activation maps in a test set into meaningful groups called "visual concepts".

General Classification

PointConv: Deep Convolutional Networks on 3D Point Clouds

9 code implementations • CVPR 2019 • Wenxuan Wu, Zhongang Qi, Li Fuxin

Besides, our experiments converting CIFAR-10 into a point cloud showed that networks built on PointConv can match the performance of convolutional networks in 2D images of a similar structure.

3D Part Segmentation, 3D Point Cloud Classification, +1

Deep Air Learning: Interpolation, Prediction, and Feature Analysis of Fine-grained Air Quality

no code implementations • 2 Nov 2017 • Zhongang Qi, Tianchun Wang, Guojie Song, Weisong Hu, Xi Li, Zhongfei Zhang

The interpolation, prediction, and feature analysis of fine-grained air quality are three important topics in the area of urban air computing.

Embedding Deep Networks into Visual Explanations

no code implementations • 15 Sep 2017 • Zhongang Qi, Saeed Khorram, Fuxin Li

The XNN works by learning a nonlinear embedding of a high-dimensional activation vector of a deep network layer into a low-dimensional explanation space while retaining faithfulness, i.e., the original deep learning predictions can be constructed from the few concepts extracted by our explanation network.

Image Classification
