Search Results for author: Zhi Tang

Found 30 papers, 17 papers with code

Mutual Enhancement for Detection of Multiple Logos in Sports Videos

no code implementations ICCV 2017 Yuan Liao, Xiaoqing Lu, Chengcui Zhang, Yongtao Wang, Zhi Tang

Mutual enhancement is also included in our frame propagation mechanism that improves logo detection by utilizing the continuity of logos across frames.

object-detection Object Detection +1

Deep Dual Pyramid Network for Barcode Segmentation using Barcode-30k Database

no code implementations31 Jul 2018 Qijie Zhao, Feng Ni, Yang song, Yongtao Wang, Zhi Tang

Specifically, a synthesizing method was proposed to generate well-annotated images containing barcode and QR code labels, which contributes to largely decrease the annotation time.

Segmentation Semantic Segmentation

M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network

12 code implementations12 Nov 2018 Qijie Zhao, Tao Sheng, Yongtao Wang, Zhi Tang, Ying Chen, Ling Cai, Haibin Ling

Finally, we gather up the decoder layers with equivalent scales (sizes) to develop a feature pyramid for object detection, in which every feature map consists of the layers (features) from multiple levels.

Object object-detection +1

Graph Attribute Aggregation Network with Progressive Margin Folding

no code implementations14 May 2019 Penghui Sun, Jingwei Qu, Xiaoqing Lyu, Haibin Ling, Zhi Tang

Graph convolutional neural networks (GCNNs) have been attracting increasing research attention due to its great potential in inference over graph structures.

Attribute

TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning

1 code implementation30 Aug 2019 Chenrui Zhang, Xiaoqing Lyu, Zhi Tang

A dual relation propagation approach is proposed, where relations captured by the generated graph are separately propagated from the seen and unseen subgraphs.

Few-Shot Learning Graph Attention +3

CBNet: A Novel Composite Backbone Network Architecture for Object Detection

6 code implementations9 Sep 2019 Yudong Liu, Yongtao Wang, Siwei Wang, Ting-Ting Liang, Qijie Zhao, Zhi Tang, Haibin Ling

In existing CNN based detectors, the backbone network is a very important component for basic feature extraction, and the performance of the detectors highly depends on it.

Instance Segmentation object-detection +2

Automatic Generation of Headlines for Online Math Questions

1 code implementation27 Nov 2019 Ke Yuan, Dafang He, Zhuoren Jiang, Liangcai Gao, Zhi Tang, C. Lee Giles

Compared to conventional summarization tasks, this task has two extra and essential constraints: 1) Detailed math questions consist of text and math equations which require a unified framework to jointly model textual and mathematical information; 2) Unlike text, math equations contain semantic and structural features, and both of them should be captured together.

Math

MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion

no code implementations19 Jan 2020 Kaiyu Shan, Yongtao Wang, Zhuoying Wang, TingTing Liang, Zhi Tang, Ying Chen, Yangyan Li

To efficiently extract spatiotemporal features of video for action recognition, most state-of-the-art methods integrate 1D temporal convolution into a conventional 2D CNN backbone.

Action Recognition

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Pixel Labeling

1 code implementation27 May 2020 Zhuoying Wang, Yongtao Wang, Zhi Tang, Yangyan Li, Ying Chen, Haibin Ling, Weisi Lin

Existing CNN-based methods for pixel labeling heavily depend on multi-scale features to meet the requirements of both semantic comprehension and detail preservation.

Pose Estimation Semantic Segmentation

ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition

no code implementations23 Dec 2020 Zuoyu Yan, Xiaode Zhang, Liangcai Gao, Ke Yuan, Zhi Tang

Despite the recent advances in optical character recognition (OCR), mathematical expressions still face a great challenge to recognize due to their two-dimensional graphical layout.

Optical Character Recognition Optical Character Recognition (OCR)

OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection

1 code implementation CVPR 2021 TingTing Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling

Encouraged by the success, we propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy.

Neural Architecture Search object-detection +1

RPATTACK: Refined Patch Attack on General Object Detectors

1 code implementation23 Mar 2021 Hao Huang, Yongtao Wang, Zhaoyu Chen, Zhi Tang, Wenqiang Zhang, Kai-Kuang Ma

Firstly, we propose a patch selection and refining scheme to find the pixels which have the greatest importance for attack and remove the inconsequential perturbations gradually.

Object

Community-based Cyberreading for Information Understanding

no code implementations27 Mar 2021 Zhuoren Jiang, Xiaozhong Liu, Liangcai Gao, Zhi Tang

Although the content in scientific publications is increasingly challenging, it is necessary to investigate another important problem, that of scientific information understanding.

Learning-To-Rank

Automatic Description Construction for Math Expression via Topic Relation Graph

no code implementations24 Apr 2021 Ke Yuan, Zuoyu Yan, Yibo Li, Liangcai Gao, Zhi Tang

In the Selector, a Topic Relation Graph (TRG) is proposed to obtain the relevant documents which contain the comprehensive information of math expressions.

Math Relation

MathBERT: A Pre-Trained Model for Mathematical Formula Understanding

no code implementations2 May 2021 Shuai Peng, Ke Yuan, Liangcai Gao, Zhi Tang

Large-scale pre-trained models like BERT, have obtained a great success in various Natural Language Processing (NLP) tasks, while it is still a challenge to adapt them to the math-related tasks.

Headline Generation Information Retrieval +4

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes

1 code implementation23 May 2021 Hao Huang, Yongtao Wang, Zhaoyu Chen, Yuze Zhang, Yuheng Li, Zhi Tang, Wei Chu, Jingdong Chen, Weisi Lin, Kai-Kuang Ma

Then, we design a two-level perturbation fusion strategy to alleviate the conflict between the adversarial watermarks generated by different facial images and models.

Adversarial Attack Face Swapping +1

CBNet: A Composite Backbone Network Architecture for Object Detection

5 code implementations1 Jul 2021 TingTing Liang, Xiaojie Chu, Yudong Liu, Yongtao Wang, Zhi Tang, Wei Chu, Jingdong Chen, Haibin Ling

With multi-scale testing, we push the current best single model result to a new record of 60. 1% box AP and 52. 3% mask AP without using extra training data.

Ranked #6 on Object Detection on COCO-O (using extra training data)

Instance Segmentation Object +2

Adaptive Edge Attention for Graph Matching with Outliers

2 code implementations International Joint Conference on Artificial Intelligence 2021 Jingwei Qu, Haibin Ling, Chenrui Zhang, Xiaoqing Lyu, Zhi Tang

To explore the potential of edges, EAGM learns edge attention on the assignment graph to 1) reveal the impact of each edge on graph matching, as well as 2) adjust the learning of edge representations adaptively.

Ranked #10 on Graph Matching on PASCAL VOC (matching accuracy metric)

Edge Classification Graph Matching

Cycle Representation Learning for Inductive Relation Prediction

1 code implementation6 Oct 2021 Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen

In this paper, we consider rules as cycles and show that the space of cycles has a unique structure based on the mathematics of algebraic topology.

Graph Representation Learning Inductive Relation Prediction +1

Neural Approximation of Graph Topological Features

1 code implementation28 Jan 2022 Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Yusu Wang, Chao Chen

Topological features based on persistent homology capture high-order structural information so as to augment graph neural network methods.

Graph Learning Graph Representation Learning +1

FlowNAS: Neural Architecture Search for Optical Flow Estimation

1 code implementation4 Jul 2022 Zhiwei Lin, TingTing Liang, Taihong Xiao, Yongtao Wang, Zhi Tang, Ming-Hsuan Yang

To address this issue, we propose a neural architecture search method named FlowNAS to automatically find the better encoder architecture for flow estimation task.

Image Classification Neural Architecture Search +1

Efficiently Counting Substructures by Subgraph GNNs without Running GNN on Subgraphs

1 code implementation19 Mar 2023 Zuoyu Yan, Junru Zhou, Liangcai Gao, Zhi Tang, Muhan Zhang

Among these works, a popular way is to use subgraph GNNs, which decompose the input graph into a collection of subgraphs and enhance the representation of the graph by applying GNN to individual subgraphs.

Graph Learning

Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution

no code implementations22 Nov 2023 Yuxuan Zhou, Liangcai Gao, Zhi Tang, Baole Wei

Scene Text Image Super-Resolution (STISR) aims to enhance the resolution and legibility of text within low-resolution (LR) images, consequently elevating recognition accuracy in Scene Text Recognition (STR).

Denoising Image Super-Resolution +1

Cycle Invariant Positional Encoding for Graph Representation Learning

1 code implementation24 Nov 2023 Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen, Yusu Wang

To efficiently encode the space of all cycles, we start with a cycle basis (i. e., a minimal set of cycles generating the cycle space) which we compute via the kernel of the 1-dimensional Hodge Laplacian of the input graph.

Graph Learning Graph Representation Learning

Combining Impression Feature Representation for Multi-turn Conversational Question Answering

no code implementations CCL 2020 Shaoling Jing, Shibo Hong, Dongyan Zhao, Haihua Xie, Zhi Tang

Multi-turn conversational Question Answering (ConvQA) is a practical task that requires the understanding of conversation history, such as previous QA pairs, the passage context, and current question.

Conversational Question Answering feature selection

Cannot find the paper you are looking for? You can Submit a new open access paper.