Search Results for author: Zhi Tang

Found 30 papers, 17 papers with code

Mutual Enhancement for Detection of Multiple Logos in Sports Videos

no code implementations • ICCV 2017 • Yuan Liao, Xiaoqing Lu, Chengcui Zhang, Yongtao Wang, Zhi Tang

Mutual enhancement is also included in our frame propagation mechanism that improves logo detection by utilizing the continuity of logos across frames.

object-detection Object Detection +1

Paper
Add Code

Deep Dual Pyramid Network for Barcode Segmentation using Barcode-30k Database

no code implementations • 31 Jul 2018 • Qijie Zhao, Feng Ni, Yang song, Yongtao Wang, Zhi Tang

Specifically, a synthesizing method was proposed to generate well-annotated images containing barcode and QR code labels, which contributes to largely decrease the annotation time.

Segmentation Semantic Segmentation

Paper
Add Code

M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network

12 code implementations • 12 Nov 2018 • Qijie Zhao, Tao Sheng, Yongtao Wang, Zhi Tang, Ying Chen, Ling Cai, Haibin Ling

Finally, we gather up the decoder layers with equivalent scales (sizes) to develop a feature pyramid for object detection, in which every feature map consists of the layers (features) from multiple levels.

Ranked #145 on Object Detection on COCO test-dev

Object object-detection +1

Paper
Code

Graph Attribute Aggregation Network with Progressive Margin Folding

no code implementations • 14 May 2019 • Penghui Sun, Jingwei Qu, Xiaoqing Lyu, Haibin Ling, Zhi Tang

Graph convolutional neural networks (GCNNs) have been attracting increasing research attention due to its great potential in inference over graph structures.

Attribute

Paper
Add Code

TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning

1 code implementation • 30 Aug 2019 • Chenrui Zhang, Xiaoqing Lyu, Zhi Tang

A dual relation propagation approach is proposed, where relations captured by the generated graph are separately propagated from the seen and unseen subgraphs.

Few-Shot Learning Graph Attention +3

Paper
Code

CBNet: A Novel Composite Backbone Network Architecture for Object Detection

6 code implementations • 9 Sep 2019 • Yudong Liu, Yongtao Wang, Siwei Wang, Ting-Ting Liang, Qijie Zhao, Zhi Tang, Haibin Ling

In existing CNN based detectors, the backbone network is a very important component for basic feature extraction, and the performance of the detectors highly depends on it.

Ranked #47 on Instance Segmentation on COCO test-dev

Instance Segmentation object-detection +2

27,708

Paper
Code

Automatic Generation of Headlines for Online Math Questions

1 code implementation • 27 Nov 2019 • Ke Yuan, Dafang He, Zhuoren Jiang, Liangcai Gao, Zhi Tang, C. Lee Giles

Compared to conventional summarization tasks, this task has two extra and essential constraints: 1) Detailed math questions consist of text and math equations which require a unified framework to jointly model textual and mathematical information; 2) Unlike text, math equations contain semantic and structural features, and both of them should be captured together.

Math

Paper
Code

MFPN: A Novel Mixture Feature Pyramid Network of Multiple Architectures for Object Detection

no code implementations • 20 Dec 2019 • Ting-Ting Liang, Yongtao Wang, Qijie Zhao, huan zhang, Zhi Tang, Haibin Ling

Feature pyramids are widely exploited in many detectors to solve the scale variation problem for object detection.

object-detection Object Detection

Paper
Add Code

MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion

no code implementations • 19 Jan 2020 • Kaiyu Shan, Yongtao Wang, Zhuoying Wang, TingTing Liang, Zhi Tang, Ying Chen, Yangyan Li

To efficiently extract spatiotemporal features of video for action recognition, most state-of-the-art methods integrate 1D temporal convolution into a conventional 2D CNN backbone.

Action Recognition

Paper
Add Code

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Pixel Labeling

1 code implementation • 27 May 2020 • Zhuoying Wang, Yongtao Wang, Zhi Tang, Yangyan Li, Ying Chen, Haibin Ling, Weisi Lin

Existing CNN-based methods for pixel labeling heavily depend on multi-scale features to meet the requirements of both semantic comprehension and detail preservation.

Pose Estimation Semantic Segmentation

Paper
Code

ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition

no code implementations • 23 Dec 2020 • Zuoyu Yan, Xiaode Zhang, Liangcai Gao, Ke Yuan, Zhi Tang

Despite the recent advances in optical character recognition (OCR), mathematical expressions still face a great challenge to recognize due to their two-dimensional graphical layout.

Optical Character Recognition Optical Character Recognition (OCR)

Paper
Add Code

Link Prediction with Persistent Homology: An Interactive View

1 code implementation • 20 Feb 2021 • Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen

Link prediction is an important learning task for graph-structured data.

Graph Learning Link Prediction

Paper
Code

OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection

1 code implementation • CVPR 2021 • TingTing Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling

Encouraged by the success, we propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy.

Neural Architecture Search object-detection +1

Paper
Code

RPATTACK: Refined Patch Attack on General Object Detectors

1 code implementation • 23 Mar 2021 • Hao Huang, Yongtao Wang, Zhaoyu Chen, Zhi Tang, Wenqiang Zhang, Kai-Kuang Ma

Firstly, we propose a patch selection and refining scheme to find the pixels which have the greatest importance for attack and remove the inconsequential perturbations gradually.

Object

Paper
Code

Community-based Cyberreading for Information Understanding

no code implementations • 27 Mar 2021 • Zhuoren Jiang, Xiaozhong Liu, Liangcai Gao, Zhi Tang

Although the content in scientific publications is increasingly challenging, it is necessary to investigate another important problem, that of scientific information understanding.

Learning-To-Rank

Paper
Add Code

Automatic Description Construction for Math Expression via Topic Relation Graph

no code implementations • 24 Apr 2021 • Ke Yuan, Zuoyu Yan, Yibo Li, Liangcai Gao, Zhi Tang

In the Selector, a Topic Relation Graph (TRG) is proposed to obtain the relevant documents which contain the comprehensive information of math expressions.

Math Relation

Paper
Add Code

MathBERT: A Pre-Trained Model for Mathematical Formula Understanding

no code implementations • 2 May 2021 • Shuai Peng, Ke Yuan, Liangcai Gao, Zhi Tang

Large-scale pre-trained models like BERT, have obtained a great success in various Natural Language Processing (NLP) tasks, while it is still a challenge to adapt them to the math-related tasks.

Headline Generation Information Retrieval +4

Paper
Add Code

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes

1 code implementation • 23 May 2021 • Hao Huang, Yongtao Wang, Zhaoyu Chen, Yuze Zhang, Yuheng Li, Zhi Tang, Wei Chu, Jingdong Chen, Weisi Lin, Kai-Kuang Ma

Then, we design a two-level perturbation fusion strategy to alleviate the conflict between the adversarial watermarks generated by different facial images and models.

Adversarial Attack Face Swapping +1

Paper
Code

CBNet: A Composite Backbone Network Architecture for Object Detection

5 code implementations • 1 Jul 2021 • TingTing Liang, Xiaojie Chu, Yudong Liu, Yongtao Wang, Zhi Tang, Wei Chu, Jingdong Chen, Haibin Ling

With multi-scale testing, we push the current best single model result to a new record of 60. 1% box AP and 52. 3% mask AP without using extra training data.

Ranked #6 on Object Detection on COCO-O (using extra training data)

Instance Segmentation Object +2

12,029

Paper
Code

Adaptive Edge Attention for Graph Matching with Outliers

2 code implementations • International Joint Conference on Artificial Intelligence 2021 • Jingwei Qu, Haibin Ling, Chenrui Zhang, Xiaoqing Lyu, Zhi Tang

To explore the potential of edges, EAGM learns edge attention on the assignment graph to 1) reveal the impact of each edge on graph matching, as well as 2) adjust the learning of edge representations adaptively.

Ranked #10 on Graph Matching on PASCAL VOC (matching accuracy metric)

Edge Classification Graph Matching

Paper
Code

A Topological View of Rule Learning in Knowledge Graphs

no code implementations • 29 Sep 2021 • Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen

We propose to collect cycle bases that span the space of cycles.

Inductive Relation Prediction Relation

Paper
Add Code

Cycle Representation Learning for Inductive Relation Prediction

1 code implementation • 6 Oct 2021 • Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen

In this paper, we consider rules as cycles and show that the space of cycles has a unique structure based on the mathematics of algebraic topology.

Graph Representation Learning Inductive Relation Prediction +1

Paper
Code

Neural Approximation of Graph Topological Features

1 code implementation • 28 Jan 2022 • Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Yusu Wang, Chao Chen

Topological features based on persistent homology capture high-order structural information so as to augment graph neural network methods.

Graph Learning Graph Representation Learning +1

Paper
Code

BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework

2 code implementations • 27 May 2022 • TingTing Liang, Hongwei Xie, Kaicheng Yu, Zhongyu Xia, Zhiwei Lin, Yongtao Wang, Tao Tang, Bing Wang, Zhi Tang

Fusing the camera and LiDAR information has become a de-facto standard for 3D object detection tasks.

3D Object Detection Autonomous Driving +1

651

Paper
Code

FlowNAS: Neural Architecture Search for Optical Flow Estimation

1 code implementation • 4 Jul 2022 • Zhiwei Lin, TingTing Liang, Taihong Xiao, Yongtao Wang, Zhi Tang, Ming-Hsuan Yang

To address this issue, we propose a neural architecture search method named FlowNAS to automatically find the better encoder architecture for flow estimation task.

Image Classification Neural Architecture Search +1

Paper
Code

Efficiently Counting Substructures by Subgraph GNNs without Running GNN on Subgraphs

1 code implementation • 19 Mar 2023 • Zuoyu Yan, Junru Zhou, Liangcai Gao, Zhi Tang, Muhan Zhang

Among these works, a popular way is to use subgraph GNNs, which decompose the input graph into a collection of subgraphs and enhance the representation of the graph by applying GNN to individual subgraphs.

Graph Learning

Paper
Code

Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution

no code implementations • 22 Nov 2023 • Yuxuan Zhou, Liangcai Gao, Zhi Tang, Baole Wei

Scene Text Image Super-Resolution (STISR) aims to enhance the resolution and legibility of text within low-resolution (LR) images, consequently elevating recognition accuracy in Scene Text Recognition (STR).

Denoising Image Super-Resolution +1

Paper
Add Code

Cycle Invariant Positional Encoding for Graph Representation Learning

1 code implementation • 24 Nov 2023 • Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen, Yusu Wang

To efficiently encode the space of all cycles, we start with a cycle basis (i. e., a minimal set of cycles generating the cycle space) which we compute via the kernel of the 1-dimensional Hodge Laplacian of the input graph.

Graph Learning Graph Representation Learning

Paper
Code

基于数据增强和多任务特征学习的中文语法错误检测方法(Chinese Grammar Error Detection based on Data Enhancement and Multi-task Feature Learning)

no code implementations • CCL 2020 • Haihua Xie, Zhiyou Chen, Jing Cheng, Xiaoqing Lyu, Zhi Tang

Paper
Add Code

Combining Impression Feature Representation for Multi-turn Conversational Question Answering

no code implementations • CCL 2020 • Shaoling Jing, Shibo Hong, Dongyan Zhao, Haihua Xie, Zhi Tang

Multi-turn conversational Question Answering (ConvQA) is a practical task that requires the understanding of conversation history, such as previous QA pairs, the passage context, and current question.

Conversational Question Answering feature selection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.