Search Results for author: Zheng Zhang

Found 280 papers, 143 papers with code

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

71 code implementations • ICCV 2021 • Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo

This paper presents a new vision Transformer, called Swin Transformer, that capably serves as a general-purpose backbone for computer vision.

Ranked #2 on Image Classification on OmniBenchmark

Image Classification Instance Segmentation +3

125,545

Paper
Code

CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset

2 code implementations • TACL 2020 • Qi Zhu, Kaili Huang, Zheng Zhang, Xiaoyan Zhu, Minlie Huang

To advance multi-domain (cross-domain) dialogue modeling as well as alleviate the shortage of Chinese task-oriented datasets, we propose CrossWOZ, the first large-scale Chinese Cross-Domain Wizard-of-Oz task-oriented dataset.

Dialogue State Tracking Task-Oriented Dialogue Systems +1

64,287

Paper
Code

Swin Transformer V2: Scaling Up Capacity and Resolution

19 code implementations • CVPR 2022 • Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo

Three main techniques are proposed: 1) a residual-post-norm method combined with cosine attention to improve training stability; 2) A log-spaced continuous position bias method to effectively transfer models pre-trained using low-resolution images to downstream tasks with high-resolution inputs; 3) A self-supervised pre-training method, SimMIM, to reduce the needs of vast labeled images.

Ranked #4 on Image Classification on ImageNet V2 (using extra training data)

Action Classification Image Classification +3

29,890

Paper
Code

An Empirical Study of Spatial Attention Mechanisms in Deep Networks

1 code implementation • ICCV 2019 • Xizhou Zhu, Dazhi Cheng, Zheng Zhang, Stephen Lin, Jifeng Dai

Attention mechanisms have become a popular component in deep neural networks, yet there has been little examination of how different influencing factors and methods for computing attention from these factors affect performance.

Decoder

27,894

Paper
Code

MMDetection: Open MMLab Detection Toolbox and Benchmark

144 code implementations • 17 Jun 2019 • Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin

In this paper, we introduce the various features of this toolbox.

Benchmarking Instance Segmentation +2

27,894

Paper
Code

MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems

2 code implementations • 3 Dec 2015 • Tianqi Chen, Mu Li, Yutian Li, Min Lin, Naiyan Wang, Minjie Wang, Tianjun Xiao, Bing Xu, Chiyuan Zhang, Zheng Zhang

This paper describes both the API design and the system implementation of MXNet, and explains how embedding of both symbolic expression and tensor operation is handled in a unified fashion.

BIG-bench Machine Learning Clustering +2

20,714

Paper
Code

Star-Transformer

2 code implementations • NAACL 2019 • Qipeng Guo, Xipeng Qiu, PengFei Liu, Yunfan Shao, xiangyang xue, Zheng Zhang

Although Transformer has achieved great successes on many NLP tasks, its heavy structure with fully-connected attention connections leads to dependencies on large training data.

Ranked #13 on Sentiment Analysis on SST-5 Fine-grained classification

Named Entity Recognition (NER) Natural Language Inference +2

13,030

Paper
Code

Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks

7 code implementations • 3 Sep 2019 • Minjie Wang, Da Zheng, Zihao Ye, Quan Gan, Mufei Li, Xiang Song, Jinjing Zhou, Chao Ma, Lingfan Yu, Yu Gai, Tianjun Xiao, Tong He, George Karypis, Jinyang Li, Zheng Zhang

Advancing research in the emerging field of deep graph learning requires new tools to support tensor computation over graphs.

Ranked #35 on Node Classification on Cora

Graph Learning Node Classification

13,030

Paper
Code

DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs

1 code implementation • 11 Oct 2020 • Da Zheng, Chao Ma, Minjie Wang, Jinjing Zhou, Qidong Su, Xiang Song, Quan Gan, Zheng Zhang, George Karypis

To minimize the overheads associated with distributed computations, DistDGL uses a high-quality and light-weight min-cut graph partitioning algorithm along with multiple balancing constraints.

Fraud Detection graph partitioning

13,030

Paper
Code

Learning Hierarchical Graph Neural Networks for Image Clustering

2 code implementations • ICCV 2021 • Yifan Xing, Tong He, Tianjun Xiao, Yongxin Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto

Our hierarchical GNN uses a novel approach to merge connected components predicted at each level of the hierarchy to form a new graph at the next level.

Clustering Face Clustering

13,030

Paper
Code

BP-Transformer: Modelling Long-Range Context via Binary Partitioning

2 code implementations • 11 Nov 2019 • Zihao Ye, Qipeng Guo, Quan Gan, Xipeng Qiu, Zheng Zhang

The Transformer model is widely successful on many natural language processing tasks.

Ranked #1 on Machine Translation on IWSLT2015 Chinese-English

Language Modelling Machine Translation +4

13,026

Paper
Code

Local Relation Networks for Image Recognition

3 code implementations • ICCV 2019 • Han Hu, Zheng Zhang, Zhenda Xie, Stephen Lin

The convolution layer has been the dominant feature extractor in computer vision for years.

Ranked #867 on Image Classification on ImageNet

General Classification Image Classification +2

13,010

Paper
Code

Self-Supervised Learning with Swin Transformers

6 code implementations • 10 May 2021 • Zhenda Xie, Yutong Lin, Zhuliang Yao, Zheng Zhang, Qi Dai, Yue Cao, Han Hu

We are witnessing a modeling shift from CNN to Transformers in computer vision.

Ranked #75 on Self-Supervised Image Classification on ImageNet

object-detection Object Detection +3

13,010

Paper
Code

On Data Scaling in Masked Image Modeling

1 code implementation • CVPR 2023 • Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu

Our study reveals that: (i) Masked image modeling is also demanding on larger data.

Self-Supervised Learning

13,010

Paper
Code

Disentangled Non-Local Neural Networks

5 code implementations • ECCV 2020 • Minghao Yin, Zhuliang Yao, Yue Cao, Xiu Li, Zheng Zhang, Stephen Lin, Han Hu

This paper first studies the non-local block in depth, where we find that its attention computation can be split into two terms, a whitened pairwise term accounting for the relationship between two pixels and a unary term representing the saliency of every pixel.

Ranked #20 on Semantic Segmentation on Cityscapes test (using extra training data)

Action Recognition object-detection +2

8,273

Paper
Code

Video Swin Transformer

14 code implementations • CVPR 2022 • Ze Liu, Jia Ning, Yue Cao, Yixuan Wei, Zheng Zhang, Stephen Lin, Han Hu

The vision community is witnessing a modeling shift from CNNs to Transformers, where pure Transformer architectures have attained top accuracy on the major video recognition benchmarks.

Ranked #28 on Action Classification on Kinetics-600 (using extra training data)

Action Classification Action Recognition +5

3,916

Paper
Code

DGL-KE: Training Knowledge Graph Embeddings at Scale

1 code implementation • 18 Apr 2020 • Da Zheng, Xiang Song, Chao Ma, Zeyuan Tan, Zihao Ye, Jin Dong, Hao Xiong, Zheng Zhang, George Karypis

Experiments on knowledge graphs consisting of over 86M nodes and 338M edges show that DGL-KE can compute embeddings in 100 minutes on an EC2 instance with 8 GPUs and 30 minutes on an EC2 cluster with 4 machines with 48 cores/machine.

Distributed, Parallel, and Cluster Computing

1,237

Paper
Code

Relation Networks for Object Detection

6 code implementations • CVPR 2018 • Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei

Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era.

Object object-detection +3

1,088

Paper
Code

End-to-End Semi-Supervised Object Detection with Soft Teacher

8 code implementations • ICCV 2021 • Mengde Xu, Zheng Zhang, Han Hu, JianFeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu

This paper presents an end-to-end semi-supervised object detection approach, in contrast to previous more complex multi-stage methods.

Ranked #6 on Semi-Supervised Object Detection on COCO 100% labeled data (using extra training data)

Instance Segmentation object-detection +4

885

Paper
Code

SimMIM: A Simple Framework for Masked Image Modeling

4 code implementations • CVPR 2022 • Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, Han Hu

We also leverage this approach to facilitate the training of a 3B model (SwinV2-G), that by $40\times$ less data than that in previous practice, we achieve the state-of-the-art on four representative vision benchmarks.

Ranked #10 on Self-Supervised Image Classification on ImageNet (finetuned)

Representation Learning Self-Supervised Image Classification +1

870

Paper
Code

ConvLab: Multi-Domain End-to-End Dialog System Platform

2 code implementations • ACL 2019 • Sungjin Lee, Qi Zhu, Ryuichi Takanobu, Xiang Li, Yaoqin Zhang, Zheng Zhang, Jinchao Li, Baolin Peng, Xiujun Li, Minlie Huang, Jianfeng Gao

We present ConvLab, an open-source multi-domain end-to-end dialog system platform, that enables researchers to quickly set up experiments with reusable components and compare a large set of different approaches, ranging from conventional pipeline systems to end-to-end neural models, in common environments.

821

Paper
Code

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training

2 code implementations • 3 Aug 2021 • Hao Zhou, Pei Ke, Zheng Zhang, Yuxian Gu, Yinhe Zheng, Chujie Zheng, Yida Wang, Chen Henry Wu, Hao Sun, Xiaocong Yang, Bosi Wen, Xiaoyan Zhu, Minlie Huang, Jie Tang

Although pre-trained language models have remarkably enhanced the generation ability of dialogue systems, open-domain Chinese dialogue systems are still limited by the dialogue data and the model size compared with English ones.

565

Paper
Code

FP8-LM: Training FP8 Large Language Models

1 code implementation • 27 Oct 2023 • Houwen Peng, Kan Wu, Yixuan Wei, Guoshuai Zhao, Yuxiang Yang, Ze Liu, Yifan Xiong, Ziyue Yang, Bolin Ni, Jingcheng Hu, Ruihang Li, Miaosen Zhang, Chen Li, Jia Ning, Ruizhe Wang, Zheng Zhang, Shuguang Liu, Joe Chau, Han Hu, Peng Cheng

In this paper, we explore FP8 low-bit data formats for efficient training of large language models (LLMs).

459

Paper
Code

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

1 code implementation • ACL 2020 • Qi Zhu, Zheng Zhang, Yan Fang, Xiang Li, Ryuichi Takanobu, Jinchao Li, Baolin Peng, Jianfeng Gao, Xiaoyan Zhu, Minlie Huang

We present ConvLab-2, an open-source toolkit that enables researchers to build task-oriented dialogue systems with state-of-the-art models, perform an end-to-end evaluation, and diagnose the weakness of systems.

Task-Oriented Dialogue Systems

443

Paper
Code

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

1 code implementation • 7 Sep 2023 • Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo

We present InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.

Keypoint Detection

335

Paper
Code

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning

7 code implementations • CVPR 2021 • Zhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu

We argue that the power of contrastive learning has yet to be fully unleashed, as current methods are trained only on instance-level pretext tasks, leading to representations that may be sub-optimal for downstream tasks requiring dense pixel predictions.

Contrastive Learning object-detection +3

328

Paper
Code

EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training

1 code implementation • 17 Mar 2022 • Yuxian Gu, Jiaxin Wen, Hao Sun, Yi Song, Pei Ke, Chujie Zheng, Zheng Zhang, Jianzhu Yao, Lei Liu, Xiaoyan Zhu, Minlie Huang

Large-scale pre-training has shown remarkable performance in building open-domain dialogue systems.

Chatbot

299

Paper
Code

Dense RepPoints: Representing Visual Objects with Dense Point Sets

2 code implementations • ECCV 2020 • Ze Yang, Yinghao Xu, Han Xue, Zheng Zhang, Raquel Urtasun, Li-Wei Wang, Stephen Lin, Han Hu

We present a new object representation, called Dense RepPoints, that utilizes a large set of points to describe an object at multiple levels, including both box level and pixel level.

Object Object Detection

296

Paper
Code

RepPoints V2: Verification Meets Regression for Object Detection

1 code implementation • NeurIPS 2020 • Yihong Chen, Zheng Zhang, Yue Cao, Li-Wei Wang, Stephen Lin, Han Hu

Though RepPoints provides high performance, we find that its heavy reliance on regression for object localization leaves room for improvement.

Ranked #27 on Object Detection on COCO-O

Instance Segmentation Object +6

296

Paper
Code

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

1 code implementation • 11 Oct 2023 • Cunxiang Wang, Xiaoze Liu, Yuanhao Yue, Xiangru Tang, Tianhang Zhang, Cheng Jiayang, Yunzhi Yao, Wenyang Gao, Xuming Hu, Zehan Qi, Yidong Wang, Linyi Yang, Jindong Wang, Xing Xie, Zheng Zhang, Yue Zhang

This survey addresses the crucial issue of factuality in Large Language Models (LLMs).

Retrieval Specificity

284

Paper
Code

Side Adapter Network for Open-Vocabulary Semantic Segmentation

3 code implementations • CVPR 2023 • Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai

A side network is attached to a frozen CLIP model with two branches: one for predicting mask proposals, and the other for predicting attention bias which is applied in the CLIP model to recognize the class of masks.

Ranked #4 on Open Vocabulary Semantic Segmentation on PascalVOC-20

Language Modelling Open Vocabulary Semantic Segmentation +3

272

Paper
Code

TrojanZoo: Towards Unified, Holistic, and Practical Evaluation of Neural Backdoors

1 code implementation • 16 Dec 2020 • Ren Pang, Zheng Zhang, Xiangshan Gao, Zhaohan Xi, Shouling Ji, Peng Cheng, Xiapu Luo, Ting Wang

To bridge this gap, we design and implement TROJANZOO, the first open-source platform for evaluating neural backdoor attacks/defenses in a unified, holistic, and practical manner.

268

Paper
Code

A Closer Look at Local Aggregation Operators in Point Cloud Analysis

1 code implementation • ECCV 2020 • Ze Liu, Han Hu, Yue Cao, Zheng Zhang, Xin Tong

Our investigation reveals that despite the different designs of these operators, all of these operators make surprisingly similar contributions to the network performance under the same network input and feature numbers and result in the state-of-the-art accuracy on standard benchmarks.

Ranked #4 on 3D Semantic Segmentation on PartNet

3D Semantic Segmentation

251

Paper
Code

Group-Free 3D Object Detection via Transformers

4 code implementations • ICCV 2021 • Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong

Instead of grouping local points to each object candidate, our method computes the feature of an object from all the points in the point cloud with the help of an attention mechanism in the Transformers \cite{vaswani2017attention}, where the contribution of each point is automatically learned in the network training.

Ranked #3 on 3D Object Detection on SUN-RGBD

3D Object Detection Object +1

240

Paper
Code

Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours

1 code implementation • 2 Aug 2022 • Eyal Shnarch, Alon Halfon, Ariel Gera, Marina Danilevsky, Yannis Katsis, Leshem Choshen, Martin Santillan Cooper, Dina Epelboim, Zheng Zhang, Dakuo Wang, Lucy Yip, Liat Ein-Dor, Lena Dankin, Ilya Shnayderman, Ranit Aharonov, Yunyao Li, Naftali Liberman, Philip Levin Slesarev, Gwilym Newton, Shila Ofek-Koifman, Noam Slonim, Yoav Katz

Text classification can be useful in many real-world scenarios, saving a lot of time for end users.

Text Classification

240

Paper
Code

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

1 code implementation • 27 May 2022 • Yixuan Wei, Han Hu, Zhenda Xie, Zheng Zhang, Yue Cao, Jianmin Bao, Dong Chen, Baining Guo

These properties, which we aggregately refer to as optimization friendliness, are identified and analyzed by a set of attention- and optimization-related diagnosis tools.

Ranked #2 on Instance Segmentation on COCO test-dev (using extra training data)

Contrastive Learning Image Classification +5

220

Paper
Code

Improving CLIP Fine-tuning Performance

1 code implementation • ICCV 2023 • Yixuan Wei, Han Hu, Zhenda Xie, Ze Liu, Zheng Zhang, Yue Cao, Jianmin Bao, Dong Chen, Baining Guo

Experiments suggest that the feature map distillation approach significantly boosts the fine-tuning performance of CLIP models on several typical downstream vision tasks.

object-detection Object Detection +1

220

Paper
Code

A Unified Generative Framework for Various NER Subtasks

1 code implementation • ACL 2021 • Hang Yan, Tao Gui, Junqi Dai, Qipeng Guo, Zheng Zhang, Xipeng Qiu

To that end, we propose to formulate the NER subtasks as an entity span sequence generation task, which can be solved by a unified sequence-to-sequence (Seq2Seq) framework.

Ranked #10 on Nested Named Entity Recognition on GENIA

named-entity-recognition Named Entity Recognition +2

208

Paper
Code

Efficiently Measuring the Cognitive Ability of LLMs: An Adaptive Testing Perspective

1 code implementation • 18 Jun 2023 • Yan Zhuang, Qi Liu, Yuting Ning, Weizhe Huang, Rui Lv, Zhenya Huang, Guanhao Zhao, Zheng Zhang, Qingyang Mao, Shijin Wang, Enhong Chen

Different tests for different models using efficient adaptive testing -- we believe this has the potential to become a new norm in evaluating large language models.

Mathematical Reasoning

199

Paper
Code

DETR Doesn't Need Multi-Scale or Locality Design

1 code implementation • 3 Aug 2023 • Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu

This paper presents an improved DETR detector that maintains a "plain" nature: using a single-scale feature map and global cross-attention calculations without specific locality constraints, in contrast to previous leading DETR-based detectors that reintroduce architectural inductive biases of multi-scale and locality into the decoder.

Decoder

184

Paper
Code

DETR Does Not Need Multi-Scale or Locality Design

1 code implementation • ICCV 2023 • Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu

Decoder

184

Paper
Code

A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model

2 code implementations • 29 Dec 2021 • Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai

However, semantic segmentation and the CLIP model perform on different visual granularity, that semantic segmentation processes on pixels while CLIP performs on images.

Ranked #2 on Open Vocabulary Semantic Segmentation on COCO-Stuff-171

Image Classification Language Modelling +8

158

Paper
Code

Fusing Hierarchical Convolutional Features for Human Body Segmentation and Clothing Fashion Classification

1 code implementation • 9 Mar 2018 • Zheng Zhang, Chengfang Song, Qin Zou

The clothing fashion reflects the common aesthetics that people share with each other in dressing.

Classification General Classification +1

154

Paper
Code

Revealing the Dark Secrets of Masked Image Modeling

1 code implementation • CVPR 2023 • Zhenda Xie, Zigang Geng, Jingcheng Hu, Zheng Zhang, Han Hu, Yue Cao

In this paper, we compare MIM with the long-dominant supervised pre-trained models from two perspectives, the visualizations and the experiments, to uncover their key representational differences.

Ranked #3 on Depth Estimation on NYU-Depth V2

Inductive Bias Monocular Depth Estimation +3

153

Paper
Code

Segment and Caption Anything

1 code implementation • 1 Dec 2023 • Xiaoke Huang, JianFeng Wang, Yansong Tang, Zheng Zhang, Han Hu, Jiwen Lu, Lijuan Wang, Zicheng Liu

We propose a method to efficiently equip the Segment Anything Model (SAM) with the ability to generate regional captions.

Caption Generation object-detection +2

152

Paper
Code

Hallucination of Multimodal Large Language Models: A Survey

1 code implementation • 29 Apr 2024 • Zechen Bai, Pichao Wang, Tianjun Xiao, Tong He, Zongbo Han, Zheng Zhang, Mike Zheng Shou

By drawing the granular classification and landscapes of hallucination causes, evaluation benchmarks, and mitigation methods, this survey aims to deepen the understanding of hallucinations in MLLMs and inspire further advancements in the field.

Hallucination

150

Paper
Code

TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models

2 code implementations • CVPR 2023 • Sucheng Ren, Fangyun Wei, Zheng Zhang, Han Hu

Our TinyMIM model of tiny size achieves 79. 6% top-1 accuracy on ImageNet-1K image classification, which sets a new record for small vision models of the same size and computation budget.

Image Classification Semantic Segmentation

147

Paper
Code

Negative Margin Matters: Understanding Margin in Few-shot Classification

1 code implementation • ECCV 2020 • Bin Liu, Yue Cao, Yutong Lin, Qi Li, Zheng Zhang, Mingsheng Long, Han Hu

This paper introduces a negative margin loss to metric learning based few-shot learning methods.

Ranked #2 on Few-Shot Image Classification on Mini-ImageNet to CUB - 5 shot learning

Classification Few-Shot Image Classification +3

144

Paper
Code

A Unified Generative Framework for Aspect-Based Sentiment Analysis

3 code implementations • ACL 2021 • Hang Yan, Junqi Dai, Tuo ji, Xipeng Qiu, Zheng Zhang

Aspect-based Sentiment Analysis (ABSA) aims to identify the aspect terms, their corresponding sentiment polarities, and the opinion terms.

Ranked #1 on Aspect Sentiment Triplet Extraction on SemEval

Aspect-Based Sentiment Analysis Aspect-oriented Opinion Extraction +2

140

Paper
Code

PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model

1 code implementation • 21 Mar 2024 • Zheng Zhang, Yeyao Ma, Enming Zhang, Xiang Bai

PSALM is a powerful extension of the Large Multi-modal Model (LMM) to address the segmentation task challenges.

Decoder Generalized Referring Expression Segmentation +6

124

Paper
Code

CoLAKE: Contextualized Language and Knowledge Embedding

1 code implementation • COLING 2020 • Tianxiang Sun, Yunfan Shao, Xipeng Qiu, Qipeng Guo, Yaru Hu, Xuanjing Huang, Zheng Zhang

With the emerging branch of incorporating factual knowledge into pre-trained language models such as BERT, most existing models consider shallow, static, and separately pre-trained entity embeddings, which limits the performance gains of these models.

Entity Embeddings Knowledge Graph Completion +1

114

Paper
Code

ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format

1 code implementation • 30 Nov 2022 • Qi Zhu, Christian Geishauser, Hsien-Chin Lin, Carel van Niekerk, Baolin Peng, Zheng Zhang, Michael Heck, Nurul Lubis, Dazhen Wan, Xiaochen Zhu, Jianfeng Gao, Milica Gašić, Minlie Huang

Task-oriented dialogue (TOD) systems function as digital assistants, guiding users through various tasks such as booking flights or finding restaurants.

Reinforcement Learning (RL) Transfer Learning

Paper
Code

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token

1 code implementation • ICCV 2023 • Jia Ning, Chen Li, Zheng Zhang, Zigang Geng, Qi Dai, Kun He, Han Hu

With these new techniques and other designs, we show that the proposed general-purpose task-solver can perform both instance segmentation and depth estimation well.

Ranked #14 on Monocular Depth Estimation on NYU-Depth V2

Instance Segmentation Monocular Depth Estimation +1

Paper
Code

CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training

2 code implementations • ACL (WebNLG, INLG) 2020 • Qipeng Guo, Zhijing Jin, Xipeng Qiu, Wei-Nan Zhang, David Wipf, Zheng Zhang

Due to the difficulty and high cost of data collection, the supervised data available in the two fields are usually on the magnitude of tens of thousands, for example, 18K in the WebNLG~2017 dataset after preprocessing, which is far fewer than the millions of data for other tasks such as machine translation.

Graph Generation Knowledge Graphs +2

Paper
Code

Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings

1 code implementation • 14 Dec 2020 • Qipeng Guo, Zhijing Jin, Ziyu Wang, Xipeng Qiu, Weinan Zhang, Jun Zhu, Zheng Zhang, David Wipf

Cycle-consistent training is widely used for jointly learning a forward and inverse mapping between two domains of interest without the cumbersome requirement of collecting matched pairs within each domain.

Knowledge Graphs Text Generation

Paper
Code

Exploring Discrete Diffusion Models for Image Captioning

1 code implementation • 21 Nov 2022 • Zixin Zhu, Yixuan Wei, JianFeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu

The image captioning task is typically realized by an auto-regressive method that decodes the text tokens one by one.

Image Captioning Image Generation

Paper
Code

An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph

1 code implementation • 1 Jul 2020 • Jiarui Jin, Jiarui Qin, Yuchen Fang, Kounianhua Du, Wei-Nan Zhang, Yong Yu, Zheng Zhang, Alexander J. Smola

To the best of our knowledge, this is the first work providing an efficient neighborhood-based interaction model in the HIN-based recommendations.

Recommendation Systems

Paper
Code

GraphHINGE: Learning Interaction Models of Structured Neighborhood on Heterogeneous Information Network

1 code implementation • 25 Nov 2020 • Jiarui Jin, Kounianhua Du, Weinan Zhang, Jiarui Qin, Yuchen Fang, Yong Yu, Zheng Zhang, Alexander J. Smola

Heterogeneous information network (HIN) has been widely used to characterize entities of various types and their complex relations.

Click-Through Rate Prediction

Paper
Code

Multi-Oriented Text Detection with Fully Convolutional Networks

1 code implementation • CVPR 2016 • Zheng Zhang, Chengquan Zhang, Wei Shen, Cong Yao, Wenyu Liu, Xiang Bai

In this paper, we propose a novel approach for text detec- tion in natural images.

Ranked #40 on Scene Text Detection on ICDAR 2015

Scene Text Detection Text Detection

Paper
Code

DS-TransUNet:Dual Swin Transformer U-Net for Medical Image Segmentation

1 code implementation • 12 Jun 2021 • Ailiang Lin, Bingzhi Chen, Jiayu Xu, Zheng Zhang, Guangming Lu

To alleviate these problems, we propose a novel deep medical image segmentation framework called Dual Swin Transformer U-Net (DS-TransUNet), which might be the first attempt to concurrently incorporate the advantages of hierarchical Swin Transformer into both encoder and decoder of the standard U-shaped architecture to enhance the semantic segmentation quality of varying medical images.

Decoder Image Segmentation +3

Paper
Code

Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation

1 code implementation • ECCV 2020 • Zhenda Xie, Zheng Zhang, Xizhou Zhu, Gao Huang, Stephen Lin

In the feature maps of CNNs, there commonly exists considerable spatial redundancy that leads to much repetitive processing.

Paper
Code

Bridging the Gap to Real-World Object-Centric Learning

3 code implementations • 29 Sep 2022 • Maximilian Seitzer, Max Horn, Andrii Zadaianchuk, Dominik Zietlow, Tianjun Xiao, Carl-Johann Simon-Gabriel, Tong He, Zheng Zhang, Bernhard Schölkopf, Thomas Brox, Francesco Locatello

Humans naturally decompose their environment into entities at the appropriate level of abstraction to act in the world.

Object

Paper
Code

Bootstrap Your Object Detector via Mixed Training

1 code implementation • NeurIPS 2021 • Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Stephen Lin, Han Hu, Xiang Bai

We introduce MixTraining, a new training paradigm for object detection that can improve the performance of existing detectors for free.

Data Augmentation Missing Labels +3

Paper
Code

Cross-Modal Retrieval: A Systematic Review of Methods and Future Directions

1 code implementation • 28 Aug 2023 • Fengling Li, Lei Zhu, Tianshi Wang, Jingjing Li, Zheng Zhang, Heng Tao Shen

With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval methods struggle to meet the needs of users demanding access to data from various modalities.

Cross-Modal Retrieval Retrieval

Paper
Code

Parametric Instance Classification for Unsupervised Visual Feature Learning

1 code implementation • NeurIPS 2020 • Yue Cao, Zhenda Xie, Bin Liu, Yutong Lin, Zheng Zhang, Han Hu

This paper presents parametric instance classification (PIC) for unsupervised visual feature learning.

Classification General Classification

Paper
Code

DeepMIM: Deep Supervision for Masked Image Modeling

1 code implementation • 15 Mar 2023 • Sucheng Ren, Fangyun Wei, Samuel Albanie, Zheng Zhang, Han Hu

Deep supervision, which involves extra supervisions to the intermediate features of a neural network, was widely used in image classification in the early deep learning era since it significantly reduces the training difficulty and eases the optimization like avoiding gradient vanish over the vanilla training.

Image Classification object-detection +2

Paper
Code

Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

1 code implementation • Findings (EMNLP) 2021 • Ruiqi Zhong, Kristy Lee, Zheng Zhang, Dan Klein

However, the next word prediction training objective is still misaligned with the target zero-shot learning objective.

Language Modelling Natural Language Inference +2

Paper
Code

Graph Neural Networks Inspired by Classical Iterative Algorithms

1 code implementation • 10 Mar 2021 • Yongyi Yang, Tang Liu, Yangkun Wang, Jinjing Zhou, Quan Gan, Zhewei Wei, Zheng Zhang, Zengfeng Huang, David Wipf

Despite the recent success of graph neural networks (GNN), common architectures often exhibit significant limitations, including sensitivity to oversmoothing, long-range dependencies, and spurious edges, e. g., as can occur as a result of graph heterophily or adversarial attacks.

Node Classification

Paper
Code

Bag of Tricks for Node Classification with Graph Neural Networks

2 code implementations • 24 Mar 2021 • Yangkun Wang, Jiarui Jin, Weinan Zhang, Yong Yu, Zheng Zhang, David Wipf

Over the past few years, graph neural networks (GNN) and label propagation-based methods have made significant progress in addressing node classification tasks on graphs.

Ranked #1 on Node Property Prediction on ogbn-proteins

Classification General Classification +2

Paper
Code

Vega-MT: The JD Explore Academy Translation System for WMT22

1 code implementation • 20 Sep 2022 • Changtong Zan, Keqin Peng, Liang Ding, Baopu Qiu, Boan Liu, Shwai He, Qingyu Lu, Zheng Zhang, Chuang Liu, Weifeng Liu, Yibing Zhan, DaCheng Tao

As for model sizes, we scale the Transformer-Big up to the extremely large model that owns nearly 4. 7 Billion parameters, to fully enhance the model capacity for our Vega-MT.

Ranked #1 on Machine Translation on WMT 2022 English-Russian

Data Augmentation Machine Translation +1

Paper
Code

Semantics Disentangling for Generalized Zero-Shot Learning

1 code implementation • ICCV 2021 • Zhi Chen, Yadan Luo, Ruihong Qiu, Sen Wang, Zi Huang, Jingjing Li, Zheng Zhang

Generalized zero-shot learning (GZSL) aims to classify samples under the assumption that some classes are not observable during training.

Generalized Zero-Shot Learning Relation Network

Paper
Code

Prototype-supervised Adversarial Network for Targeted Attack of Deep Hashing

1 code implementation • CVPR 2021 • Xunguang Wang, Zheng Zhang, Baoyuan Wu, Fumin Shen, Guangming Lu

However, deep hashing networks are vulnerable to adversarial examples, which is a practical secure problem but seldom studied in hashing-based retrieval field.

Deep Hashing Image Retrieval +1

Paper
Code

A Survey on Incomplete Multi-view Clustering

1 code implementation • 17 Aug 2022 • Jie Wen, Zheng Zhang, Lunke Fei, Bob Zhang, Yong Xu, Zhao Zhang, Jinxing Li

However, in practical applications, such as disease diagnosis, multimedia analysis, and recommendation system, it is common to observe that not all views of samples are available in many cases, which leads to the failure of the conventional multi-view clustering methods.

Clustering Incomplete multi-view clustering

Paper
Code

SSEGCN: Syntactic and Semantic Enhanced Graph Convolutional Network for Aspect-based Sentiment Analysis

1 code implementation • NAACL 2022 • Zheng Zhang, Zili Zhou, Yanna Wang

Furthermore, to combine syntactic structure and semantic information, we equip the attention score matrices by syntactic mask matrices.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Paper
Code

Coarse-to-Fine Amodal Segmentation with Shape Prior

1 code implementation • ICCV 2023 • Jianxiong Gao, Xuelin Qian, Yikai Wang, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu

To address this issue, we propose a convolution refine module to inject fine-grained information and provide a more precise amodal object segmentation based on visual features and coarse-predicted segmentation.

Object Segmentation +1

Paper
Code

Transformer on a Diet

1 code implementation • 14 Feb 2020 • Chenguang Wang, Zihao Ye, Aston Zhang, Zheng Zhang, Alexander J. Smola

Transformer has been widely used thanks to its ability to capture sequence information in an efficient way.

Language Modelling

Paper
Code

Object-Centric Multiple Object Tracking

1 code implementation • ICCV 2023 • Zixu Zhao, Jiaze Wang, Max Horn, Yizhuo Ding, Tong He, Zechen Bai, Dominik Zietlow, Carl-Johann Simon-Gabriel, Bing Shuai, Zhuowen Tu, Thomas Brox, Bernt Schiele, Yanwei Fu, Francesco Locatello, Zheng Zhang, Tianjun Xiao

Unsupervised object-centric learning methods allow the partitioning of scenes into entities without additional localization information and are excellent candidates for reducing the annotation burden of multiple-object tracking (MOT) pipelines.

Multiple Object Tracking Object +3

Paper
Code

MConv: An Environment for Multimodal Conversational Search across Multiple Domains

1 code implementation • SIGIR 2021 • Lizi Liao, Le Hong Long, Zheng Zhang, Minlie Huang, Tat-Seng Chua

Second, a set of benchmark results for dialogue state tracking, conversational recommendation, response generation as well as a unified model for multiple tasks are reported.

Ranked #2 on Dialogue State Tracking on MMConv

Conversational Search Dialogue State Tracking +1

Paper
Code

Zero-1-to-3: Domain-level Zero-shot Cognitive Diagnosis via One Batch of Early-bird Students towards Three Diagnostic Objectives

2 code implementations • 20 Dec 2023 • Weibo Gao, Qi Liu, Hao Wang, Linan Yue, Haoyang Bi, Yin Gu, Fangzhou Yao, Zheng Zhang, Xin Li, Yuanjing He

Consequently, we refine the cognitive states of cold-start students as diagnostic outcomes via virtual data, aligning with the diagnosis-oriented goal.

cognitive diagnosis Domain Adaptation +1

Paper
Code

Relation-Aware Diffusion Model for Controllable Poster Layout Generation

1 code implementation • 15 Jun 2023 • Fengheng Li, An Liu, Wei Feng, Honghe Zhu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Jingping Shao

To advance research in this field, we have constructed a poster layout dataset named CGL-Dataset V2.

Relation

Paper
Code

Relation of the Relations: A New Paradigm of the Relation Extraction Problem

1 code implementation • 5 Jun 2020 • Zhijing Jin, Yongyi Yang, Xipeng Qiu, Zheng Zhang

In natural language, often multiple entities appear in the same text.

Relation Relation Extraction

Paper
Code

It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books

2 code implementations • 8 Sep 2021 • Bingsheng Yao, Dakuo Wang, Tongshuang Wu, Zheng Zhang, Toby Jia-Jun Li, Mo Yu, Ying Xu

Existing question answering (QA) techniques are created mainly to answer questions asked by humans.

Answer Generation Data Augmentation +3

Paper
Code

DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training

1 code implementation • 3 Oct 2023 • Aochuan Chen, Yimeng Zhang, Jinghan Jia, James Diffenderfer, Jiancheng Liu, Konstantinos Parasyris, Yihua Zhang, Zheng Zhang, Bhavya Kailkhura, Sijia Liu

Our extensive experiments show that DeepZero achieves state-of-the-art (SOTA) accuracy on ResNet-20 trained on CIFAR-10, approaching FO training performance for the first time.

Adversarial Defense Computational Efficiency +1

Paper
Code

Efficient Generation and Processing of Word Co-occurrence Networks Using corpus2graph

1 code implementation • WS 2018 • Zheng Zhang, Pierre Zweigenbaum, Ruiqing Yin

Corpus2graph is an open-source NLP-application-oriented tool that generates a word co-occurrence network from a large corpus.

Keyword Extraction

Paper
Code

StoryBuddy: A Human-AI Collaborative Chatbot for Parent-Child Interactive Storytelling with Flexible Parental Involvement

1 code implementation • 13 Feb 2022 • Zheng Zhang, Ying Xu, Yanhao Wang, Bingsheng Yao, Daniel Ritchie, Tongshuang Wu, Mo Yu, Dakuo Wang, Toby Jia-Jun Li

Despite its benefits for children's skill development and parent-child bonding, many parents do not often engage in interactive storytelling by having story-related dialogues with their child due to limited availability or challenges in coming up with appropriate questions.

Chatbot

Paper
Code

Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative Comprehension

1 code implementation • 26 Mar 2022 • Ying Xu, Dakuo Wang, Mo Yu, Daniel Ritchie, Bingsheng Yao, Tongshuang Wu, Zheng Zhang, Toby Jia-Jun Li, Nora Bradford, Branda Sun, Tran Bao Hoang, Yisi Sang, Yufang Hou, Xiaojuan Ma, Diyi Yang, Nanyun Peng, Zhou Yu, Mark Warschauer

Through benchmarking with QG models, we show that the QG model trained on FairytaleQA is capable of asking high-quality and more diverse questions.

Ranked #1 on Question Generation on FairytaleQA

Benchmarking Question Answering +2

Paper
Code

Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts

1 code implementation • 23 Oct 2023 • Tengxiao Liu, Qipeng Guo, Yuqing Yang, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang

As large language models (LLMs) have shown effectiveness with different prompting methods, such as Chain of Thought, Program of Thought, we find that these methods have formed a great complementarity to each other on math reasoning tasks.

Logical Reasoning Math

Paper
Code

GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation

1 code implementation • COLING 2020 • Zhijing Jin, Qipeng Guo, Xipeng Qiu, Zheng Zhang

With a human-annotated test set, we provide this new benchmark dataset for future research on unsupervised text generation from knowledge graphs.

Ranked #1 on Unsupervised KG-to-Text Generation on GenWiki (Fine)

Knowledge Graphs Text Generation +1

Paper
Code

Learning Enhanced Representations for Tabular Data via Neighborhood Propagation

1 code implementation • 14 Jun 2022 • Kounianhua Du, Weinan Zhang, Ruiwen Zhou, Yangkun Wang, Xilong Zhao, Jiarui Jin, Quan Gan, Zheng Zhang, David Wipf

Prediction over tabular data is an essential and fundamental problem in many important downstream tasks.

Retrieval

Paper
Code

Masked Structural Growth for 2x Faster Language Model Pre-training

1 code implementation • 4 May 2023 • Yiqun Yao, Zheng Zhang, Jing Li, Yequan Wang

In terms of growth schedule, the impact of each single dimension on a schedule's efficiency is under-explored by existing work.

Language Modelling Large Language Model +1

Paper
Code

An AMR-based Link Prediction Approach for Document-level Event Argument Extraction

1 code implementation • 30 May 2023 • Yuqing Yang, Qipeng Guo, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang

Motivated by the fact that all event structures can be inferred from AMR, this work reformulates EAE as a link prediction problem on AMR graphs.

Event Argument Extraction Link Prediction +1

Paper
Code

TransAttUnet: Multi-level Attention-guided U-Net with Transformer for Medical Image Segmentation

1 code implementation • 12 Jul 2021 • Bingzhi Chen, Yishu Liu, Zheng Zhang, Guangming Lu, Adams Wai Kin Kong

Accurate segmentation of organs or lesions from medical images is crucial for reliable diagnosis of diseases and organ morphometry.

Decoder Image Segmentation +3

Paper
Code

Evaluating Open-QA Evaluation

1 code implementation • NeurIPS 2023 • Cunxiang Wang, Sirui Cheng, Qipeng Guo, Yuanhao Yue, Bowen Ding, Zhikun Xu, Yidong Wang, Xiangkun Hu, Zheng Zhang, Yue Zhang

This study focuses on the evaluation of the Open Question Answering (Open-QA) task, which can directly estimate the factuality of large language models (LLMs).

Question Answering

Paper
Code

Improved Deep Hashing with Soft Pairwise Similarity for Multi-label Image Retrieval

1 code implementation • 8 Mar 2018 • Zheng Zhang, Qin Zou, Yuewei Lin, Long Chen, Song Wang

In this paper, a new deep hashing method is proposed for multi-label image retrieval by re-defining the pairwise similarity into an instance similarity, where the instance similarity is quantified into a percentage based on the normalized semantic labels.

Deep Hashing Multi-Label Image Retrieval

Paper
Code

Transductive Zero-Shot Hashing for Multilabel Image Retrieval

1 code implementation • 17 Nov 2019 • Qin Zou, Zheng Zhang, Ling Cao, Long Chen, Song Wang

Given semantic annotations such as class labels and pairwise similarities of the training data, hashing methods can learn and generate effective and compact binary codes.

Multi-Label Image Retrieval Quantization +1

Paper
Code

Representation Learning on Spatial Networks

1 code implementation • NeurIPS 2021 • Zheng Zhang, Liang Zhao

Specifically, a provably information-lossless and roto-translation invariant representation of spatial information on networks is presented.

Representation Learning Translation

Paper
Code

Trojaning Language Models for Fun and Profit

1 code implementation • 1 Aug 2020 • Xinyang Zhang, Zheng Zhang, Shouling Ji, Ting Wang

Recent years have witnessed the emergence of a new paradigm of building natural language processing (NLP) systems: general-purpose, pre-trained language models (LMs) are composed with simple downstream models and fine-tuned for a variety of NLP tasks.

Question Answering Specificity +1

Paper
Code

Towards Robust Neural Networks via Close-loop Control

1 code implementation • ICLR 2021 • Zhuotong Chen, Qianxiao Li, Zheng Zhang

We connect the robustness of neural networks with optimal control using the geometrical information of underlying data to design the control objective.

Paper
Code

AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation

1 code implementation • 26 Feb 2022 • Chujie Zheng, Sahand Sabour, Jiaxin Wen, Zheng Zhang, Minlie Huang

Applying this approach, we construct AugESC, an augmented dataset for the ESC task, which largely extends the scale and topic coverage of the crowdsourced ESConv corpus.

Data Augmentation Dialogue Generation +2

Paper
Code

CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response Generation

1 code implementation • 18 Aug 2022 • Jinfeng Zhou, Chujie Zheng, Bo wang, Zheng Zhang, Minlie Huang

Empathetic conversation is psychologically supposed to be the result of conscious alignment and interaction between the cognition and affection of empathy.

Dialogue Generation Empathetic Response Generation +1

Paper
Code

Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning

1 code implementation • 6 Jun 2023 • Chujie Zheng, Pei Ke, Zheng Zhang, Minlie Huang

It has always been an important yet challenging problem to control language models to avoid generating texts with undesirable attributes, such as toxic language and unnatural repetition.

Contrastive Learning Text Generation

Paper
Code

Partial Vessels Annotation-based Coronary Artery Segmentation with Self-training and Prototype Learning

1 code implementation • 10 Jul 2023 • Zheng Zhang, XiaoLei Zhang, Yaolei Qi, Guanyu Yang

To this end, we propose partial vessels annotation (PVA) based on the challenges of coronary artery segmentation and clinical diagnostic characteristics.

Coronary Artery Segmentation Segmentation +1

Paper
Code

4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs

1 code implementation • 28 Apr 2024 • Minjie Wang, Quan Gan, David Wipf, Zhenkun Cai, Ning li, Jianheng Tang, Yanlin Zhang, Zizhao Zhang, Zunyao Mao, Yakun Song, Yanbo Wang, Jiahang Li, Han Zhang, Guang Yang, Xiao Qin, Chuan Lei, Muhan Zhang, Weinan Zhang, Christos Faloutsos, Zheng Zhang

Although RDBs store vast amounts of rich, informative data spread across interconnected tables, the progress of predictive machine learning models as applied to such tasks arguably falls well behind advances in other domains such as computer vision or natural language processing.

Benchmarking

Paper
Code

Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning

1 code implementation • 12 May 2021 • Yansong Tang, Zhenyu Jiang, Zhenda Xie, Yue Cao, Zheng Zhang, Philip H. S. Torr, Han Hu

Previous cycle-consistency correspondence learning methods usually leverage image patches for training.

Landmark Tracking Pose Tracking +3

Paper
Code

Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval

1 code implementation • ICCV 2023 • Jianfeng Dong, Minsong Zhang, Zheng Zhang, Xianke Chen, Daizong Liu, Xiaoye Qu, Xun Wang, Baolong Liu

During the knowledge distillation, an inheritance student branch is devised to absorb the knowledge from the teacher model.

Knowledge Distillation Language Modelling +3

Paper
Code

Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus

1 code implementation • 22 Nov 2023 • Tianhang Zhang, Lin Qiu, Qipeng Guo, Cheng Deng, Yue Zhang, Zheng Zhang, Chenghu Zhou, Xinbing Wang, Luoyi Fu

Large Language Models (LLMs) have gained significant popularity for their impressive performance across diverse fields.

Hallucination Retrieval

Paper
Code

LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models

1 code implementation • 18 Feb 2024 • Yifan Yang, Jiajun Zhou, Ngai Wong, Zheng Zhang

Various parameter-efficient fine-tuning (PEFT) techniques have been proposed to enable computationally efficient fine-tuning while maintaining model performance.

Multi-Task Learning

Paper
Code

Attentional Neural Network: Feature Selection Using Cognitive Feedback

1 code implementation • NeurIPS 2014 • Qian Wang, Jiaxing Zhang, Sen Song, Zheng Zhang

Attentional Neural Network is a new framework that integrates top-down cognitive bias and bottom-up feature extraction in one coherent architecture.

feature selection General Classification

Paper
Code

Towards Compact Neural Networks via End-to-End Training: A Bayesian Tensor Approach with Automatic Rank Determination

1 code implementation • 17 Oct 2020 • Cole Hawkins, Xing Liu, Zheng Zhang

This paper presents a novel end-to-end framework for low-rank tensorized training of neural networks.

Model Compression Tensor Decomposition +1

Paper
Code

DORE: Document Ordered Relation Extraction based on Generative Framework

1 code implementation • 28 Oct 2022 • Qipeng Guo, Yuqing Yang, Hang Yan, Xipeng Qiu, Zheng Zhang

In this paper, we investigate the root cause of the underwhelming performance of the existing generative DocRE models and discover that the culprit is the inadequacy of the training paradigm, instead of the capacities of the models.

Document-level Relation Extraction Relation

Paper
Code

RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees

1 code implementation • 31 Oct 2022 • Tengxiao Liu, Qipeng Guo, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang

RLET iteratively performs single step reasoning with sentence selection and deduction generation modules, from which the training signal is accumulated across the tree with elaborately designed aligned reward function that is consistent with the evaluation.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Partition-A-Medical-Image: Extracting Multiple Representative Sub-regions for Few-shot Medical Image Segmentation

1 code implementation • 20 Sep 2023 • Yazhou Zhu, Shidong Wang, Tong Xin, Zheng Zhang, Haofeng Zhang

In this work, we present an approach to extract multiple representative sub-regions from a given support medical image, enabling fine-grained selection over the generated image regions.

Image Segmentation Medical Image Segmentation +1

Paper
Code

RTQ: Rethinking Video-language Understanding Based on Image-text Model

2 code implementations • 1 Dec 2023 • Xiao Wang, Yaoyu Li, Tian Gan, Zheng Zhang, Jingjing Lv, Liqiang Nie

Recent advancements in video-language understanding have been established on the foundation of image-text models, resulting in promising outcomes due to the shared knowledge between images and videos.

Ranked #9 on Video Retrieval on MSR-VTT-1kA

Video Captioning Video Question Answering +1

Paper
Code

Adversarial Bipartite Graph Learning for Video Domain Adaptation

1 code implementation • 31 Jul 2020 • Yadan Luo, Zi Huang, Zijian Wang, Zheng Zhang, Mahsa Baktashmotlagh

To further enhance the model capacity and testify the robustness of the proposed architecture on difficult transfer tasks, we extend our model to work in a semi-supervised setting using an additional video-level bipartite graph.

Ranked #2 on Domain Adaptation on HMDB --> UCF (full)

Domain Adaptation Graph Learning +1

Paper
Code

Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning

4 code implementations • 3 Oct 2022 • Weicong Liang, Yuhui Yuan, Henghui Ding, Xiao Luo, WeiHong Lin, Ding Jia, Zheng Zhang, Chao Zhang, Han Hu

Vision transformers have recently achieved competitive results across various vision tasks but still suffer from heavy computation costs when processing a large number of tokens.

Clustering Depth Estimation +6

Paper
Code

CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection

1 code implementation • 5 Dec 2022 • Xi Zhao, Wei Feng, Zheng Zhang, Jingjing Lv, Xin Zhu, Zhangang Lin, Jinghe Hu, Jingping Shao

Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion.

Scene Text Detection Segmentation +1

Paper
Code

Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations

1 code implementation • 12 May 2023 • Yuan Tian, Zheng Zhang, Zheng Ning, Toby Jia-Jun Li, Jonathan K. Kummerfeld, Tianyi Zhang

Many techniques have been proposed to automatically generate SQL from natural language, but they suffer from two issues: (1) they still make many mistakes, particularly for complex queries, and (2) they do not provide a flexible way for non-expert users to validate and refine incorrect queries.

Text-To-SQL

Paper
Code

Active Subspace of Neural Networks: Structural Analysis and Universal Attacks

1 code implementation • 29 Oct 2019 • Chunfeng Cui, Kaiqi Zhang, Talgat Daulbaev, Julia Gusak, Ivan Oseledets, Zheng Zhang

Secondly, we propose analyzing the vulnerability of a neural network using active subspace and finding an additive universal adversarial attack vector that can misclassify a dataset with a high probability.

Adversarial Attack Uncertainty Quantification

Paper
Code

Quantum-Inspired Hamiltonian Monte Carlo for Bayesian Sampling

1 code implementation • 4 Dec 2019 • Ziming Liu, Zheng Zhang

Hamiltonian Monte Carlo (HMC) is an efficient Bayesian sampling method that can make distant proposals in the parameter space by simulating a Hamiltonian dynamical system.

BIG-bench Machine Learning Image Denoising +1

Paper
Code

Dialogue Meaning Representation for Task-Oriented Dialogue Systems

1 code implementation • 23 Apr 2022 • Xiangkun Hu, Junqi Dai, Hang Yan, Yi Zhang, Qipeng Guo, Xipeng Qiu, Zheng Zhang

We propose Dialogue Meaning Representation (DMR), a pliable and easily extendable representation for task-oriented dialogue.

coreference-resolution Negation +1

Paper
Code

Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation

1 code implementation • ICCV 2023 • Ke Fan, Jingshi Lei, Xuelin Qian, Miaopeng Yu, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu

Furthermore, we propose a multi-view fusion layer based temporal module which is equipped with a set of object slots and interacts with features from different views by attention mechanism to fulfill sufficient object representation completion.

Object Video Segmentation +1

Paper
Code

Self-Healing Robust Neural Networks via Closed-Loop Control

1 code implementation • 26 Jun 2022 • Zhuotong Chen, Qianxiao Li, Zheng Zhang

While numerous attack and defense techniques have been developed, this work investigates the robustness issue from a new angle: can we design a self-healing neural network that can automatically detect and fix the vulnerability issue by itself?

Paper
Code

Exploiting Abstract Meaning Representation for Open-Domain Question Answering

1 code implementation • 26 May 2023 • Cunxiang Wang, Zhikun Xu, Qipeng Guo, Xiangkun Hu, Xuefeng Bai, Zheng Zhang, Yue Zhang

The Open-Domain Question Answering (ODQA) task involves retrieving and subsequently generating answers from fine-grained relevant passages within a database.

Natural Questions Open-Domain Question Answering +1

Paper
Code

StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding

1 code implementation • 19 Oct 2023 • Cheng Jiayang, Lin Qiu, Tsz Ho Chan, Tianqing Fang, Weiqi Wang, Chunkit Chan, Dongyu Ru, Qipeng Guo, Hongming Zhang, Yangqiu Song, Yue Zhang, Zheng Zhang

Analogy-making between narratives is crucial for human reasoning.

Multiple-choice Natural Language Understanding +3

Paper
Code

Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval

1 code implementation • IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2023 • Xu Yuan, Zheng Zhang, Xunguang Wang, Lin Wu

Further, we, for the first time, formulate the formalized adversarial training of deep hashing into a unified minimax optimization under the guidance of the generated mainstay codes.

Adversarial Attack Adversarial Robustness +2

Paper
Code

BiPFT: Binary Pre-trained Foundation Transformer with Low-rank Estimation of Binarization Residual Polynomials

1 code implementation • 14 Dec 2023 • Xingrun Xing, Li Du, Xinyuan Wang, Xianlin Zeng, Yequan Wang, Zheng Zhang, Jiajun Zhang

Specifically, we first analyze the binarization error in self-attention operations and derive the polynomials of binarization error.

Binarization Natural Language Understanding

Paper
Code

Bayesian Inference with Certifiable Adversarial Robustness

1 code implementation • 10 Feb 2021 • Matthew Wicker, Luca Laurenti, Andrea Patane, Zhoutong Chen, Zheng Zhang, Marta Kwiatkowska

We consider adversarial training of deep neural networks through the lens of Bayesian learning, and present a principled framework for adversarial training of Bayesian Neural Networks (BNNs) with certifiable guarantees.

Adversarial Robustness Bayesian Inference

Paper
Code

Tractable Clustering of Data on the Curve Manifold

1 code implementation • 13 Apr 2017 • Stephen Tierney, Junbin Gao, Yi Guo, Zheng Zhang

However the data may actually be functional i. e.\ each data point is a function of some variable such as time and the function is discretely sampled.

Clustering

Paper
Code

Investigating and Mitigating the Side Effects of Noisy Views for Self-Supervised Clustering Algorithms in Practical Multi-View Scenarios

1 code implementation • 30 Mar 2023 • Jie Xu, Yazhou Ren, Xiaolong Wang, Lei Feng, Zheng Zhang, Gang Niu, Xiaofeng Zhu

Multi-view clustering (MVC) aims at exploring category structures among multi-view data in self-supervised manners.

Clustering Representation Learning

Paper
Code

KLoB: a Benchmark for Assessing Knowledge Locating Methods in Language Models

1 code implementation • 28 Sep 2023 • Yiming Ju, Zheng Zhang

KLoB can serve as a benchmark for evaluating existing locating methods in language models, and can contributes a method to reassessing the validity of locality hypothesis of factual knowledge.

Paper
Code

Bayesian Tensorized Neural Networks with Automatic Rank Selection

1 code implementation • 24 May 2019 • Cole Hawkins, Zheng Zhang

Tensor decomposition is an effective approach to compress over-parameterized neural networks and to enable their deployment on resource-constrained hardware platforms.

Model Compression Tensor Decomposition

Paper
Code

Online, Informative MCMC Thinning with Kernelized Stein Discrepancy

1 code implementation • 18 Jan 2022 • Cole Hawkins, Alec Koppel, Zheng Zhang

A fundamental challenge in Bayesian inference is efficient representation of a target distribution.

Bayesian Inference

Paper
Code

Faint Features Tell: Automatic Vertebrae Fracture Screening Assisted by Contrastive Learning

1 code implementation • 23 Aug 2022 • Xin Wei, Huaiwei Cong, Zheng Zhang, Junran Peng, Guoping Chen, Jinpeng Li

Long-term vertebral fractures severely affect the life quality of patients, causing kyphotic, lumbar deformity and even paralysis.

Binary Classification Computed Tomography (CT) +3

Paper
Code

Whole-Body Lesion Segmentation in 18F-FDG PET/CT

1 code implementation • 16 Sep 2022 • Jia Zhang, Yukun Huang, Zheng Zhang, Yuhang Shi

There has been growing research interest in using deep learning based method to achieve fully automated segmentation of lesion in Positron emission tomography computed tomography(PET CT) scans for the prognosis of various cancers.

Image Segmentation Lesion Segmentation +2

Paper
Code

EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs

1 code implementation • 30 Mar 2024 • Cheng Jiayang, Lin Qiu, Chunkit Chan, Xin Liu, Yangqiu Song, Zheng Zhang

In this work, we propose an initial comprehensive framework called EventGround, which aims to tackle the problem of grounding free-texts to eventuality-centric KGs for contextualized narrative reasoning.

Knowledge Graphs Language Modelling +2

Paper
Code

Structure Regularized Attentive Network for Automatic Femoral Head Necrosis Diagnosis and Localization

1 code implementation • 23 Aug 2022 • Lingfeng li, Huaiwei Cong, Gangming Zhao, Junran Peng, Zheng Zhang, Jinpeng Li

However, due to the tissue overlap, X-ray images are difficult to provide fine-grained features for early diagnosis.

Computed Tomography (CT)

Paper
Code

Self-supervised Amodal Video Object Segmentation

1 code implementation • 23 Oct 2022 • Jian Yao, Yuxin Hong, Chiyu Wang, Tianjun Xiao, Tong He, Francesco Locatello, David Wipf, Yanwei Fu, Zheng Zhang

The key intuition is that the occluded part of an object can be explained away if that part is visible in other frames, possibly deformed as long as the deformation can be reasonably learned.

Object Segmentation +6

Paper
Code

EASpace: Enhanced Action Space for Policy Transfer

1 code implementation • 7 Dec 2022 • Zheng Zhang, Qingrui Zhang, Bo Zhu, Xiaohan Wang, Tianjiang Hu

In this paper, a novel algorithm named EASpace (Enhanced Action Space) is proposed, which formulates macro actions in an alternative form to accelerate the learning process using multiple available sub-optimal expert policies.

Q-Learning Transfer Learning

Paper
Code

Mutual Query Network for Multi-Modal Product Image Segmentation

1 code implementation • 26 Jun 2023 • Yun Guo, Wei Feng, Zheng Zhang, Xiancong Ren, Yaoyu Li, Jingjing Lv, Xin Zhu, Zhangang Lin, Jingping Shao

Product image segmentation is vital in e-commerce.

Image Segmentation Segmentation +1

Paper
Code

Non-Euclidean Spatial Graph Neural Network

1 code implementation • 17 Dec 2023 • Zheng Zhang, Sirui Li, Jingcheng Zhou, Junxiang Wang, Abhinav Angirekula, Allen Zhang, Liang Zhao

Besides, existing spatial network representation learning methods can only consider networks embedded in Euclidean space, and can not well exploit the rich geometric information carried by irregular and non-uniform non-Euclidean space.

Representation Learning

Paper
Code

Memory-augmented Dialogue Management for Task-oriented Dialogue Systems

no code implementations • 1 May 2018 • Zheng Zhang, Minlie Huang, Zhongzhou Zhao, Feng Ji, Haiqing Chen, Xiaoyan Zhu

Dialogue management (DM) decides the next action of a dialogue system according to the current dialogue state, and thus plays a central role in task-oriented dialogue systems.

Dialogue Management Management +1

Paper
Add Code

Loss Functions for Multiset Prediction

no code implementations • ICLR 2018 • Sean Welleck, Zixin Yao, Yu Gai, Jialin Mao, Zheng Zhang, Kyunghyun Cho

In this paper, we propose a novel multiset loss function by viewing this problem from the perspective of sequential decision making.

Decision Making Reinforcement Learning (RL)

Paper
Add Code

Saliency-based Sequential Image Attention with Multiset Prediction

no code implementations • NeurIPS 2017 • Sean Welleck, Jialin Mao, Kyunghyun Cho, Zheng Zhang

Humans process visual scenes selectively and sequentially using attention.

Classification General Classification +3

Paper
Add Code

Discriminative Block-Diagonal Representation Learning for Image Recognition

no code implementations • 12 Jul 2017 • Zheng Zhang, Yong Xu, Ling Shao, Jian Yang

In particular, the elaborate BDLRR is formulated as a joint optimization problem of shrinking the unfavorable representation from off-block-diagonal elements and strengthening the compact block-diagonal representation under the semi-supervised framework of low-rank representation.

Representation Learning

Paper
Add Code

FERA 2017 - Addressing Head Pose in the Third Facial Expression Recognition and Analysis Challenge

no code implementations • 14 Feb 2017 • Michel F. Valstar, Enrique Sánchez-Lozano, Jeffrey F. Cohn, László A. Jeni, Jeffrey M. Girard, Zheng Zhang, Lijun Yin, Maja Pantic

The FG 2017 Facial Expression Recognition and Analysis challenge (FERA 2017) extends FERA 2015 to the estimation of Action Units occurrence and intensity under different camera views.

Benchmarking Facial Action Unit Detection +4

Paper
Add Code

Who Leads the Clothing Fashion: Style, Color, or Texture? A Computational Study

no code implementations • 26 Aug 2016 • Qin Zou, Zheng Zhang, Qian Wang, Qingquan Li, Long Chen, Song Wang

Specifically, a classification-based model is proposed to quantify the influence of different visual stimuli, in which each visual stimulus's influence is quantified by its corresponding accuracy in fashion classification.

General Classification

Paper
Add Code

Learning Word Embeddings from Intrinsic and Extrinsic Views

no code implementations • 20 Aug 2016 • Jifan Chen, Kan Chen, Xipeng Qiu, Qi Zhang, Xuanjing Huang, Zheng Zhang

To prove the effectiveness of our model, we evaluate it on four tasks, including word similarity, reverse dictionaries, Wiki link prediction, and document classification.

Descriptive Document Classification +4

Paper
Add Code

Local Multiple Directional Pattern of Palmprint Image

no code implementations • 21 Jul 2016 • Lunke Fei, Jie Wen, Zheng Zhang, Ke Yan, Zuofeng Zhong

Conventional methods usually capture the only one of the most dominant direction of palmprint images.

Paper
Add Code

Natural Scene Character Recognition Using Robust PCA and Sparse Representation

no code implementations • 15 Jun 2016 • Zheng Zhang, Yong Xu, Cheng-Lin Liu

Natural scene character recognition is challenging due to the cluttered background, which is hard to separate from text.

Paper
Add Code

A survey of sparse representation: algorithms and applications

no code implementations • 23 Feb 2016 • Zheng Zhang, Yong Xu, Jian Yang, Xuelong. Li, David Zhang

The main purpose of this article is to provide a comprehensive study and an updated review on sparse representation and to supply a guidance for researchers.

Paper
Add Code

Learning to Point and Count

no code implementations • 8 Dec 2015 • Jie Shao, Dequan Wang, xiangyang xue, Zheng Zhang

This paper proposes the problem of point-and-count as a test case to break the what-and-where deadlock.

General Classification

Paper
Add Code

First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks

no code implementations • 19 Nov 2015 • Quan Gan, Qipeng Guo, Zheng Zhang, Kyunghyun Cho

In this paper, we propose and study a novel visual object tracking approach based on convolutional networks and recurrent networks.

Object Visual Object Tracking +1

Paper
Add Code

The Application of Two-level Attention Models in Deep Convolutional Neural Network for Fine-grained Image Classification

no code implementations • CVPR 2015 • Tianjun Xiao, Yichong Xu, Kuiyuan Yang, Jiaxing Zhang, Yuxin Peng, Zheng Zhang

Our pipeline integrates three types of attention: the bottom-up attention that propose candidate patches, the object-level top-down attention that selects relevant patches to a certain object, and the part-level top-down attention that localizes discriminative parts.

Classification Fine-Grained Image Classification +2

Paper
Add Code

Scale-Invariant Convolutional Neural Networks

no code implementations • 24 Nov 2014 • Yichong Xu, Tianjun Xiao, Jiaxing Zhang, Kuiyuan Yang, Zheng Zhang

Even though convolutional neural networks (CNN) has achieved near-human performance in various computer vision tasks, its ability to tolerate scale variations is limited.

Data Augmentation General Classification

Paper
Add Code

Effective Occlusion Handling for Fast Correlation Filter-based Trackers

no code implementations • 13 Jul 2018 • Zheng Zhang, T. T. Wong

Correlation filter-based trackers heavily suffer from the problem of multiple peaks in their response maps incurred by occlusions.

Occlusion Handling

Paper
Add Code

Top-Down Tree Structured Text Generation

no code implementations • 14 Aug 2018 • Qipeng Guo, Xipeng Qiu, xiangyang xue, Zheng Zhang

Text generation is a fundamental building block in natural language processing tasks.

Sentence Text Generation

Paper
Add Code

Variational Bayesian Inference for Robust Streaming Tensor Factorization and Completion

no code implementations • 6 Sep 2018 • Cole Hawkins, Zheng Zhang

Streaming tensor factorization is a powerful tool for processing high-volume and multi-way temporal data in Internet networks, recommender systems and image/video data analysis.

Bayesian Inference Recommendation Systems

Paper
Add Code

Incomplete Multi-view Clustering via Graph Regularized Matrix Factorization

no code implementations • 17 Sep 2018 • Jie Wen, Zheng Zhang, Yong Xu, Zuofeng Zhong

Clustering with incomplete views is a challenge in multi-view clustering.

Clustering Graph Embedding +1

Paper
Add Code

Highly-Economized Multi-View Binary Compression for Scalable Image Clustering

no code implementations • ECCV 2018 • Zheng Zhang, Li Liu, Jie Qin, Fan Zhu, Fumin Shen, Yong Xu, Ling Shao, Heng Tao Shen

How to economically cluster large-scale multi-view images is a long-standing problem in computer vision.

Clustering Image Clustering +1

Paper
Add Code

Melodic Phrase Segmentation By Deep Neural Networks

no code implementations • 14 Nov 2018 • Yixing Guan, Jinyu Zhao, Yiqin Qiu, Zheng Zhang, Gus Xia

Automated melodic phrase detection and segmentation is a classical task in content-based music information retrieval and also the key towards automated music structure analysis.

Information Retrieval Music Information Retrieval +2

Paper
Add Code

Integrated Object Detection and Tracking with Tracklet-Conditioned Detection

no code implementations • 27 Nov 2018 • Zheng Zhang, Dazhi Cheng, Xizhou Zhu, Stephen Lin, Jifeng Dai

Accurate detection and tracking of objects is vital for effective video understanding.

Ranked #19 on Video Object Detection on ImageNet VID

Object object-detection +2

Paper
Add Code

GNEG: Graph-Based Negative Sampling for word2vec

no code implementations • ACL 2018 • Zheng Zhang, Pierre Zweigenbaum

Negative sampling is an important component in word2vec for distributed word representation learning.

Language Modelling Representation Learning +1

Paper
Add Code

zNLP: Identifying Parallel Sentences in Chinese-English Comparable Corpora

no code implementations • WS 2017 • Zheng Zhang, Pierre Zweigenbaum

This paper describes the zNLP system for the BUCC 2017 shared task.

Machine Translation Sentence

Paper
Add Code

Gradient-based learning for F-measure and other performance metrics

no code implementations • ICLR 2019 • Yu Gai, Zheng Zhang, Kyunghyun Cho

Many important classification performance metrics, e. g. $F$-measure, are non-differentiable and non-decomposable, and are thus unfriendly to gradient descent algorithm.

General Classification

Paper
Add Code

Prediction of multi-dimensional spatial variation data via Bayesian tensor completion

no code implementations • 3 Jan 2019 • Jiali Luan, Zheng Zhang

This paper presents a multi-dimensional computational method to predict the spatial variation data inside and across multiple dies of a wafer.

Paper
Add Code

Adaptive Locality Preserving Regression

no code implementations • 3 Jan 2019 • Jie Wen, Zuofeng Zhong, Zheng Zhang, Lunke Fei, Zhihui Lai, Runze Chen

This paper proposes a novel discriminative regression method, called adaptive locality preserving regression (ALPR) for classification.

feature selection regression

Paper
Add Code

Bilinear Supervised Hashing Based on 2D Image Features

no code implementations • 5 Jan 2019 • Yujuan Ding, Wai Kueng Wong, Zhihui Lai, Zheng Zhang

Hashing has been recognized as an efficient representation learning method to effectively handle big data due to its low computational complexity and memory cost.

Representation Learning Retrieval

Paper
Add Code

Symmetry-Based Text Line Detection in Natural Scenes

no code implementations • CVPR 2015 • Zheng Zhang, Wei Shen, Cong Yao, Xiang Bai

Recently, a variety of real-world applications have triggered huge demand for techniques that can extract textual information from natural scenes.

Line Detection Scene Text Detection +1

Paper
Add Code

Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis

no code implementations • CVPR 2016 • Zheng Zhang, Jeff M. Girard, Yue Wu, Xing Zhang, Peng Liu, Umur Ciftci, Shaun Canavan, Michael Reale, Andy Horowitz, Huiyuan Yang, Jeffrey F. Cohn, Qiang Ji, Lijun Yin

The corpus further includes derived features from 3D, 2D, and IR (infrared) sensors and baseline results for facial expression and action unit detection.

Action Unit Detection

Paper
Add Code

Multiple Granularity Descriptors for Fine-Grained Categorization

no code implementations • ICCV 2015 • Dequan Wang, Zhiqiang Shen, Jie Shao, Wei zhang, xiangyang xue, Zheng Zhang

Fine-grained categorization, which aims to distinguish subordinate-level categories such as bird species or dog breeds, is an extremely challenging task.

Paper
Add Code

Explainable Text-Driven Neural Network for Stock Prediction

no code implementations • 13 Feb 2019 • Linyi Yang, Zheng Zhang, Su Xiong, Lirui Wei, James Ng, Lina Xu, Ruihai Dong

It has been shown that financial news leads to the fluctuation of stock prices.

Stock Prediction Stock Price Prediction

Paper
Add Code

SADIH: Semantic-Aware DIscrete Hashing

no code implementations • 3 Apr 2019 • Zheng Zhang, Guo-Sen Xie, Yang Li, Sheng Li, Zi Huang

Due to its low storage cost and fast query speed, hashing has been recognized to accomplish similarity search in large-scale multimedia retrieval applications.

Retrieval

Paper
Add Code

Spatial-Temporal Relation Networks for Multi-Object Tracking

no code implementations • ICCV 2019 • Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu

Recent progress in multiple object tracking (MOT) has shown that a robust similarity score is key to the success of trackers.

Multi-Object Tracking Multiple Object Tracking +2

Paper
Add Code

Scalable Block-Diagonal Locality-Constrained Projective Dictionary Learning

no code implementations • 25 May 2019 • Zhao Zhang, Weiming Jiang, Zheng Zhang, Sheng Li, Guangcan Liu, Jie Qin

More importantly, LC-PDL avoids using the complementary data matrix to learn the sub-dictionary over each class.

Dictionary Learning

Paper
Add Code

Joint Subspace Recovery and Enhanced Locality Driven Robust Flexible Discriminative Dictionary Learning

no code implementations • 11 Jun 2019 • Zhao Zhang, Jiahuan Ren, Weiming Jiang, Zheng Zhang, Richang Hong, Shuicheng Yan, Meng Wang

We propose a joint subspace recovery and enhanced locality based robust flexible label consistent dictionary learning method called Robust Flexible Discriminative Dictionary Learning (RFDDL).

Dictionary Learning

Paper
Add Code

Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation

no code implementations • 1 Aug 2019 • Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Jingjing Li, Yang Yang

Visual paragraph generation aims to automatically describe a given image from different perspectives and organize sentences in a coherent way.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Adaptive Structure-constrained Robust Latent Low-Rank Coding for Image Recovery

no code implementations • 21 Aug 2019 • Zhao Zhang, Lei Wang, Sheng Li, Yang Wang, Zheng Zhang, Zheng-Jun Zha, Meng Wang

Specifically, AS-LRC performs the latent decomposition of given data into a low-rank reconstruction by a block-diagonal codes matrix, a group sparse locality-adaptive salient feature part and a sparse error part.

Representation Learning

Paper
Add Code

Learning Structured Twin-Incoherent Twin-Projective Latent Dictionary Pairs for Classification

no code implementations • 21 Aug 2019 • Zhao Zhang, Yulin Sun, Zheng Zhang, Yang Wang, Guangcan Liu, Meng Wang

In this setting, our TP-DPL integrates the twin-incoherence based latent flexible DPL and the joint embedding of codes as well as salient features by twin-projection into a unified model in an adaptive neighborhood-preserving manner.

General Classification

Paper
Add Code

Equity2Vec: End-to-end Deep Learning Framework for Cross-sectional Asset Pricing

no code implementations • 7 Sep 2019 • Qiong Wu, Christopher G. Brinton, Zheng Zhang, Andrea Pizzoferrato, Zhenming Liu, Mihai Cucuringu

Pricing assets has attracted significant attention from the financial technology community.

Paper
Add Code

Cross-Lingual Contextual Word Embeddings Mapping With Multi-Sense Words In Mind

no code implementations • 18 Sep 2019 • Zheng Zhang, Ruiqing Yin, Jun Zhu, Pierre Zweigenbaum

Recent work in cross-lingual contextual word embedding learning cannot handle multi-sense words well.

Bilingual Lexicon Induction Word Embeddings

Paper
Add Code

Deep Collaborative Discrete Hashing with Semantic-Invariant Structure

no code implementations • 5 Nov 2019 • Zijian Wang, Zheng Zhang, Yadan Luo, Zi Huang

Existing deep hashing approaches fail to fully explore semantic correlations and neglect the effect of linguistic context on visual attention learning, leading to inferior performance.

Deep Hashing

Paper
Add Code

Learning from the Past: Continual Meta-Learning via Bayesian Graph Modeling

no code implementations • 12 Nov 2019 • Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Mahsa Baktashmotlagh, Yang Yang

Meta-learning for few-shot learning allows a machine to leverage previously acquired knowledge as a prior, thus improving the performance on novel tasks with only small amounts of data.

Continual Learning Few-Shot Learning

Paper
Add Code

Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning

no code implementations • 20 Nov 2019 • Yulin Sun, Zhao Zhang, Weiming Jiang, Zheng Zhang, Li Zhang, Shuicheng Yan, Meng Wang

In this paper, we propose a structured Robust Adaptive Dic-tionary Pair Learning (RA-DPL) framework for the discrim-inative sparse representation learning.

Representation Learning

Paper
Add Code

Multi-Scale Self-Attention for Text Classification

no code implementations • 2 Dec 2019 • Qipeng Guo, Xipeng Qiu, PengFei Liu, xiangyang xue, Zheng Zhang

In this paper, we introduce the prior knowledge, multi-scale structure, into self-attention modules.

General Classification text-classification +1

Paper
Add Code

Fully-Convolutional Intensive Feature Flow Neural Network for Text Recognition

no code implementations • 13 Dec 2019 • Zhao Zhang, Zemin Tang, Zheng Zhang, Yang Wang, Jie Qin, Meng Wang

But existing CNNs based frameworks still have several drawbacks: 1) the traditaional pooling operation may lose important feature information and is unlearnable; 2) the tradi-tional convolution operation optimizes slowly and the hierar-chical features from different layers are not fully utilized.

Paper
Add Code

Deep Self-representative Concept Factorization Network for Representation Learning

no code implementations • 13 Dec 2019 • Yan Zhang, Zhao Zhang, Zheng Zhang, Mingbo Zhao, Li Zhang, Zheng-Jun Zha, Meng Wang

In this paper, we investigate the unsupervised deep representation learning issue and technically propose a novel framework called Deep Self-representative Concept Factorization Network (DSCF-Net), for clustering deep features.

Clustering Representation Learning

Paper
Add Code

Tucker Tensor Decomposition on FPGA

no code implementations • 28 Jun 2019 • Kaiqi Zhang, Xiyuan Zhang, Zheng Zhang

This paper presents an hardware accelerator for a classical tensor computation framework, Tucker decomposition.

Signal Processing Hardware Architecture

Paper
Add Code

Dense Residual Network: Enhancing Global Dense Feature Flow for Character Recognition

no code implementations • 23 Jan 2020 • Zhao Zhang, Zemin Tang, Yang Wang, Zheng Zhang, Choujun Zhan, ZhengJun Zha, Meng Wang

To construct FDRN, we propose a new fast residual dense block (f-RDB) to retain the ability of local feature fusion and local residual learning of original RDB, which can reduce the computing efforts at the same time.

Paper
Add Code

Recent Advances and Challenges in Task-oriented Dialog System

no code implementations • 17 Mar 2020 • Zheng Zhang, Ryuichi Takanobu, Qi Zhu, Minlie Huang, Xiaoyan Zhu

Due to the significance and value in human-computer interaction and natural language processing, task-oriented dialog systems are attracting more and more attention in both academic and industrial communities.

Paper
Add Code

Task-adaptive Asymmetric Deep Cross-modal Hashing

no code implementations • 1 Apr 2020 • Fengling Li, Tong Wang, Lei Zhu, Zheng Zhang, Xinhua Wang

Unlike previous cross-modal hashing approaches, our learning framework jointly optimizes semantic preserving that transforms deep features of multimedia data into binary hash codes, and the semantic regression which directly regresses query modality representation to explicit label.

Cross-Modal Retrieval Retrieval

Paper
Add Code

Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Zheng Zhang, Lizi Liao, Xiaoyan Zhu, Tat-Seng Chua, Zitao Liu, Yan Huang, Minlie Huang

Most existing approaches for goal-oriented dialogue policy learning used reinforcement learning, which focuses on the target agent policy and simply treat the opposite agent policy as part of the environment.

Decision Making

Paper
Add Code

Repurpose Open Data to Discover Therapeutics for COVID-19 using Deep Learning

no code implementations • 21 May 2020 • Xiangxiang Zeng, Xiang Song, Tengfei Ma, Xiaoqin Pan, Yadi Zhou, Yuan Hou, Zheng Zhang, George Karypis, Feixiong Cheng

While this study, by no means recommends specific drugs, it demonstrates a powerful deep learning methodology to prioritize existing drugs for further investigation, which holds the potential of accelerating therapeutic development for COVID-19.

Paper
Add Code

Dual-level Semantic Transfer Deep Hashing for Efficient Social Image Retrieval

1 code implementation • 10 Jun 2020 • Lei Zhu, Hui Cui, Zhiyong Cheng, Jingjing Li, Zheng Zhang

Specifically, we design a complementary dual-level semantic transfer mechanism to efficiently discover the potential semantics of tags and seamlessly transfer them into binary hash codes.

Deep Hashing Image Retrieval +1

Paper
Code

FeatGraph: A Flexible and Efficient Backend for Graph Neural Network Systems

no code implementations • 26 Aug 2020 • Yuwei Hu, Zihao Ye, Minjie Wang, Jiali Yu, Da Zheng, Mu Li, Zheng Zhang, Zhiru Zhang, Yida Wang

FeatGraph provides a flexible programming interface to express diverse GNN models by composing coarse-grained sparse templates with fine-grained user-defined functions (UDFs) on each vertex/edge.

Paper
Add Code

Dual-constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior

no code implementations • 8 Sep 2020 • Yan Zhang, Zhao Zhang, Yang Wang, Zheng Zhang, Li Zhang, Shuicheng Yan, Meng Wang

Nonnegative matrix factorization is usually powerful for learning the "shallow" parts-based representation, but it clearly fails to discover deep hierarchical information within both the basis and representation spaces.

Clustering Graph Learning +1

Paper
Add Code

Region Graph Embedding Network for Zero-Shot Learning

no code implementations • ECCV 2020 • Guo-Sen Xie, Li Liu, Fan Zhu, Fang Zhao, Zheng Zhang, Yazhou Yao, Jie Qin, Ling Shao

To exploit the progressive interactions among these regions, we represent them as a region graph, on which the parts relation reasoning is performed with graph convolutions, thus leading to our PRR branch.

Graph Embedding Relation +1

Paper
Add Code

Causal Inference of General Treatment Effects using Neural Networks with A Diverging Number of Confounders

no code implementations • 15 Sep 2020 • Xiaohong Chen, Ying Liu, Shujie Ma, Zheng Zhang

This paper considers a generalized optimization framework for efficient estimation of general treatment effects using artificial neural networks (ANNs) to approximate the unknown nuisance function of growing-dimensional confounders.

Causal Inference