Search Results for author: David Zhang

Found 78 papers, 27 papers with code

Perceptive self-supervised learning network for noisy image watermark removal

1 code implementation4 Mar 2024 Chunwei Tian, Menghua Zheng, Bo Li, Yanning Zhang, Shichao Zhang, David Zhang

Specifically, mentioned paired watermark images are obtained in a self supervised way, and paired noisy images (i. e., noisy and reference images) are obtained in a supervised way.

Self-Supervised Learning

Towards Efficient Hyperdimensional Computing Using Photonics

no code implementations29 Nov 2023 Farbin Fayza, Cansu Demirkiran, Hanning Chen, Che-Kai Liu, Avi Mohan, Hamza Errahmouni, Sanggeon Yun, Mohsen Imani, David Zhang, Darius Bunandar, Ajay Joshi

Over the past few years, silicon photonics-based computing has emerged as a promising alternative to CMOS-based computing for Deep Neural Networks (DNN).

Multi-adversarial Faster-RCNN with Paradigm Teacher for Unrestricted Object Detection

no code implementations International Journal of Computer Vision 2022 Zhenwei He, Lei Zhang, Xinbo Gao, David Zhang

Our proposed MAF has two distinct contributions: (1) The Hierarchical Domain Feature Alignment (HDFA) module is introduced to minimize the image-level domain disparity, where Scale Reduction Module (SRM) reduces the feature map size without information loss and increases the training efficiency.

Domain Adaptation Knowledge Distillation +2

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

no code implementations30 Nov 2022 David Zhang, Micah Carroll, Andreea Bobu, Anca Dragan

One of the most successful paradigms for reward learning uses human feedback in the form of comparisons.

Dimensionality Reduction

Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities

no code implementations10 Nov 2022 Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer

Later, we use our optimal tokenization strategy to train multiple embedding and output model to further improve our result.

A heterogeneous group CNN for image super-resolution

1 code implementation26 Sep 2022 Chunwei Tian, Yanning Zhang, WangMeng Zuo, Chia-Wen Lin, David Zhang, Yixuan Yuan

To prevent loss of original information, a multi-level enhancement mechanism guides a CNN to achieve a symmetric architecture for promoting expressive ability of HGSRCNN.

Image Super-Resolution

Multi-stage image denoising with the wavelet transform

1 code implementation26 Sep 2022 Chunwei Tian, Menghua Zheng, WangMeng Zuo, Bob Zhang, Yanning Zhang, David Zhang

In this paper, we propose a multi-stage image denoising CNN with the wavelet transform (MWDCNN) via three stages, i. e., a dynamic convolutional block (DCB), two cascaded wavelet transform and enhancement blocks (WEBs) and a residual block (RB).

Image Denoising

Learning with Local Gradients at the Edge

no code implementations17 Aug 2022 Michael Lomnitz, Zachary Daniels, David Zhang, Michael Piacentino

To enable learning on edge devices with fast convergence and low memory, we present a novel backpropagation-free optimization algorithm dubbed Target Projection Stochastic Gradient Descent (tpSGD).

Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification

1 code implementation CVPR 2022 Xinyu Lin, Jinxing Li, Zeyu Ma, Huafeng Li, Shuang Li, Kaixiong Xu, Guangming Lu, David Zhang

Based on our constructed dataset, we prove that with the increase of frames in a tracklet, the performance does meet more enhancement, demonstrating the significance of video-to-video matching in RGB-IR person Re-ID.

Cross-Modal Retrieval Person Re-Identification +2

Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators

no code implementations10 Jun 2022 Indhumathi Kandaswamy, Saurabh Farkya, Zachary Daniels, Gooitzen van der Wal, Aswin Raghavan, Yuzheng Zhang, Jun Hu, Michael Lomnitz, Michael Isnardi, David Zhang, Michael Piacentino

In this paper we present Hyper-Dimensional Reconfigurable Analytics at the Tactical Edge (HyDRATE) using low-SWaP embedded hardware that can perform real-time reconfiguration at the edge leveraging non-MAC (free of floating-point MultiplyACcumulate operations) deep neural nets (DNN) combined with hyperdimensional (HD) computing accelerators.

Few-Shot Learning Quantization

Saccade Mechanisms for Image Classification, Object Detection and Tracking

no code implementations10 Jun 2022 Saurabh Farkya, Zachary Daniels, Aswin Nadamuni Raghavan, David Zhang, Michael Piacentino

We examine how the saccade mechanism from biological vision can be used to make deep neural networks more efficient for classification and object detection problems.

Classification Image Classification +4

Image Super-resolution with An Enhanced Group Convolutional Neural Network

1 code implementation29 May 2022 Chunwei Tian, Yixuan Yuan, Shichao Zhang, Chia-Wen Lin, WangMeng Zuo, David Zhang

In this paper, we present an enhanced super-resolution group CNN (ESRGCNN) with a shallow architecture by fully fusing deep and wide channel features to extract more accurate low-frequency information in terms of correlations of different channels in single image super-resolution (SISR).

Image Super-Resolution

HIPA: Hierarchical Patch Transformer for Single Image Super Resolution

no code implementations19 Mar 2022 Qing Cai, Yiming Qian, Jinxing Li, Jun Lv, Yee-Hong Yang, Feng Wu, David Zhang

Transformer-based architectures start to emerge in single image super resolution (SISR) and have achieved promising performance.

Image Super-Resolution

Pseudocylindrical Convolutions for Learned Omnidirectional Image Compression

1 code implementation25 Dec 2021 Mu Li, Kede Ma, Jinxing Li, David Zhang

We first describe parametric pseudocylindrical representation as a generalization of common pseudocylindrical map projections.

ERP Image Compression

Pedestrian Detection by Exemplar-Guided Contrastive Learning

no code implementations17 Nov 2021 Zebin Lin, Wenjie Pei, Fanglin Chen, David Zhang, Guangming Lu

Instead of learning each of these diverse pedestrian appearance features individually as most existing methods do, we propose to perform contrastive learning to guide the feature learning in such a way that the semantic distance between pedestrians with different appearances in the learned feature space is minimized to eliminate the appearance diversities, whilst the distance between pedestrians and background is maximized.

Contrastive Learning Pedestrian Detection

Deep multi-modal aggregation network for MR image reconstruction with auxiliary modality

2 code implementations15 Oct 2021 Chun-Mei Feng, Huazhu Fu, Tianfei Zhou, Yong Xu, Ling Shao, David Zhang

Magnetic resonance (MR) imaging produces detailed images of organs and tissues with better contrast, but it suffers from a long acquisition time, which makes the image quality vulnerable to say motion artifacts.

Image Reconstruction

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain

no code implementations10 Oct 2021 Zengwei Yao, Wenjie Pei, Fanglin Chen, Guangming Lu, David Zhang

Existing methods for speech separation either transform the speech signals into frequency domain to perform separation or seek to learn a separable embedding space by constructing a latent domain based on convolutional filters.

speech-recognition Speech Recognition +1

BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion

1 code implementation4 Oct 2021 Zhaoqun Li, Xu Liang, Dandan Fan, Jinxing Li, David Zhang

Bimodal palmprint recognition leverages palmprint and palm vein images simultaneously, which achieves high accuracy by multi-model information fusion and has strong anti-falsification property.

Keypoint Detection Translation

Generative Memory-Guided Semantic Reasoning Model for Image Inpainting

no code implementations1 Oct 2021 Xin Feng, Wenjie Pei, Fengjun Li, Fanglin Chen, David Zhang, Guangming Lu

Most existing methods for image inpainting focus on learning the intra-image priors from the known regions of the current input image to infer the content of the corrupted regions in the same image.

Image Inpainting

Dual-Stream Reciprocal Disentanglement Learning for Domain Adaptation Person Re-Identification

1 code implementation26 Jun 2021 Huafeng Li, Kaixiong Xu, Jinxing Li, Guangming Lu, Yong Xu, Zhengtao Yu, David Zhang

Since human-labeled samples are free for the target set, unsupervised person re-identification (Re-ID) has attracted much attention in recent years, by additionally exploiting the source set.

Disentanglement Domain Adaptation +2

Asymmetric CNN for image super-resolution

1 code implementation25 Mar 2021 Chunwei Tian, Yong Xu, WangMeng Zuo, Chia-Wen Lin, David Zhang

In this paper, we propose an asymmetric CNN (ACNet) comprising an asymmetric block (AB), a memory enhancement block (MEB) and a high-frequency feature enhancement block (HFFEB) for image super-resolution.

Image Super-Resolution

Touchless Palmprint Recognition based on 3D Gabor Template and Block Feature Refinement

no code implementations3 Mar 2021 Zhaoqun Li, Xu Liang, Dandan Fan, Jinxing Li, Wei Jia, David Zhang

To our best knowledge, it is the largest contactless palmprint image benchmark ever collected with regard to the number of individuals and palms.

Person Identification

Deep-Masking Generative Network: A Unified Framework for Background Restoration from Superimposed Images

1 code implementation9 Oct 2020 Xin Feng, Wenjie Pei, Zihui Jia, Fanglin Chen, David Zhang, Guangming Lu

In this work we present the Deep-Masking Generative Network (DMGN), which is a unified framework for background restoration from the superimposed images and is able to cope with different types of noise.

Image Dehazing Image Generation +3

Designing and Training of A Dual CNN for Image Denoising

1 code implementation8 Jul 2020 Chunwei Tian, Yong Xu, WangMeng Zuo, Bo Du, Chia-Wen Lin, David Zhang

The enhancement block gathers and fuses the global and local features to provide complementary information for the latter network.

Image Denoising

Learning Context-Based Non-local Entropy Modeling for Image Compression

no code implementations10 May 2020 Mu Li, Kai Zhang, WangMeng Zuo, Radu Timofte, David Zhang

To address this issue, we propose a non-local operation for context modeling by employing the global similarity within the context.

Image Compression

Biometrics Recognition Using Deep Learning: A Survey

1 code implementation30 Nov 2019 Shervin Minaee, Amirali Abdolrashidi, Hang Su, Mohammed Bennamoun, David Zhang

Deep learning-based models have been very successful in achieving state-of-the-art results in many of the computer vision, speech recognition, and natural language processing tasks in the last few years.

Gait Recognition speech-recognition +1

Bit Efficient Quantization for Deep Neural Networks

no code implementations7 Oct 2019 Prateeth Nayak, David Zhang, Sek Chai

Quantization for deep neural networks have afforded models for edge devices that use less on-board memory and enable efficient low-power inference.

Clustering Quantization

Non-negative Sparse and Collaborative Representation for Pattern Classification

no code implementations20 Aug 2019 Jun Xu, Zhou Xu, Wangpeng An, Haoqian Wang, David Zhang

In this paper, we propose a novel Non-negative Sparse and Collaborative Representation (NSCR) for pattern classification.

Classification Face Recognition +1

Efficient and Effective Context-Based Convolutional Entropy Modeling for Image Compression

2 code implementations24 Jun 2019 Mu Li, Kede Ma, Jane You, David Zhang, WangMeng Zuo

For the former, we directly apply a CCN to the binarized representation of an image to compute the Bernoulli distribution of each code for entropy estimation.

Image Compression

Remove Cosine Window from Correlation Filter-based Visual Trackers: When and How

1 code implementation16 May 2019 Feng Li, Xiaohe Wu, WangMeng Zuo, David Zhang, Lei Zhang

Therefore, we in this paper investigate the feasibility to remove cosine window from CF trackers with spatial regularization.

Learning Content-Weighted Deep Image Compression

1 code implementation1 Apr 2019 Mu Li, WangMeng Zuo, Shuhang Gu, Jane You, David Zhang

Learning-based lossy image compression usually involves the joint optimization of rate-distortion performance.

Image Compression

Manifold Criterion Guided Transfer Learning via Intermediate Domain Generation

1 code implementation25 Mar 2019 Lei Zhang, Shan-Shan Wang, Guang-Bin Huang, WangMeng Zuo, Jian Yang, David Zhang

The merits of the proposed MCTL are four-fold: 1) the concept of manifold criterion (MC) is first proposed as a measure validating the distribution matching across domains, and domain adaptation is achieved if the MC is satisfied; 2) the proposed MC can well guide the generation of the intermediate domain sharing similar distribution with the target domain, by minimizing the local domain discrepancy; 3) a global generative discrepancy metric (GGDM) is presented, such that both the global and local discrepancy can be effectively and positively reduced; 4) a simplified version of MCTL called MCTL-S is presented under a perfect domain generation assumption for more generic learning scenario.

Transfer Learning Unsupervised Domain Adaptation

Scaled Simplex Representation for Subspace Clustering

3 code implementations26 Jul 2018 Jun Xu, Mengyang Yu, Ling Shao, WangMeng Zuo, Deyu Meng, Lei Zhang, David Zhang

However, the negative entries in the coefficient matrix are forced to be positive when constructing the affinity matrix via exponentiation, absolute symmetrization, or squaring operations.

Clustering

Sparse, Collaborative, or Nonnegative Representation: Which Helps Pattern Classification?

1 code implementation12 Jun 2018 Jun Xu, Wangpeng An, Lei Zhang, David Zhang

The use of sparse representation (SR) and collaborative representation (CR) for pattern classification has been widely studied in tasks such as face recognition and object categorization.

Classification Face Recognition +2

A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping

no code implementations CVPR 2018 Zhetong Liang, Jun Xu, David Zhang, Zisheng Cao, Lei Zhang

State-of-the-art tone mapping algorithms mostly decompose an image into a base layer and a detail layer, and process them accordingly.

Tone Mapping

Simultaneous Fidelity and Regularization Learning for Image Restoration

1 code implementation12 Apr 2018 Dongwei Ren, WangMeng Zuo, David Zhang, Lei Zhang, Ming-Hsuan Yang

For blind deconvolution, as estimation error of blur kernel is usually introduced, the subsequent non-blind deconvolution process does not restore the latent image well.

Denoising Image Deconvolution +1

Real-world Noisy Image Denoising: A New Benchmark

2 code implementations7 Apr 2018 Jun Xu, Hui Li, Zhetong Liang, David Zhang, Lei Zhang

In order to promote the study on this problem while implementing the concurrent real-world image denoising datasets, we construct a new benchmark dataset which contains comprehensive real-world noisy images of different natural scenes.

Image Denoising

Dual Asymmetric Deep Hashing Learning

1 code implementation25 Jan 2018 Jinxing Li, Bob Zhang, Guangming Lu, David Zhang

The deep hash functions are then learned through two networks by minimizing the gap between the learned features and discrete codes.

Deep Hashing

Enlarging Context with Low Cost: Efficient Arithmetic Coding with Trimmed Convolution

no code implementations15 Jan 2018 Mu Li, Shuhang Gu, David Zhang, WangMeng Zuo

One key issue of arithmetic encoding method is to predict the probability of the current coding symbol from its context, i. e., the preceding encoded symbols, which usually can be executed by building a look-up table (LUT).

Computational Efficiency Image Compression

Multi-channel Weighted Nuclear Norm Minimization for Real Color Image Denoising

no code implementations ICCV 2017 Jun Xu, Lei Zhang, David Zhang, Xiangchu Feng

Most of the existing denoising algorithms are developed for grayscale images, while it is not a trivial work to extend them for color image denoising because the noise statistics in R, G, B channels can be very different for real noisy images.

Color Image Denoising Image Denoising

External Prior Guided Internal Prior Learning for Real-World Noisy Image Denoising

no code implementations12 May 2017 Jun Xu, Lei Zhang, David Zhang

We propose to exploit the information in both external data and the given noisy image, and develop an external prior guided internal prior learning method for real-world noisy image denoising.

Image Denoising

Learning Convolutional Networks for Content-weighted Image Compression

1 code implementation CVPR 2018 Mu Li, WangMeng Zuo, Shuhang Gu, Debin Zhao, David Zhang

Therefore, the encoder, decoder, binarizer and importance map can be jointly optimized in an end-to-end manner by using a subset of the ImageNet database.

Binarization Image Compression +1

Low Precision Neural Networks using Subband Decomposition

no code implementations24 Mar 2017 Sek Chai, Aswin Raghavan, David Zhang, Mohamed Amer, Tim Shields

In this paper, we present a unique approach using lower precision weights for more efficient and faster training phase.

Deep Identity-aware Transfer of Facial Attributes

no code implementations18 Oct 2016 Mu Li, WangMeng Zuo, David Zhang

In general, our model consists of a mask network and an attribute transform network which work in synergy to generate a photo-realistic facial image with the reference attribute.

Attribute Denoising +3

Convolutional Network for Attribute-driven and Identity-preserving Human Face Generation

no code implementations23 Aug 2016 Mu Li, WangMeng Zuo, David Zhang

Here we address this problem from the view of optimization, and suggest an optimization model to generate human face with the given attributes while keeping the identity of the reference image.

Attribute Face Generation

Joint Learning of Single-Image and Cross-Image Representations for Person Re-Identification

no code implementations CVPR 2016 Faqiang Wang, WangMeng Zuo, Liang Lin, David Zhang, Lei Zhang

Person re-identification has been usually solved as either the matching of single-image representation (SIR) or the classification of cross-image representation (CIR).

Person Re-Identification

Multi-modal Fusion for Diabetes Mellitus and Impaired Glucose Regulation Detection

no code implementations12 Apr 2016 Jinxing Li, David Zhang, Yongcheng Li, Jian Wu

has proved that tongue, face and sublingual diagnosis as a noninvasive method is a reasonable way for disease detection.

Multi-modal Classification

Quadratic Projection Based Feature Extraction with Its Application to Biometric Recognition

no code implementations25 Mar 2016 Yan Yan, Hanzi Wang, Si Chen, Xiaochun Cao, David Zhang

This paper presents a novel quadratic projection based feature extraction framework, where a set of quadratic matrices is learned to distinguish each class from all other classes.

Learning Domain-Invariant Subspace using Domain Features and Independence Maximization

2 code implementations15 Mar 2016 Ke Yan, Lu Kou, David Zhang

In this paper, we focus on the problem of instrumental variation and time-varying drift in the field of sensors and measurement, which can be viewed as discrete and continuous distributional change in the feature space.

Domain Adaptation

A survey of sparse representation: algorithms and applications

no code implementations23 Feb 2016 Zheng Zhang, Yong Xu, Jian Yang, Xuelong. Li, David Zhang

The main purpose of this article is to provide a comprehensive study and an updated review on sparse representation and to supply a guidance for researchers.

Robust Scene Text Recognition Using Sparse Coding based Features

no code implementations29 Dec 2015 Da-Han Wang, Hanzi Wang, Dong Zhang, Jonathan Li, David Zhang

For character detection, we use the HSC features instead of using the Histograms of Oriented Gradients (HOG) features.

Scene Text Recognition

Patch Group Based Nonlocal Self-Similarity Prior Learning for Image Denoising

no code implementations ICCV 2015 Jun Xu, Lei Zhang, WangMeng Zuo, David Zhang, Xiangchu Feng

PGs are extracted from training images by putting nonlocal similar patches into groups, and a PG based Gaussian Mixture Model (PG-GMM) learning algorithm is developed to learn the NSS prior.

Image Denoising

SVM and ELM: Who Wins? Object Recognition with Deep Convolutional Features from ImageNet

no code implementations8 Jun 2015 Lei Zhang, David Zhang

In this report, we have discussed the nearest neighbor, support vector machines and extreme learning machines for image classification under deep convolutional activation feature representation.

General Classification Image Classification +1

Domain Adaptation Extreme Learning Machines for Drift Compensation in E-nose Systems

no code implementations24 May 2015 Lei Zhang, David Zhang

This paper proposes a unified framework, referred to as Domain Adaptation Extreme Learning Machine (DAELM), which learns a robust classifier by leveraging a limited number of labeled data from target domain for drift compensation as well as gases recognition in E-nose systems, without loss of the computational efficiency and learning ability of traditional ELM.

Computational Efficiency Domain Adaptation

Visual Understanding via Multi-Feature Shared Learning with Global Consistency

no code implementations20 May 2015 Lei Zhang, David Zhang

This paper studies visual understanding via a newly proposed l_2-norm based multi-feature shared learning framework, which can simultaneously learn a global label matrix and multiple sub-classifiers with the labeled multi-feature data.

Evolutionary Cost-sensitive Extreme Learning Machine

no code implementations17 May 2015 Lei Zhang, David Zhang

Conventional extreme learning machines solve a Moore-Penrose generalized inverse of hidden layer activated matrix and analytically determine the output weights to achieve generalized performance, by assuming the same loss from different types of misclassification.

Face Recognition

Robust Visual Knowledge Transfer via EDA

no code implementations17 May 2015 Lei Zhang, David Zhang

It allows us to learn a category transformation and an ELM classifier with random projection by minimizing the l_(2, 1)-norm of the network output weights and the learning error simultaneously.

Domain Adaptation Object Recognition +1

Iterated Support Vector Machines for Distance Metric Learning

no code implementations2 Feb 2015 Wangmeng Zuo, Faqiang Wang, David Zhang, Liang Lin, Yuchi Huang, Deyu Meng, Lei Zhang

Distance metric learning aims to learn from the given training data a valid distance metric, with which the similarity between data samples can be more effectively evaluated for classification.

Classification Face Verification +5

Fast Tracking via Spatio-Temporal Context Learning

no code implementations8 Nov 2013 Kaihua Zhang, Lei Zhang, Ming-Hsuan Yang, David Zhang

In this paper, we present a simple yet fast and robust algorithm which exploits the spatio-temporal context for visual tracking.

Position Visual Tracking

A Kernel Classification Framework for Metric Learning

no code implementations23 Sep 2013 Faqiang Wang, WangMeng Zuo, Lei Zhang, Deyu Meng, David Zhang

Learning a distance metric from the given training samples plays a crucial role in many machine learning tasks, and various models and optimization algorithms have been proposed in the past decade.

Classification General Classification +1

Image Set based Collaborative Representation for Face Recognition

no code implementations30 Aug 2013 Pengfei Zhu, WangMeng Zuo, Lei Zhang, Simon C. K. Shiu, David Zhang

One key issue of ISFR is how to effectively and efficiently represent the query face image set by using the gallery face image sets.

Face Recognition General Classification

Texture Enhanced Image Denoising via Gradient Histogram Preservation

no code implementations CVPR 2013 Wangmeng Zuo, Lei Zhang, Chunwei Song, David Zhang

Image denoising is a classical yet fundamental problem in low level vision, as well as an ideal test bed to evaluate various statistical image modeling methods.

Image Denoising

A Local Active Contour Model for Image Segmentation with Intensity Inhomogeneity

no code implementations30 May 2013 Kaihua Zhang, Lei Zhang, Kin-Man Lam, David Zhang

The means of the Gaussian distributions in the transformed domain can be adaptively estimated by multiplying a bias field with the original signal within the window.

Image Segmentation Semantic Segmentation

Collaborative Representation based Classification for Face Recognition

no code implementations11 Apr 2012 Lei Zhang, Meng Yang, Xiangchu Feng, Yi Ma, David Zhang

It is widely believed that the l1- norm sparsity constraint on coding coefficients plays a key role in the success of SRC, while its use of all training samples to collaboratively represent the query sample is rather ignored.

Classification Face Recognition +3

Regularized Robust Coding for Face Recognition

no code implementations20 Feb 2012 Meng Yang, Lei Zhang, Jian Yang, David Zhang

Recently the sparse representation based classification (SRC) has been proposed for robust face recognition (FR).

Face Recognition Robust Face Recognition +1

Essence of kernel Fisher discriminant: KPCA plus LDA

no code implementations Pattern Recognition 2003 Jian Yang; Zhong Jin; Jing-yu Yang, David Zhang, Alejandro F. Frangi

In this paper, the method of kernel Fisher discriminant (KFD) is analyzed and its nature is revealed, i. e., KFD is equivalent to kernel principal component analysis (KPCA) plus Fisher linear discriminant analysis (LDA).

Cannot find the paper you are looking for? You can Submit a new open access paper.