Search Results for author: Yandong Guo

Found 22 papers, 6 papers with code

Improving Robustness of Adversarial Attacks Using an Affine-Invariant Gradient Estimator

no code implementations 13 Sep 2021 Wenzhao Xiang, Hang Su, Chang Liu, Yandong Guo, Shibao Zheng

Adversarial examples can deceive a deep neural network (DNN) by significantly altering its response with imperceptible perturbations, which poses new potential vulnerabilities given the growing ubiquity of DNNs.

Adversarial Attack, Affine Transformation

The 2nd Anti-UAV Workshop & Challenge: Methods and Results

no code implementations 23 Aug 2021 Jian Zhao, Gang Wang, Jianan Li, Lei Jin, Nana Fan, Min Wang, Xiaojuan Wang, Ting Yong, Yafeng Deng, Yandong Guo, Shiming Ge, Guodong Guo

The 2nd Anti-UAV Workshop & Challenge aims to encourage research in developing novel and accurate methods for multi-scale object tracking.

Object Tracking

Generator Pyramid for High-Resolution Image Inpainting

no code implementations 4 Dec 2020 Leilei Cao, Tong Yang, Yixu Wang, Bo Yan, Yandong Guo

Thus, our model consists of a pyramid of fully convolutional GANs, wherein the content GAN is responsible for completing contents in the lowest-resolution masked image, and each texture GAN is responsible for synthesizing textures in a higher-resolution image.

Image Inpainting, Texture Synthesis

Perceptual Extreme Super Resolution Network with Receptive Field Block

1 code implementation 26 May 2020 Taizhang Shang, Qiuju Dai, Shengchen Zhu, Tong Yang, Yandong Guo

Third, we alternately use different upsampling methods in the upsampling stage to reduce the high computational complexity while still maintaining satisfactory performance.

Image Super-Resolution, Object Detection

Discriminative Multi-modality Speech Recognition

2 code implementations CVPR 2020 Bo Xu, Cheng Lu, Yandong Guo, Jacob Wang

Vision is often used as a complementary modality for audio speech recognition (ASR), especially in noisy environments where the performance of the audio modality alone deteriorates significantly.

Audio-Visual Speech Recognition, Lipreading, +1

Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild

no code implementations 7 Apr 2020 Zhecan Wang, Jian Zhao, Cheng Lu, Han Huang, Fan Yang, Lianji Li, Yandong Guo

To better demonstrate the advantage of our methods, we further propose a new benchmark dataset with the richest distribution of head-gaze combinations reflecting real-world scenarios.

Gaze Estimation

To See in the Dark: N2DGAN for Background Modeling in Nighttime Scene

no code implementations 12 Dec 2019 Zhenfeng Zhu, Yingying Meng, Deqiang Kong, Xingxing Zhang, Yandong Guo, Yao Zhao

Due to insufficient illumination and uneven lighting, nighttime images have lower contrast and higher noise than their daytime counterparts of the same scene, which seriously limits the performance of conventional background modeling methods.

Dually Supervised Feature Pyramid for Object Detection and Segmentation

1 code implementation 8 Dec 2019 Fan Yang, Cheng Lu, Yandong Guo, Longin Jan Latecki, Haibin Ling

Feature pyramid architectures have been broadly adopted in object detection and segmentation to deal with the multi-scale problem.

Object Detection

Generative One-Shot Face Recognition

no code implementations 28 Sep 2019 Zhengming Ding, Yandong Guo, Lei Zhang, Yun Fu

Specifically, we aim to build a more effective general face classifier for both normal persons and one-shot persons.

Face Recognition, One-Shot Learning, +1

Edge Heuristic GAN for Non-uniform Blind Deblurring

no code implementations 11 Jul 2019 Shuai Zheng, Zhenfeng Zhu, Jian Cheng, Yandong Guo, Yao Zhao

Non-uniform blur, mainly caused by camera shake and motions of multiple objects, is one of the most common causes of image quality degradation.

Deblurring

Large Scale Incremental Learning

2 code implementations CVPR 2019 Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo, Yun Fu

We believe this is because of the combination of two factors: (a) the data imbalance between the old and new classes, and (b) the increasing number of visually similar classes.

Incremental Learning

Learning to Count Objects with Few Exemplar Annotations

no code implementations 20 May 2019 Jianfeng Wang, Rong Xiao, Yandong Guo, Lei Zhang

In this paper, we study the problem of object counting with incomplete annotations.

Object Counting, Object Detection

Revisit Multinomial Logistic Regression in Deep Learning: Data Dependent Model Initialization for Image Recognition

no code implementations 17 Sep 2018 Bowen Cheng, Rong Xiao, Yandong Guo, Yuxiao Hu, Jian-Feng Wang, Lei Zhang

In this paper, we study how to initialize the parameters of multinomial logistic regression (a fully connected layer followed by softmax and cross-entropy loss), which is widely used in deep neural network (DNN) models for classification problems.

Classification, General Classification, +3

Incremental Classifier Learning with Generative Adversarial Networks

no code implementations 2 Feb 2018 Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo, Zhengyou Zhang, Yun Fu

To address these problems, we propose (a) a new loss function to combine the cross-entropy loss and distillation loss, (b) a simple way to estimate and remove the imbalance between the old and new classes, and (c) using Generative Adversarial Networks (GANs) to generate historical data and select representative exemplars during generation.

General Classification

One-shot Face Recognition by Promoting Underrepresented Classes

1 code implementation 18 Jul 2017 Yandong Guo, Lei Zhang

First, we build a face feature extraction model and improve its performance, especially for persons with very limited training samples, by introducing a regularizer to the cross-entropy loss for multinomial logistic regression (MLR) learning.

Face Identification, Face Recognition

Model-based Iterative Restoration for Binary Document Image Compression with Dictionary Learning

no code implementations CVPR 2017 Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman

Experimental results with a variety of document images demonstrate that our method improves the image quality compared with the observed image, and simultaneously improves the compression ratio.

Dictionary Learning, Image Compression

MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

10 code implementations 27 Jul 2016 Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, Jianfeng Gao

In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and linking them to corresponding entity keys in a knowledge base.

Face Recognition, Image Captioning
