Search Results for author: Yifan Zhao

Found 35 papers, 16 papers with code

Image Quality Assessment for Omnidirectional Cross-reference Stitching

no code implementations 10 Apr 2019 Kaiwen Yu, Jia Li, Yu Zhang, Yifan Zhao, Long Xu

Along with the development of virtual reality (VR), omnidirectional images play an important role in producing multimedia content with immersive experience.

Image Quality Assessment Image Stitching

Cartoon Face Recognition: A Benchmark Dataset

1 code implementation 31 Jul 2019 Yi Zheng, Yifan Zhao, Mengyuan Ren, He Yan, Xiangju Lu, Junhui Liu, Jia Li

Recent years have witnessed increasing attention in cartoon media, powered by the strong demands of industrial applications.

Domain Adaptation Face Detection +4

Multi-Class Part Parsing With Joint Boundary-Semantic Awareness

no code implementations ICCV 2019 Yifan Zhao, Jia Li, Yu Zhang, Yonghong Tian

In this paper, we propose a joint parsing framework with boundary and semantic awareness to address this challenging problem.

2D Semantic Segmentation

Misalignment Resilient Diffractive Optical Networks

no code implementations 23 May 2020 Deniz Mengu, Yifan Zhao, Nezih T. Yardimci, Yair Rivenson, Mona Jarrahi, Aydogan Ozcan

By modeling the undesired layer-to-layer misalignments in 3D as continuous random variables in the optical forward model, diffractive networks are trained to maintain their inference accuracy over a large range of misalignments; we term this diffractive network design as vaccinated D2NN (v-D2NN).

Object Recognition

Is Depth Really Necessary for Salient Object Detection?

1 code implementation 30 May 2020 Jia-Wei Zhao, Yifan Zhao, Jia Li, Xiaowu Chen

To solve this, many recent RGBD-based networks adopt the depth map as an independent input and fuse the features with RGB information.

Object object-detection +3

Cooperative Bi-path Metric for Few-shot Learning

1 code implementation 10 Aug 2020 Zeyuan Wang, Yifan Zhao, Jia Li, Yonghong Tian

Given base classes with sufficient labeled samples, the target of few-shot classification is to recognize unlabeled samples of novel classes with only a few labeled samples.

Classification Few-Shot Learning +1

DanceIt: Music-inspired Dancing Video Synthesis

1 code implementation 17 Sep 2020 Xin Guo, Yifan Zhao, Jia Li

To explore the relationship between music and dance movements, we propose a cross-modal alignment module that focuses on dancing video clips, accompanied by pre-designed music, to learn a system that can judge the consistency between the visual features of pose sequences and the acoustic features of music.

Holistic Combination of Structural and Textual Code Information for Context based API Recommendation

no code implementations 15 Oct 2020 Chi Chen, Xin Peng, Zhenchang Xing, Jun Sun, Xin Wang, Yifan Zhao, Wenyun Zhao

APIRec-CST is a deep learning model that combines the API usage with the text information in the source code based on an API Context Graph Network and a Code Token Network that simultaneously learn structural and textual features for API recommendation.

Learning Predictive Communication by Imagination in Networked System Control

no code implementations 1 Jan 2021 Yali Du, Yifan Zhao, Meng Fang, Jun Wang, Gangyan Xu, Haifeng Zhang

Dealing with multi-agent control in networked systems is one of the biggest challenges in Reinforcement Learning (RL), and limited success has been presented compared to recent deep reinforcement learning in the single-agent domain.

reinforcement-learning Reinforcement Learning (RL)

SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering

1 code implementation 1 Aug 2021 Yifan Zhao, Le Hui, Jin Xie

To achieve this, we exploit the consistency between the input sparse point cloud and generated dense point cloud for the shapes and rendered images.

point cloud upsampling

FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning

1 code implementation ICCV 2021 Chenxu Zhang, Yifan Zhao, Yifei HUANG, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo

In this paper, we propose a talking face generation method that takes an audio signal as input and a short target video clip as reference, and synthesizes a photo-realistic video of the target face with natural lip motions, head poses, and eye blinks that are in sync with the input audio signal.

3D Face Animation Attribute +2

Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification

1 code implementation 8 Sep 2021 Zhongxing Ma, Yifan Zhao, Jia Li

Therefore, we propose a Pose-guided inter- and intra-part relational transformer (Pirt) for occluded person Re-Id, which builds part-aware long-term correlations by introducing transformers.

Person Re-Identification

RGB-D Salient Object Detection with Ubiquitous Target Awareness

no code implementations 8 Sep 2021 Yifan Zhao, Jiawei Zhao, Jia Li, Xiaowu Chen

To construct our framework and achieve accurate salient detection results, we propose a Ubiquitous Target Awareness (UTA) network to solve three important challenges in the RGB-D SOD task: 1) a depth awareness module to excavate depth information and to mine ambiguous regions via adaptive depth-error weights, 2) a spatial-aware cross-modal interaction and a channel-aware cross-level interaction, exploiting the low-level boundary cues and amplifying high-level salient channels, and 3) a gated multi-scale predictor module to perceive the object saliency in different contextual scales.

Object object-detection +4

Heterogeneous Relational Complement for Vehicle Re-identification

1 code implementation ICCV 2021 Jiajian Zhao, Yifan Zhao, Jia Li, Ke Yan, Yonghong Tian

The crucial problem in vehicle re-identification is to find the same vehicle identity when reviewing this object from cross-view cameras, which sets a higher demand for learning viewpoint-invariant representations.

Vehicle Re-Identification

Transformer-based Dual Relation Graph for Multi-label Image Recognition

1 code implementation ICCV 2021 Jiawei Zhao, Ke Yan, Yifan Zhao, Xiaowei Guo, Feiyue Huang, Jia Li

Unlike these studies, in this paper we propose a novel Transformer-based Dual Relation learning framework, constructing complementary relationships by exploring two aspects of correlation, i.e., a structural relation graph and a semantic relation graph.

Multi-Label Classification Relation

A Structure Feature Algorithm for Multi-modal Forearm Registration

no code implementations 10 Nov 2021 Jiaxin Li, Yan Ding, Weizhong Zhang, Yifan Zhao, Lingxi Guo, Zhe Yang

Augmented reality technology based on image registration is becoming increasingly popular for the convenience of pre-surgery preparation and medical education.

Image Registration

To image, or not to image: Class-specific diffractive cameras with all-optical erasure of undesired objects

no code implementations 26 May 2022 Bijie Bai, Yi Luo, Tianyi Gan, Jingtian Hu, Yuhang Li, Yifan Zhao, Deniz Mengu, Mona Jarrahi, Aydogan Ozcan

Here, we demonstrate a camera design that performs class-specific imaging of target objects with instantaneous all-optical erasure of other classes of objects.

Privacy Preserving

Super-resolution image display using diffractive decoders

no code implementations 15 Jun 2022 Cagatay Isil, Deniz Mengu, Yifan Zhao, Anika Tabassum, Jingxi Li, Yi Luo, Mona Jarrahi, Aydogan Ozcan

We report a deep learning-enabled diffractive display design that is based on a jointly-trained pair of an electronic encoder and a diffractive optical decoder to synthesize/project super-resolved images using low-resolution wavefront modulators.

Super-Resolution

Diffractive Interconnects: All-Optical Permutation Operation Using Diffractive Networks

no code implementations 21 Jun 2022 Deniz Mengu, Yifan Zhao, Anika Tabassum, Mona Jarrahi, Aydogan Ozcan

Permutation matrices form an important computational building block frequently used in various fields, e.g., communications, information security, and data processing.

Unidirectional Imaging using Deep Learning-Designed Materials

no code implementations 5 Dec 2022 Jingxi Li, Tianyi Gan, Yifan Zhao, Bijie Bai, Che-Yung Shen, Songyu Sun, Mona Jarrahi, Aydogan Ozcan

A unidirectional imager would only permit image formation along one direction, from an input field-of-view (FOV) A to an output FOV B, and in the reverse path, the image formation would be blocked.

Blocking

Part-guided Relational Transformers for Fine-grained Visual Recognition

1 code implementation 28 Dec 2022 Yifan Zhao, Jia Li, Xiaowu Chen, Yonghong Tian

This framework, namely PArt-guided Relational Transformers (PART), is proposed to learn the discriminative part features with an automatic part discovery module, and to explore the intrinsic correlations with a feature transformation module by adapting the Transformer models from the field of natural language processing.

Fine-Grained Image Classification Fine-Grained Visual Recognition +1

Parsing Objects at a Finer Granularity: A Survey

no code implementations 28 Dec 2022 Yifan Zhao, Jia Li, Yonghong Tian

Fine-grained visual parsing, including fine-grained part segmentation and fine-grained object recognition, has attracted considerable critical attention due to its importance in many real-world applications, e.g., agriculture, remote sensing, and space technologies.

Object Recognition Segmentation

Mobiprox: Supporting Dynamic Approximate Computing on Mobiles

no code implementations 16 Mar 2023 Matevž Fabjančič, Octavian Machidon, Hashim Sharif, Yifan Zhao, Saša Misailović, Veljko Pejović

Runtime-tunable context-dependent network compression would make mobile deep learning (DL) adaptable to often varying resource availability, input "difficulty", or user needs.

Human Activity Recognition

Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning

1 code implementation CVPR 2023 Zeyin Song, Yifan Zhao, Yujun Shi, Peixi Peng, Li Yuan, Yonghong Tian

However, in this work, we find that the CE loss is not ideal for the base session training as it suffers from poor class separation in terms of representations, which further degrades generalization to novel classes.

Contrastive Learning Few-Shot Class-Incremental Learning +1

Universal Polarization Transformations: Spatial programming of polarization scattering matrices using a deep learning-designed diffractive polarization transformer

no code implementations 12 Apr 2023 Yuhang Li, Jingxi Li, Yifan Zhao, Tianyi Gan, Jingtian Hu, Mona Jarrahi, Aydogan Ozcan

We demonstrate universal polarization transformers based on an engineered diffractive volume, which can synthesize a large set of arbitrarily-selected, complex-valued polarization scattering matrices between the polarization states at different positions within its input and output field-of-views (FOVs).

Picking Up Quantization Steps for Compressed Image Classification

1 code implementation 21 Apr 2023 Li Ma, Peixi Peng, Guangyao Chen, Yifan Zhao, Siwei Dong, Yonghong Tian

The sensitivity of deep neural networks to compressed images hinders their usage in many real applications, which means classification networks may fail just after taking a screenshot and saving it as a compressed file.

Classification Image Classification +1

Dual Adaptive Representation Alignment for Cross-domain Few-shot Learning

1 code implementation 18 Jun 2023 Yifan Zhao, Tong Zhang, Jia Li, Yonghong Tian

Recent progress in this setting assumes that the base knowledge and novel query samples are distributed in the same domains, which is usually infeasible for realistic applications.

cross-domain few-shot learning

Semantic Contrastive Bootstrapping for Single-positive Multi-label Recognition

1 code implementation 15 Jul 2023 Cheng Chen, Yifan Zhao, Jia Li

Learning multi-label image recognition with incomplete annotation is gaining popularity due to its superior performance and significant labor savings when compared to training with fully labeled datasets.

Contrastive Learning Multi-Label Classification

SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream

1 code implementation 17 Mar 2024 Lin Zhu, Kangmin Jia, Yifan Zhao, Yunshan Qi, Lizhi Wang, Hua Huang

Spike cameras, leveraging spike-based integration sampling and high temporal resolution, offer distinct advantages over standard cameras.
