Search Results for author: Yan Luo

Found 44 papers, 22 papers with code

n-Reference Transfer Learning for Saliency Prediction

1 code implementation • ECCV 2020 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

The proposed framework is gradient-based and model-agnostic.

Paper
Code

NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

3 code implementations • 22 Apr 2024 • Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin, Yu Zhu, Peng Wu, Wei Dong, Jinqiu Sun, Yanning Zhang, Qingsen Yan, Wenbin Zou, Weipeng Yang, Yunxiang Li, Qiaomu Wei, Tian Ye, Sixiang Chen, Zhao Zhang, Suiyi Zhao, Bo wang, Yan Luo, Zhichao Zuo, Mingshen Wang, Junhu Wang, Yanyan Wei, Xiaopeng Sun, Yu Gao, Jiancheng Huang, Hongming Chen, Xiang Chen, Hui Tang, Yuanbin Chen, Yuanbo Zhou, Xinwei Dai, Xintao Qiu, Wei Deng, Qinquan Gao, Tong Tong, Mingjia Li, Jin Hu, Xinyu He, Xiaojie Guo, sabarinathan, K Uma, A Sasithradevi, B Sathya Bama, S. Mohamed Mansoor Roomi, V. Srivatsav, Jinjuan Wang, Long Sun, Qiuying Chen, Jiahong Shao, Yizhi Zhang, Marcos V. Conde, Daniel Feijoo, Juan C. Benito, Alvaro García, Jaeho Lee, Seongwan Kim, Sharif S M A, Nodirkhuja Khujaev, Roman Tsoy, Ali Murtaza, Uswah Khairuddin, Ahmad 'Athif Mohd Faudzi, Sampada Malagi, Amogh Joshi, Nikhil Akalwadi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Wenyi Lian, Wenjing Lian, Jagadeesh Kalyanshetti, Vijayalaxmi Ashok Aralikatti, Palani Yashaswini, Nitish Upasi, Dikshit Hegde, Ujwala Patil, Sujata C, Xingzhuo Yan, Wei Hao, Minghan Fu, Pooja Choksy, Anjali Sarvaiya, Kishor Upla, Kiran Raja, Hailong Yan, Yunkai Zhang, Baiang Li, Jingyi Zhang, Huan Zheng

This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results.

4k Low-Light Image Enhancement +1

488

Paper
Code

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

1 code implementation • 16 Apr 2024 • Bin Ren, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, ZhengJun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Barsoum Emad, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, YuHan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Qian Wang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi

In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking.

Image Super-Resolution

Paper
Code

NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

1 code implementation • 15 Apr 2024 • Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou, Hongyu An, Xinfeng Zhang, Zhiyuan Song, Ziyue Dong, Qing Zhao, Xiaogang Xu, Pengxu Wei, Zhi-chao Dou, Gui-ling Wang, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Cansu Korkmaz, A. Murat Tekalp, Yubin Wei, Xiaole Yan, Binren Li, Haonan Chen, Siqi Zhang, Sihan Chen, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi, Anjali Sarvaiya, Pooja Choksy, Jagrit Joshi, Shubh Kawa, Kishor Upla, Sushrut Patwardhan, Raghavendra Ramachandra, Sadat Hossain, Geongi Park, S. M. Nadim Uddin, Hao Xu, Yanhui Guo, Aman Urumbekov, Xingzhuo Yan, Wei Hao, Minghan Fu, Isaac Orais, Samuel Smith, Ying Liu, Wangwang Jia, Qisheng Xu, Kele Xu, Weijun Yuan, Zhan Li, Wenqin Kuang, Ruijin Guan, Ruting Deng, Zhao Zhang, Bo wang, Suiyi Zhao, Yan Luo, Yanyan Wei, Asif Hussain Khan, Christian Micheloni, Niki Martinel

This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained.

Image Super-Resolution valid

Paper
Code

Streamlined Photoacoustic Image Processing with Foundation Models: A Training-Free Solution

no code implementations • 11 Apr 2024 • Handi Deng, Yucheng Zhou, Jiaxuan Xiang, Liujie Gu, Yan Luo, Hai Feng, Mingyuan Liu, Cheng Ma

Foundation models have rapidly evolved and have achieved significant accomplishments in computer vision tasks.

Image Segmentation Segmentation +1

Paper
Add Code

FairCLIP: Harnessing Fairness in Vision-Language Learning

1 code implementation • 29 Mar 2024 • Yan Luo, Min Shi, Muhammad Osama Khan, Muhammad Muneeb Afzal, Hao Huang, Shuaihang Yuan, Yu Tian, Luo Song, Ava Kouhana, Tobias Elze, Yi Fang, Mengyu Wang

Fairness is a critical concern in deep learning, especially in healthcare, where these models influence diagnoses and treatment decisions.

Fairness

Paper
Code

MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder

no code implementations • 7 Mar 2024 • Lei LI, Tianfang Zhang, Xinglin Zhang, Jiaqi Liu, Bingqi Ma, Yan Luo, Tao Chen

Within the domain of medical analysis, extensive research has explored the potential of mutual learning between Masked Autoencoders(MAEs) and multimodal data.

Representation Learning Zero-Shot Learning

Paper
Add Code

TransFlower: An Explainable Transformer-Based Model with Flow-to-Flow Attention for Commuting Flow Prediction

1 code implementation • 23 Feb 2024 • Yan Luo, Zhuoyue Wan, Yuzhong Chen, Gengchen Mai, Fu-Lai Chung, Kent Larson

Understanding the link between urban planning and commuting flows is crucial for guiding urban development and policymaking.

Paper
Code

Prismatic: Interactive Multi-View Cluster Analysis of Concept Stocks

no code implementations • 14 Feb 2024 • Wong Kam-Kwai, Yan Luo, Xuanwu Yue, Wei Chen, Huamin Qu

Financial cluster analysis allows investors to discover investment alternatives and avoid undertaking excessive risks.

Clustering

Paper
Add Code

Cut-and-Paste: Subject-Driven Video Editing with Attention Control

no code implementations • 20 Nov 2023 • Zhichao Zuo, Zhao Zhang, Yan Luo, Yang Zhao, Haijun Zhang, Yi Yang, Meng Wang

This paper presents a novel framework termed Cut-and-Paste for real-word semantic video editing under the guidance of text prompt and additional reference image.

Object Video Editing

Paper
Add Code

Passive Handwriting Tracking via Weak mmWave Communication Signals

no code implementations • 3 Nov 2023 • Chao Yu, Yan Luo, Renqi Chen, Rui Wang

In this letter, a cooperative sensing framework based on millimeter wave (mmWave) communication systems is proposed to detect tiny motions with a millimeter-level resolution.

Paper
Add Code

FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

1 code implementation • 3 Nov 2023 • Yu Tian, Min Shi, Yan Luo, Ava Kouhana, Tobias Elze, Mengyu Wang

Existing medical fairness datasets are all for classification tasks, and no fairness datasets are available for medical segmentation, while medical segmentation is an equally important clinical task as classifications, which can provide detailed spatial information on organ abnormalities ready to be assessed by clinicians.

Fairness Image Segmentation +3

Paper
Code

FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling

no code implementations • 3 Oct 2023 • Yan Luo, Muhammad Osama Khan, Yu Tian, Min Shi, Zehao Dou, Tobias Elze, Yi Fang, Mengyu Wang

To address this research gap, we conduct the first comprehensive study on the fairness of 3D medical imaging models across multiple protected attributes.

Fairness

Paper
Add Code

Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning

no code implementations • ICCV 2023 • Yan Luo, Min Shi, Yu Tian, Tobias Elze, Mengyu Wang

This is the largest glaucoma detection dataset with 3D OCT imaging data and the first glaucoma progression forecasting dataset that is publicly available.

Fairness

Paper
Add Code

Focus the Discrepancy: Intra- and Inter-Correlation Learning for Image Anomaly Detection

1 code implementation • ICCV 2023 • Xincheng Yao, Ruoqi Li, Zefeng Qian, Yan Luo, Chongyang Zhang

Humans recognize anomalies through two aspects: larger patch-wise representation discrepancies and weaker patch-to-normal-patch correlations.

Anomaly Detection Self-Supervised Learning

Paper
Code

Harvard Glaucoma Fairness: A Retinal Nerve Disease Dataset for Fairness Learning and Fair Identity Normalization

1 code implementation • 15 Jun 2023 • Yan Luo, Yu Tian, Min Shi, Louis R. Pasquale, Lucy Q. Shen, Nazlee Zebardast, Tobias Elze, Mengyu Wang

To address this gap, we introduce Harvard Glaucoma Fairness (Harvard-GF), a retinal nerve disease dataset with both 2D and 3D imaging data and balanced racial groups for glaucoma detection.

Fairness Feature Importance

Paper
Code

Timestamps as Prompts for Geography-Aware Location Recommendation

no code implementations • 9 Apr 2023 • Yan Luo, Haoyi Duan, Ye Liu, Fu-Lai Chung

In this paper, we revisit the problem of location recommendation and point out that explicitly modeling temporal information is a great help when the model needs to predict not only the next location but also further locations.

Paper
Add Code

End-to-End Personalized Next Location Recommendation via Contrastive User Preference Modeling

no code implementations • 22 Mar 2023 • Yan Luo, Ye Liu, Fu-Lai Chung, Yu Liu, Chang Wen Chen

History encoder is designed to model mobility patterns from historical check-in sequences, while query generator explicitly learns user preferences to generate user-specific intention queries.

Paper
Add Code

mmAlert: mmWave Link Blockage Prediction via Passive Sensing

no code implementations • 22 Feb 2023 • Chao Yu, Yifei Sun, Yan Luo, Rui Wang

It is demonstrated via experiments that the mmAlert system can always detect the motions of the walking person close to the LoS path, and predict 90\% of the LoS blockage with sensing time of 1. 4 seconds.

Paper
Add Code

Long-Range Zero-Shot Generative Deep Network Quantization

no code implementations • 13 Nov 2022 • Yan Luo, Yangcheng Gao, Zhao Zhang, Haijun Zhang, Mingliang Xu, Meng Wang

We find it is because: 1) a normal generator is hard to obtain high diversity of synthetic data, since it lacks long-range information to allocate attention to global features; 2) the synthetic images aim to simulate the statistics of real data, which leads to weak intra-class heterogeneity and limited feature richness.

Knowledge Distillation Quantization

Paper
Add Code

GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition

1 code implementation • 28 Oct 2022 • Jia-Xin Ye, Xin-Cheng Wen, Xuan-Ze Wang, Yong Xu, Yan Luo, Chang-Li Wu, Li-Yan Chen, Kun-Hong Liu

In this paper, we propose a Gated Multi-scale Temporal Convolutional Network (GM-TCNet) to construct a novel emotional causality representation learning component with a multi-scale receptive field.

Representation Learning Speech Emotion Recognition

139

Paper
Code

A New Task: Deriving Semantic Class Targets for the Physical Sciences

1 code implementation • 26 Oct 2022 • Micah Bowles, Hongming Tang, Eleni Vardoulaki, Emma L. Alexander, Yan Luo, Lawrence Rudnick, Mike Walmsley, Fiona Porter, Anna M. M. Scaife, Inigo Val Slijepcevic, Gary Segal

We define deriving semantic class targets as a novel multi-modal task.

Astronomy

Paper
Code

Artifact-Tolerant Clustering-Guided Contrastive Embedding Learning for Ophthalmic Images

1 code implementation • 2 Sep 2022 • Min Shi, Anagha Lokhande, Mojtaba S. Fazli, Vishal Sharma, Yu Tian, Yan Luo, Louis R. Pasquale, Tobias Elze, Michael V. Boland, Nazlee Zebardast, David S. Friedman, Lucy Q. Shen, Mengyu Wang

Ophthalmic images and derivatives such as the retinal nerve fiber layer (RNFL) thickness map are crucial for detecting and monitoring ophthalmic diseases (e. g., glaucoma).

Clustering Contrastive Learning +1

Paper
Code

CTL-MTNet: A Novel CapsNet and Transfer Learning-Based Mixed Task Net for the Single-Corpus and Cross-Corpus Speech Emotion Recognition

no code implementations • 18 Jul 2022 • Xin-Cheng Wen, Jia-Xin Ye, Yan Luo, Yong Xu, Xuan-Ze Wang, Chang-Li Wu, Kun-Hong Liu

For the single-corpus task, the combination of Convolution-Pooling and Attention CapsNet module CPAC) is designed by embedding the self-attention mechanism to the CapsNet, guiding the module to focus on the important features that can be fed into different capsules.

Cross-corpus Speech Emotion Recognition +1

Paper
Add Code

Passive Motion Detection via mmWave Communication System

no code implementations • 28 Mar 2022 • Jie Li, Chao Yu, Yan Luo, Yifei Sun, Rui Wang

Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively.

Hand Gesture Recognition Hand-Gesture Recognition +1

Paper
Add Code

Learning to Minimize the Remainder in Supervised Learning

1 code implementation • 23 Jan 2022 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

To this end, we propose a new learning approach, namely gradient adjustment learning (GAL), to leverage the knowledge learned from the past training iterations to adjust vanilla gradients, such that the remainders are minimized and the approximations are improved.

Image Classification Image Retrieval +3

Paper
Code

Learning to Predict Gradients for Semi-Supervised Continual Learning

1 code implementation • 23 Jan 2022 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

To explore these issues, we formulate a new semi-supervised continual learning method, which can be generically applied to existing continual learning models.

Continual Learning

Paper
Code

Swin-Pose: Swin Transformer Based Human Pose Estimation

no code implementations • 19 Jan 2022 • Zinan Xiong, Chenxi Wang, Ying Li, Yan Luo, Yu Cao

We are interested in exploring its capability in human pose estimation, and thus propose a novel model based on transformer architecture, enhanced with a feature pyramid fusion structure.

Pose Estimation

Paper
Add Code

Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images

1 code implementation • 22 Nov 2021 • Ye Liu, Huifang Li, Chao Hu, Shuang Luo, Yan Luo, Chang Wen Chen

The proposed model exploits three lightweight plug-and-play modules, namely dense feature pyramid network (DenseFPN), spatial context pyramid (SCP), and hierarchical region of interest extractor (HRoIE), to aggregate global visual context at feature, spatial, and instance domains, respectively.

Instance Segmentation Object Detection

Paper
Code

Learning to Predict Trustworthiness with Steep Slope Loss

1 code implementation • NeurIPS 2021 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

Secondly, due to the data complexity, it is challenging to differentiate the incorrect predictions from the correct ones on real-world large-scale datasets.

Paper
Code

Multi-Task Hierarchical Learning Based Network Traffic Analytics

1 code implementation • 5 Jun 2021 • Onur Barut, Yan Luo, Tong Zhang, Weigang Li, Peilong Li

Classifying network traffic is the basis for important network applications.

Malware Detection

Paper
Code

Embracing Uncertainty: Decoupling and De-bias for Robust Temporal Grounding

no code implementations • CVPR 2021 • Hao Zhou, Chongyang Zhang, Yan Luo, Yanjun Chen, Chuanping Hu

Meanwhile, modified feature assigned with style-like words (including adjectives, adverbs, etc) represents the subjective information, and thus brings personalized predictions; De-bias - We propose a de-bias mechanism to generate diverse predictions, aim to alleviate the bias caused by single-style annotations in the presence of label uncertainty.

Relation

Paper
Add Code

Which to Match? Selecting Consistent GT-Proposal Assignment for Pedestrian Detection

no code implementations • 18 Mar 2021 • Yan Luo, Chongyang Zhang, Muming Zhao, Hao Zhou, Jun Sun

Consequently, we address the weakness of IoU by introducing one geometric sensitive search algorithm as a new assignment and regression metric.

Autonomous Driving Pedestrian Detection +1

Paper
Add Code

Where, What, Whether: Multi-modal Learning Meets Pedestrian Detection

no code implementations • CVPR 2020 • Yan Luo, Chongyang Zhang, Muming Zhao, Hao Zhou, Jun Sun

i) We generate a bird view map, which is naturally free from occlusion issues, and scan all points on it to look for suitable locations for each pedestrian instance.

Pedestrian Detection

Paper
Add Code

A Fast and Effective Method of Macula Automatic Detection for Retina Images

no code implementations • 7 Oct 2020 • Yukang Jiang, Jianying Pan, Yanhe Shen, Jin Zhu, Jiamin Huang, Huirui Xie, Xueqin Wang, Yan Luo

Retina image processing is one of the crucial and popular topics of medical image processing.

Paper
Add Code

$n$-Reference Transfer Learning for Saliency Prediction

1 code implementation • 9 Jul 2020 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

The proposed framework is gradient-based and model-agnostic.

Ranked #1 on Few-Shot Transfer Learning for Saliency Prediction on SALICON->WebpageSaliency - 1-shot

Transfer Learning

Paper
Code

NetML: A Challenge for Network Traffic Analytics

1 code implementation • 25 Apr 2020 • Onur Barut, Yan Luo, Tong Zhang, Weigang Li, Peilong Li

Classifying network traffic is the basis for important network applications.

BIG-bench Machine Learning Malware Detection

Paper
Code

3D Aggregated Faster R-CNN for General Lesion Detection

no code implementations • 29 Jan 2020 • Ning Zhang, Yu Cao, Benyuan Liu, Yan Luo

This classifier branch is equipped with Feature Aggregation and Local Magnification Layers to enhance the classifier branch.

Computed Tomography (CT) Lesion Detection +2

Paper
Add Code

Direction Concentration Learning: Enhancing Congruency in Machine Learning

1 code implementation • 17 Dec 2019 • Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

We propose a Direction Concentration Learning (DCL) method to improve congruency in the learning process, where enhancing congruency influences the convergence path to be less circuitous.

Ranked #8 on Image Classification on Tiny ImageNet Classification (using extra training data)

BIG-bench Machine Learning Continual Learning +2

Paper
Code

$\mathcal{G}$-softmax: Improving Intra-class Compactness and Inter-class Separability of Features

1 code implementation • 8 Apr 2019 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

In addition, analysis of the intra-class compactness and inter-class separability demonstrates the advantages of the proposed function over the softmax function, which is consistent with the performance improvement.

General Classification Multi-Label Classification

Paper
Code

NeuroTreeNet: A New Method to Explore Horizontal Expansion Network

no code implementations • 22 Nov 2018 • Shenlong Lou, Yan Luo, Qiancong Fan, Feng Chen, Yiping Chen, Cheng Wang, Jonathan Li

It is widely recognized that the deeper networks or networks with more feature maps have better performance.

Super-Resolution

Paper
Add Code

DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment

1 code implementation • 17 Jun 2016 • Chang Liu, Yu Cao, Yan Luo, Guanling Chen, Vinod Vokkarane, Yunsheng Ma

We applied our proposed approach to two real-world food image data sets (UEC-256 and Food-101) and achieved impressive results.

Cloud Computing Fine-Grained Image Recognition

Paper
Code

Foveation-based Mechanisms Alleviate Adversarial Examples

no code implementations • 19 Nov 2015 • Yan Luo, Xavier Boix, Gemma Roig, Tomaso Poggio, Qi Zhao

To see this, first, we report results in ImageNet that lead to a revision of the hypothesis that adversarial perturbations are a consequence of CNNs acting as a linear classifier: CNNs act locally linearly to changes in the image regions with objects recognized by the CNN, and in other regions the CNN may act non-linearly.

Foveation Translation

Paper
Add Code

Label Consistent Quadratic Surrogate Model for Visual Saliency Prediction

no code implementations • CVPR 2015 • Yan Luo, Yongkang Wong, Qi Zhao

In addition, since new datasets are built and shared in the community from time to time, it would be good not to retrain the entire model when new data are added.

Saliency Prediction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.