Search Results for author: Yan Luo

Found 44 papers, 22 papers with code

NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

3 code implementations22 Apr 2024 Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin, Yu Zhu, Peng Wu, Wei Dong, Jinqiu Sun, Yanning Zhang, Qingsen Yan, Wenbin Zou, Weipeng Yang, Yunxiang Li, Qiaomu Wei, Tian Ye, Sixiang Chen, Zhao Zhang, Suiyi Zhao, Bo wang, Yan Luo, Zhichao Zuo, Mingshen Wang, Junhu Wang, Yanyan Wei, Xiaopeng Sun, Yu Gao, Jiancheng Huang, Hongming Chen, Xiang Chen, Hui Tang, Yuanbin Chen, Yuanbo Zhou, Xinwei Dai, Xintao Qiu, Wei Deng, Qinquan Gao, Tong Tong, Mingjia Li, Jin Hu, Xinyu He, Xiaojie Guo, sabarinathan, K Uma, A Sasithradevi, B Sathya Bama, S. Mohamed Mansoor Roomi, V. Srivatsav, Jinjuan Wang, Long Sun, Qiuying Chen, Jiahong Shao, Yizhi Zhang, Marcos V. Conde, Daniel Feijoo, Juan C. Benito, Alvaro García, Jaeho Lee, Seongwan Kim, Sharif S M A, Nodirkhuja Khujaev, Roman Tsoy, Ali Murtaza, Uswah Khairuddin, Ahmad 'Athif Mohd Faudzi, Sampada Malagi, Amogh Joshi, Nikhil Akalwadi, Chaitra Desai, Ramesh Ashok Tabib, Uma Mudenagudi, Wenyi Lian, Wenjing Lian, Jagadeesh Kalyanshetti, Vijayalaxmi Ashok Aralikatti, Palani Yashaswini, Nitish Upasi, Dikshit Hegde, Ujwala Patil, Sujata C, Xingzhuo Yan, Wei Hao, Minghan Fu, Pooja Choksy, Anjali Sarvaiya, Kishor Upla, Kiran Raja, Hailong Yan, Yunkai Zhang, Baiang Li, Jingyi Zhang, Huan Zheng

This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results.

4k Low-Light Image Enhancement +1

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

1 code implementation16 Apr 2024 Bin Ren, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, ZhengJun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Barsoum Emad, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, YuHan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Qian Wang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi

In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking.

Image Super-Resolution

FairCLIP: Harnessing Fairness in Vision-Language Learning

1 code implementation29 Mar 2024 Yan Luo, Min Shi, Muhammad Osama Khan, Muhammad Muneeb Afzal, Hao Huang, Shuaihang Yuan, Yu Tian, Luo Song, Ava Kouhana, Tobias Elze, Yi Fang, Mengyu Wang

Fairness is a critical concern in deep learning, especially in healthcare, where these models influence diagnoses and treatment decisions.

Fairness

MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder

no code implementations7 Mar 2024 Lei LI, Tianfang Zhang, Xinglin Zhang, Jiaqi Liu, Bingqi Ma, Yan Luo, Tao Chen

Within the domain of medical analysis, extensive research has explored the potential of mutual learning between Masked Autoencoders(MAEs) and multimodal data.

Representation Learning Zero-Shot Learning

TransFlower: An Explainable Transformer-Based Model with Flow-to-Flow Attention for Commuting Flow Prediction

1 code implementation23 Feb 2024 Yan Luo, Zhuoyue Wan, Yuzhong Chen, Gengchen Mai, Fu-Lai Chung, Kent Larson

Understanding the link between urban planning and commuting flows is crucial for guiding urban development and policymaking.

Prismatic: Interactive Multi-View Cluster Analysis of Concept Stocks

no code implementations14 Feb 2024 Wong Kam-Kwai, Yan Luo, Xuanwu Yue, Wei Chen, Huamin Qu

Financial cluster analysis allows investors to discover investment alternatives and avoid undertaking excessive risks.

Clustering

Cut-and-Paste: Subject-Driven Video Editing with Attention Control

no code implementations20 Nov 2023 Zhichao Zuo, Zhao Zhang, Yan Luo, Yang Zhao, Haijun Zhang, Yi Yang, Meng Wang

This paper presents a novel framework termed Cut-and-Paste for real-word semantic video editing under the guidance of text prompt and additional reference image.

Object Video Editing

Passive Handwriting Tracking via Weak mmWave Communication Signals

no code implementations3 Nov 2023 Chao Yu, Yan Luo, Renqi Chen, Rui Wang

In this letter, a cooperative sensing framework based on millimeter wave (mmWave) communication systems is proposed to detect tiny motions with a millimeter-level resolution.

FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

1 code implementation3 Nov 2023 Yu Tian, Min Shi, Yan Luo, Ava Kouhana, Tobias Elze, Mengyu Wang

Existing medical fairness datasets are all for classification tasks, and no fairness datasets are available for medical segmentation, while medical segmentation is an equally important clinical task as classifications, which can provide detailed spatial information on organ abnormalities ready to be assessed by clinicians.

Fairness Image Segmentation +3

FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling

no code implementations3 Oct 2023 Yan Luo, Muhammad Osama Khan, Yu Tian, Min Shi, Zehao Dou, Tobias Elze, Yi Fang, Mengyu Wang

To address this research gap, we conduct the first comprehensive study on the fairness of 3D medical imaging models across multiple protected attributes.

Fairness

Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning

no code implementations ICCV 2023 Yan Luo, Min Shi, Yu Tian, Tobias Elze, Mengyu Wang

This is the largest glaucoma detection dataset with 3D OCT imaging data and the first glaucoma progression forecasting dataset that is publicly available.

Fairness

Focus the Discrepancy: Intra- and Inter-Correlation Learning for Image Anomaly Detection

1 code implementation ICCV 2023 Xincheng Yao, Ruoqi Li, Zefeng Qian, Yan Luo, Chongyang Zhang

Humans recognize anomalies through two aspects: larger patch-wise representation discrepancies and weaker patch-to-normal-patch correlations.

Anomaly Detection Self-Supervised Learning

Harvard Glaucoma Fairness: A Retinal Nerve Disease Dataset for Fairness Learning and Fair Identity Normalization

1 code implementation15 Jun 2023 Yan Luo, Yu Tian, Min Shi, Louis R. Pasquale, Lucy Q. Shen, Nazlee Zebardast, Tobias Elze, Mengyu Wang

To address this gap, we introduce Harvard Glaucoma Fairness (Harvard-GF), a retinal nerve disease dataset with both 2D and 3D imaging data and balanced racial groups for glaucoma detection.

Fairness Feature Importance

Timestamps as Prompts for Geography-Aware Location Recommendation

no code implementations9 Apr 2023 Yan Luo, Haoyi Duan, Ye Liu, Fu-Lai Chung

In this paper, we revisit the problem of location recommendation and point out that explicitly modeling temporal information is a great help when the model needs to predict not only the next location but also further locations.

End-to-End Personalized Next Location Recommendation via Contrastive User Preference Modeling

no code implementations22 Mar 2023 Yan Luo, Ye Liu, Fu-Lai Chung, Yu Liu, Chang Wen Chen

History encoder is designed to model mobility patterns from historical check-in sequences, while query generator explicitly learns user preferences to generate user-specific intention queries.

mmAlert: mmWave Link Blockage Prediction via Passive Sensing

no code implementations22 Feb 2023 Chao Yu, Yifei Sun, Yan Luo, Rui Wang

It is demonstrated via experiments that the mmAlert system can always detect the motions of the walking person close to the LoS path, and predict 90\% of the LoS blockage with sensing time of 1. 4 seconds.

Long-Range Zero-Shot Generative Deep Network Quantization

no code implementations13 Nov 2022 Yan Luo, Yangcheng Gao, Zhao Zhang, Haijun Zhang, Mingliang Xu, Meng Wang

We find it is because: 1) a normal generator is hard to obtain high diversity of synthetic data, since it lacks long-range information to allocate attention to global features; 2) the synthetic images aim to simulate the statistics of real data, which leads to weak intra-class heterogeneity and limited feature richness.

Knowledge Distillation Quantization

GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition

1 code implementation28 Oct 2022 Jia-Xin Ye, Xin-Cheng Wen, Xuan-Ze Wang, Yong Xu, Yan Luo, Chang-Li Wu, Li-Yan Chen, Kun-Hong Liu

In this paper, we propose a Gated Multi-scale Temporal Convolutional Network (GM-TCNet) to construct a novel emotional causality representation learning component with a multi-scale receptive field.

Representation Learning Speech Emotion Recognition

CTL-MTNet: A Novel CapsNet and Transfer Learning-Based Mixed Task Net for the Single-Corpus and Cross-Corpus Speech Emotion Recognition

no code implementations18 Jul 2022 Xin-Cheng Wen, Jia-Xin Ye, Yan Luo, Yong Xu, Xuan-Ze Wang, Chang-Li Wu, Kun-Hong Liu

For the single-corpus task, the combination of Convolution-Pooling and Attention CapsNet module CPAC) is designed by embedding the self-attention mechanism to the CapsNet, guiding the module to focus on the important features that can be fed into different capsules.

Cross-corpus Speech Emotion Recognition +1

Passive Motion Detection via mmWave Communication System

no code implementations28 Mar 2022 Jie Li, Chao Yu, Yan Luo, Yifei Sun, Rui Wang

Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively.

Hand Gesture Recognition Hand-Gesture Recognition +1

Learning to Minimize the Remainder in Supervised Learning

1 code implementation23 Jan 2022 Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

To this end, we propose a new learning approach, namely gradient adjustment learning (GAL), to leverage the knowledge learned from the past training iterations to adjust vanilla gradients, such that the remainders are minimized and the approximations are improved.

Image Classification Image Retrieval +3

Learning to Predict Gradients for Semi-Supervised Continual Learning

1 code implementation23 Jan 2022 Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

To explore these issues, we formulate a new semi-supervised continual learning method, which can be generically applied to existing continual learning models.

Continual Learning

Swin-Pose: Swin Transformer Based Human Pose Estimation

no code implementations19 Jan 2022 Zinan Xiong, Chenxi Wang, Ying Li, Yan Luo, Yu Cao

We are interested in exploring its capability in human pose estimation, and thus propose a novel model based on transformer architecture, enhanced with a feature pyramid fusion structure.

Pose Estimation

Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images

1 code implementation22 Nov 2021 Ye Liu, Huifang Li, Chao Hu, Shuang Luo, Yan Luo, Chang Wen Chen

The proposed model exploits three lightweight plug-and-play modules, namely dense feature pyramid network (DenseFPN), spatial context pyramid (SCP), and hierarchical region of interest extractor (HRoIE), to aggregate global visual context at feature, spatial, and instance domains, respectively.

Instance Segmentation Object Detection

Learning to Predict Trustworthiness with Steep Slope Loss

1 code implementation NeurIPS 2021 Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

Secondly, due to the data complexity, it is challenging to differentiate the incorrect predictions from the correct ones on real-world large-scale datasets.

Embracing Uncertainty: Decoupling and De-bias for Robust Temporal Grounding

no code implementations CVPR 2021 Hao Zhou, Chongyang Zhang, Yan Luo, Yanjun Chen, Chuanping Hu

Meanwhile, modified feature assigned with style-like words (including adjectives, adverbs, etc) represents the subjective information, and thus brings personalized predictions; De-bias - We propose a de-bias mechanism to generate diverse predictions, aim to alleviate the bias caused by single-style annotations in the presence of label uncertainty.

Relation

Which to Match? Selecting Consistent GT-Proposal Assignment for Pedestrian Detection

no code implementations18 Mar 2021 Yan Luo, Chongyang Zhang, Muming Zhao, Hao Zhou, Jun Sun

Consequently, we address the weakness of IoU by introducing one geometric sensitive search algorithm as a new assignment and regression metric.

Autonomous Driving Pedestrian Detection +1

Where, What, Whether: Multi-modal Learning Meets Pedestrian Detection

no code implementations CVPR 2020 Yan Luo, Chongyang Zhang, Muming Zhao, Hao Zhou, Jun Sun

i) We generate a bird view map, which is naturally free from occlusion issues, and scan all points on it to look for suitable locations for each pedestrian instance.

Pedestrian Detection

3D Aggregated Faster R-CNN for General Lesion Detection

no code implementations29 Jan 2020 Ning Zhang, Yu Cao, Benyuan Liu, Yan Luo

This classifier branch is equipped with Feature Aggregation and Local Magnification Layers to enhance the classifier branch.

Computed Tomography (CT) Lesion Detection +2

Direction Concentration Learning: Enhancing Congruency in Machine Learning

1 code implementation17 Dec 2019 Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

We propose a Direction Concentration Learning (DCL) method to improve congruency in the learning process, where enhancing congruency influences the convergence path to be less circuitous.

Ranked #8 on Image Classification on Tiny ImageNet Classification (using extra training data)

BIG-bench Machine Learning Continual Learning +2

$\mathcal{G}$-softmax: Improving Intra-class Compactness and Inter-class Separability of Features

1 code implementation8 Apr 2019 Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

In addition, analysis of the intra-class compactness and inter-class separability demonstrates the advantages of the proposed function over the softmax function, which is consistent with the performance improvement.

General Classification Multi-Label Classification

NeuroTreeNet: A New Method to Explore Horizontal Expansion Network

no code implementations22 Nov 2018 Shenlong Lou, Yan Luo, Qiancong Fan, Feng Chen, Yiping Chen, Cheng Wang, Jonathan Li

It is widely recognized that the deeper networks or networks with more feature maps have better performance.

Super-Resolution

DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment

1 code implementation17 Jun 2016 Chang Liu, Yu Cao, Yan Luo, Guanling Chen, Vinod Vokkarane, Yunsheng Ma

We applied our proposed approach to two real-world food image data sets (UEC-256 and Food-101) and achieved impressive results.

Cloud Computing Fine-Grained Image Recognition

Foveation-based Mechanisms Alleviate Adversarial Examples

no code implementations19 Nov 2015 Yan Luo, Xavier Boix, Gemma Roig, Tomaso Poggio, Qi Zhao

To see this, first, we report results in ImageNet that lead to a revision of the hypothesis that adversarial perturbations are a consequence of CNNs acting as a linear classifier: CNNs act locally linearly to changes in the image regions with objects recognized by the CNN, and in other regions the CNN may act non-linearly.

Foveation Translation

Label Consistent Quadratic Surrogate Model for Visual Saliency Prediction

no code implementations CVPR 2015 Yan Luo, Yongkang Wong, Qi Zhao

In addition, since new datasets are built and shared in the community from time to time, it would be good not to retrain the entire model when new data are added.

Saliency Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.