Search Results for author: Yi Lin

Found 68 papers, 32 papers with code

From Controlled Scenarios to Real-World: Cross-Domain Degradation Pattern Matching for All-in-One Image Restoration

no code implementations • 28 May 2025 • Junyu Fan, Chuanlin Liao, Yi Lin

To bridge the data gap, a domain adaptation strategy is proposed to build the feature projection between the source and target domains by dynamically aligning their codebook embeddings, and a correlation alignment-based test-time adaptation mechanism is designed to fine-tune the alignment discrepancies by tightening the degradation embeddings to the corresponding cluster center in the source domain.

All Contrastive Learning +2

Logo-LLM: Local and Global Modeling with Large Language Models for Time Series Forecasting

1 code implementation • 16 May 2025 • Wenjie Ou, Zhishuo Zhao, Dongyue Guo, Yi Lin

While Transformer-based methods effectively capture global dependencies, they often overlook short-term local variations in time series.

Time Series Time Series Forecasting

SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models

1 code implementation • 12 May 2025 • Peichao Lai, Kexuan Zhang, Yi Lin, Linyihan Zhang, Feiyang Ye, Jinhao Yan, Yanwei Xu, Conghui He, Yilei Wang, Wentao Zhang, Bin Cui

Subjective Answer Grading (SAG) plays a crucial role in education, standardized testing, and automated assessment systems, particularly for evaluating short-form responses in Short Answer Scoring (SAS).

Seed1.5-VL Technical Report

no code implementations • 11 May 2025 • Dong Guo, Faming Wu, Feida Zhu, Fuxing Leng, Guang Shi, Haobin Chen, Haoqi Fan, Jian Wang, Jianyu Jiang, Jiawei Wang, Jingji Chen, Jingjia Huang, Kang Lei, Liping Yuan, Lishu Luo, PengFei Liu, Qinghao Ye, Rui Qian, Shen Yan, Shixiong Zhao, Shuai Peng, Shuangye Li, Sihang Yuan, Sijin Wu, Tianheng Cheng, Weiwei Liu, Wenqian Wang, Xianhan Zeng, Xiao Liu, Xiaobo Qin, Xiaohan Ding, Xiaojun Xiao, Xiaoying Zhang, Xuanwei Zhang, Xuehan Xiong, Yanghua Peng, Yangrui Chen, Yanwei Li, Yanxu Hu, Yi Lin, Yiyuan Hu, Yiyuan Zhang, Youbin Wu, Yu Li, Yudong Liu, Yue Ling, Yujia Qin, Zanbo Wang, Zhiwu He, Aoxue Zhang, Bairen Yi, Bencheng Liao, Can Huang, Can Zhang, Chaorui Deng, Chaoyi Deng, Cheng Lin, Cheng Yuan, Chenggang Li, Chenhui Gou, Chenwei Lou, Chengzhi Wei, Chundian Liu, Chunyuan Li, Deyao Zhu, Donghong Zhong, Feng Li, Feng Zhang, Gang Wu, Guodong Li, Guohong Xiao, Haibin Lin, Haihua Yang, Haoming Wang, Heng Ji, Hongxiang Hao, Hui Shen, Huixia Li, Jiahao Li, Jialong Wu, Jianhua Zhu, Jianpeng Jiao, Jiashi Feng, Jiaze Chen, Jianhui Duan, Jihao Liu, Jin Zeng, Jingqun Tang, Jingyu Sun, Joya Chen, Jun Long, Junda Feng, Junfeng Zhan, Junjie Fang, Junting Lu, Kai Hua, Kai Liu, Kai Shen, Kaiyuan Zhang, Ke Shen, Ke Wang, Keyu Pan, Kun Zhang, Kunchang Li, Lanxin Li, Lei LI, Lei Shi, Li Han, Liang Xiang, Liangqiang Chen, Lin Chen, Lin Li, Lin Yan, Liying Chi, Longxiang Liu, Mengfei Du, Mingxuan Wang, Ningxin Pan, Peibin Chen, Pengfei Chen, Pengfei Wu, Qingqing Yuan, Qingyao Shuai, Qiuyan Tao, Renjie Zheng, Renrui Zhang, Ru Zhang, Rui Wang, Rui Yang, Rui Zhao, Shaoqiang Xu, Shihao Liang, Shipeng Yan, Shu Zhong, Shuaishuai Cao, Shuangzhi Wu, Shufan Liu, Shuhan Chang, Songhua Cai, Tenglong Ao, Tianhao Yang, Tingting Zhang, Wanjun Zhong, Wei Jia, Wei Weng, Weihao Yu, Wenhao Huang, Wenjia Zhu, Wenli Yang, Wenzhi Wang, Xiang Long, XiangRui Yin, Xiao Li, Xiaolei Zhu, Xiaoying Jia, Xijin Zhang, Xin Liu, Xinchen Zhang, Xinyu Yang, Xiongcai Luo, Xiuli Chen, Xuantong Zhong, Xuefeng Xiao, Xujing Li, Yan Wu, Yawei Wen, Yifan Du, Yihao Zhang, Yining Ye, Yonghui Wu, Yu Liu, Yu Yue, Yufeng Zhou, Yufeng Yuan, Yuhang Xu, Yuhong Yang, Yun Zhang, Yunhao Fang, Yuntao Li, Yurui Ren, Yuwen Xiong, Zehua Hong, Zehua Wang, Zewei Sun, Zeyu Wang, Zhao Cai, Zhaoyue Zha, Zhecheng An, Zhehui Zhao, Zhengzhuo Xu, Zhipeng Chen, Zhiyong Wu, Zhuofan Zheng, ZiHao Wang, Zilong Huang, Ziyu Zhu, Zuquan Song

We present Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning.

Mixture-of-Experts Multimodal Reasoning +2

Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation

1 code implementation • 6 May 2025 • Yi Lin, Dong Zhang, Xiao Fang, Yufan Chen, Kwang-Ting Cheng, Hao Chen

Medical image segmentation is a pivotal task within the realms of medical image analysis and computer vision.

Boundary Detection Decoder +7

Emergence of Painting Ability via Recognition-Driven Evolution

no code implementations • 9 Jan 2025 • Yi Lin, Lin Gu, Ziteng Cui, Shenghan Su, Yumo Hao, Yingtao Tian, Tatsuya Harada, Jianfei Yang

The palette branch learns a limited colour palette, while the stroke branch parameterises each stroke using Bézier curves to render an image, subsequently evaluated by a high-level recognition module.

Image Compression

Merging Context Clustering with Visual State Space Models for Medical Image Segmentation

1 code implementation • 3 Jan 2025 • Yun Zhu, Dong Zhang, Yi Lin, Yifei Feng, Jinhui Tang

Medical image segmentation demands the aggregation of global and local feature representations, posing a challenge for current methodologies in handling both long-range and short-range feature interactions.

Clustering Image Segmentation +4

Revisiting Deep Ensemble Uncertainty for Enhanced Medical Anomaly Detection

1 code implementation • 26 Sep 2024 • Yi Gu, Yi Lin, Kwang-Ting Cheng, Hao Chen

To tackle these issues, we propose D2UE, a Diversified Dual-space Uncertainty Estimation framework for medical anomaly detection.

Anomaly Detection

FIF-UNet: An Efficient UNet Using Feature Interaction and Fusion for Medical Image Segmentation

no code implementations • 9 Sep 2024 • Xiaolin Gou, Chuanlin Liao, Jizhe Zhou, Fengshuo Ye, Yi Lin

Nowadays, pre-trained encoders are widely used in medical image segmentation because of their ability to capture complex feature representations.

Decoder Image Segmentation +3

Aligning Medical Images with General Knowledge from Large Language Models

1 code implementation • 31 Aug 2024 • Xiao Fang, Yi Lin, Dong Zhang, Kwang-Ting Cheng, Hao Chen

Pre-trained large vision-language models (VLMs) like CLIP have revolutionized visual representation learning using natural language as supervision, and demonstrated promising generalization ability.

General Knowledge Medical Image Analysis +3

Fine-Grained Building Function Recognition from Street-View Images via Geometry-Aware Semi-Supervised Learning

no code implementations • 18 Aug 2024 • Weijia Li, Jinhua Yu, Dairong Chen, Yi Lin, Runmin Dong, Xiang Zhang, Conghui He, Haohuan Fu

In this work, we propose a geometry-aware semi-supervised framework for fine-grained building function recognition, utilizing geometric relationships among multi-source data to enhance pseudo-label accuracy in semi-supervised learning, broadening its applicability to various building function categorization systems.

Pseudo Label

SkyDiffusion: Ground-to-Aerial Image Synthesis with Diffusion Models and BEV Paradigm

no code implementations • 3 Aug 2024 • Junyan Ye, Jun He, Weijia Li, Zhutao Lv, Yi Lin, Jinhua Yu, Haote Yang, Conghui He

In this paper, we introduce SkyDiffusion, a novel cross-view generation method for synthesizing aerial images from street view images, utilizing a diffusion model and the Bird's-Eye View (BEV) paradigm.

Image Generation SSIM

ProSpec RL: Plan Ahead, then Execute

no code implementations • 31 Jul 2024 • Liangliang Liu, Yi Guan, Boran Wang, Rujia Shen, Yi Lin, Chaoran Kong, Lian Yan, Jingchi Jiang

Imagining potential outcomes of actions before execution helps agents make more informed decisions, a prospective thinking ability fundamental to human cognition.

Model Predictive Control Reinforcement Learning (RL)

DR-RAG: Applying Dynamic Document Relevance to Retrieval-Augmented Generation for Question-Answering

no code implementations • 11 Jun 2024 • Zijian Hei, Weiling Liu, Wenjie Ou, Juyi Qiao, Junming Jiao, Guowen Song, Ting Tian, Yi Lin

Retrieval-Augmented Generation (RAG) has recently demonstrated the performance of Large Language Models (LLMs) in knowledge-intensive tasks such as Question-Answering (QA).

Question Answering RAG +2

Tunable Superconducting Magnetic Levitation with Self-Stability

no code implementations • 28 Mar 2024 • Qi Xu, Yi Lin, Yunfei Tan, Jianzhao Geng

For the first time, we experimentally demonstrate a self-stable type II superconducting maglev system which is able to: counteract long term levitation force decay, adjust levitation force and equilibrium position, and establish levitation under zero field cooling condition.

Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification

no code implementations • 19 Mar 2024 • Yi Lin, Zhengjie ZHU, Kwang-Ting Cheng, Hao Chen

To address this issue, we propose PAMT, a novel Prompt-guided Adaptive Model Transformation framework that enhances MIL classification performance by seamlessly adapting pre-trained models to the specific characteristics of histopathology data.

image-classification Image Classification +2

Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification

no code implementations • 13 Mar 2024 • Shuhan LI, Yi Lin, Hao Chen, Kwang-Ting Cheng

In this paper, we introduce an Iterative Online Image Synthesis (IOIS) framework to address the class imbalance problem in medical image classification.

image-classification Image Classification +4

LocMoE: A Low-Overhead MoE for Large Language Model Training

no code implementations • 25 Jan 2024 • Jing Li, Zhijie Sun, Xuan He, Li Zeng, Yi Lin, Entong Li, Binfan Zheng, Rongqian Zhao, Xin Chen

However, the performance of MoE is limited by load imbalance and high latency of All-to-All communication, along with relatively redundant computation owing to large expert capacity.

All Language Modeling +2

BoNuS: Boundary Mining for Nuclei Segmentation with Partial Point Labels

1 code implementation • 15 Jan 2024 • Yi Lin, Zeyu Wang, Dong Zhang, Kwang-Ting Cheng, Hao Chen

To alleviate this problem, in this paper, we propose a weakly-supervised nuclei segmentation method that only requires partial point labels of nuclei.

Multiple Instance Learning Segmentation

ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning

1 code implementation • 11 Dec 2023 • Xincheng Yu, Dongyue Guo, Jianwei Zhang, Yi Lin

Radio speech echo is a specific phenomenon in the air traffic control (ATC) domain, which degrades speech quality and further impacts automatic speech recognition (ASR) accuracy.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Adversarial Medical Image with Hierarchical Feature Hiding

1 code implementation • 4 Dec 2023 • Qingsong Yao, Zecheng He, Yuexiang Li, Yi Lin, Kai Ma, Yefeng Zheng, S. Kevin Zhou

Interestingly, this vulnerability is a double-edged sword, which can be exploited to hide AEs.

Decision Making

DED: Diagnostic Evidence Distillation for acne severity grading on face images

1 code implementation • Expert Systems with Applications 2023 • Yi Lin, Jingchi Jiang, Dongxin Chen, Zhaoyang Ma, Yi Guan, Xiguang Liu, Haiyan You, Jing Yang

In this study, we propose an acne diagnosis method, Diagnostic Evidence Distillation (DED), that suitably adapts the characteristics of acne diagnosis and can be applied to diagnose under different acne criteria.

Ranked #1 on Acne Severity Grading on ACNE04 (Accuracy metric)

Acne Severity Grading Diagnostic +4

Continuous 3D Myocardial Motion Tracking via Echocardiography

no code implementations • 4 Oct 2023 • Chengkang Shen, Hao Zhu, You Zhou, Yu Liu, Si Yi, Lili Dong, Weipeng Zhao, David J. Brady, Xun Cao, Zhan Ma, Yi Lin

Myocardial motion tracking stands as an essential clinical tool in the prevention and detection of cardiovascular diseases (CVDs), the foremost cause of death globally.

Motion Estimation

Penalties and Rewards for Fair Learning in Paired Kidney Exchange Programs

no code implementations • 23 Sep 2023 • Margarida Carvalho, Alison Caulfield, Yi Lin, Adrian Vetta

Rather, the key factor in increasing the number of transplants, decreasing waiting times and improving group fairness is the judicious assignment of a negative weight (penalty) to the small number of non-directed donors in the kidney exchange program.

Fairness

Bi-Modality Medical Image Synthesis Using Semi-Supervised Sequential Generative Adversarial Networks

no code implementations • 27 Aug 2023 • Xin Yang, Yi Lin, Zhiwei Wang, Xin Li, Kwang-Ting Cheng

A method for measuring the synthesis complexity is proposed to automatically determine the synthesis order in our sequential GAN.

Generative Adversarial Network Image Generation

PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation

1 code implementation • 24 Aug 2023 • Haibo Jin, Haoxuan Che, Yi Lin, Hao Chen

To address these challenges, we propose diagnosis-driven prompts for medical report generation (PromptMRG), a novel framework that aims to improve the diagnostic accuracy of MRG with the guidance of diagnosis-aware prompts.

Diagnostic Medical Report Generation

Integrating spoken instructions into flight trajectory prediction to optimize automation in air traffic control

no code implementations • 2 May 2023 • Dongyue Guo, Zheng Zhang, Bo Yang, Jianwei Zhang, Hongyu Yang, Yi Lin

The booming air transportation industry inevitably burdens air traffic controllers' workload, causing unexpected human factor-related incidents.

Prediction Traffic Prediction +1

A Non-autoregressive Multi-Horizon Flight Trajectory Prediction Framework with Gray Code Representation

1 code implementation • 2 May 2023 • Dongyue Guo, Zheng Zhang, Zhen Yan, Jianwei Zhang, Yi Lin

Additionally, the Gray code representation and the differential prediction paradigm are designed to cope with the high-bit misclassifications of the BE representation, which significantly reduces the outliers in the predictions.

Computational Efficiency Decoder +1

Disorder-invariant Implicit Neural Representation

no code implementations • 3 Apr 2023 • Hao Zhu, Shaowen Xie, Zhen Liu, Fengyi Liu, Qi Zhang, You Zhou, Yi Lin, Zhan Ma, Xun Cao

However, the expressive power of INR is limited by the spectral bias in the network training.

Attribute Retrieval

Few Shot Medical Image Segmentation with Cross Attention Transformer

1 code implementation • 24 Mar 2023 • Yi Lin, Yufan Chen, Kwang-Ting Cheng, Hao Chen

Our proposed network mines the correlations between the support image and query image, limiting them to focus only on useful foreground information and boosting the representation capacity of both the support prototype and query features.

Few-Shot Learning Image Segmentation +4

Boosting Convolution with Efficient MLP-Permutation for Volumetric Medical Image Segmentation

1 code implementation • 23 Mar 2023 • Yi Lin, Xiao Fang, Dong Zhang, Kwang-Ting Cheng, Hao Chen

Recently, the advent of vision Transformer (ViT) has brought substantial advancements in 3D dataset benchmarks, particularly in 3D volumetric medical image segmentation (Vol-MedSeg).

Image Segmentation Semantic Segmentation +1

Label-Efficient Deep Learning in Medical Image Analysis: Challenges and Future Directions

no code implementations • 22 Mar 2023 • Cheng Jin, Zhengrui Guo, Yi Lin, Luyang Luo, Hao Chen

Deep learning has significantly advanced medical imaging analysis (MIA), achieving state-of-the-art performance across diverse clinical tasks.

Medical Image Analysis Survey +1

Learning a Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with Occupancy Aids Scene Representation

1 code implementation • ICCV 2023 • Xiaoyang Lyu, Peng Dai, Zizhang Li, Dongyu Yan, Yi Lin, Yifan Peng, Xiaojuan Qi

We found that the color rendering loss results in optimization bias against low-intensity areas, causing gradient vanishing and leaving these areas unoptimized.

Neural Rendering Surface Reconstruction

TAKT: Target-Aware Knowledge Transfer for Whole Slide Image Classification

1 code implementation • 10 Mar 2023 • Conghao Xiong, Yi Lin, Hao Chen, Hao Zheng, Dong Wei, Yefeng Zheng, Joseph J. Y. Sung, Irwin King

Despite incorporating the target features during training, the teacher model tends to overlook them under the inherent domain shift and task discrepancy.

Classification image-classification +2

Parallel Computing Based Solution for Reliability-Constrained Distribution Network Planning

no code implementations • 9 Mar 2023 • Yaqi Sun, Wenchuan Wu, Yi Lin, Hai Huang, Hao Chen

The main goal of distribution network (DN) expansion planning is essentially to achieve minimal investment constrained with specified reliability requirements.

Efficient Implicit Neural Reconstruction Using LiDAR

1 code implementation • 28 Feb 2023 • Dongyu Yan, Xiaoyang Lyu, Jieqi Shi, Yi Lin

Modeling scene geometry using implicit neural representation has revealed its advantages in accuracy, flexibility, and low memory usage.

3D Reconstruction

Explainable and Safe Reinforcement Learning for Autonomous Air Mobility

1 code implementation • 24 Nov 2022 • Lei Wang, Hongyu Yang, Yi Lin, Suwan Yin, Yuankai Wu

Although DRL has achieved important advancements in this field, the existing works pay little attention to the explainability and safety issues related to DRL controllers, particularly the safety under adversarial attacks.

Adversarial Attack Deep Reinforcement Learning +4

Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future Directions

1 code implementation • 21 Sep 2022 • Dong Zhang, Yi Lin, Hao Chen, Zhuotao Tian, Xin Yang, Jinhui Tang, Kwang Ting Cheng

Over the past few years, the rapid development of deep learning technologies for computer vision has significantly improved the performance of medical image segmentation (MedISeg).

Data Augmentation Domain Adaptation +3

Seg4Reg+: Consistency Learning between Spine Segmentation and Cobb Angle Regression

no code implementations • 26 Aug 2022 • Yi Lin, Luyan Liu, Kai Ma, Yefeng Zheng

In this study, we propose a novel multi-task framework, named Seg4Reg+, which jointly optimizes the segmentation and regression networks.

global-optimization Image Segmentation +4

Spatiotemporal Propagation Learning for Network-Wide Flight Delay Prediction

1 code implementation • 14 Jul 2022 • Yuankai Wu, Hongyu Yang, Yi Lin, Hong Liu

By this means, STPN allows cross-talk of spatial and temporal factors for modeling delay propagation.

Decision Making Prediction +1

Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNs

no code implementations • 5 Jun 2022 • Zhiwei Wang, Jinxin Lv, Yunqiao Yang, Yuanhuai Liang, Yi Lin, Qiang Li, Xin Li, Xin Yang

Vertebral landmark localization is a crucial step for variant spine-related clinical applications, which requires detecting the corner points of 17 vertebrae.

Nuclei Segmentation with Point Annotations from Pathology Images via Self-Supervised Learning and Co-Training

1 code implementation • 16 Feb 2022 • Yi Lin, Zhiyong Qu, Hao Chen, Zhongke Gao, Yuexiang Li, Lili Xia, Kai Ma, Yefeng Zheng, Kwang-Ting Cheng

Third, a self-supervised visual representation learning method is tailored for nuclei segmentation of pathology images that transforms the hematoxylin component images into the H&E stained images to gain better understanding of the relationship between the nuclei and cytoplasm.

Representation Learning Segmentation +2

Deep Learning for Computational Cytology: A Survey

no code implementations • 10 Feb 2022 • Hao Jiang, Yanning Zhou, Yi Lin, Ronald CK Chan, Jiang Liu, Hao Chen

Computational cytology is a critical, rapid-developing, yet challenging topic in the field of medical image computing which analyzes the digitized cytology image by computer-aided technologies for cancer screening.

Deep Learning Medical Image Analysis +2

An Acne Grading Framework on Face Images via Skin Attention and SFNet

no code implementations • IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2022 • Yi Lin, Yi Guan, Zhaoyang Ma, Haiyan You, Xue Cheng, Jingchi Jiang

In this paper, the global estimation of acne severity grading is studied by Convolutional Neural Networks (CNNs) and a unified acne grading framework that can diagnose referring to different grading criteria is proposed.

Ranked #3 on Acne Severity Grading on ACNE04 (Accuracy metric)

Acne Severity Grading Diagnostic

A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event Categories

no code implementations • 16 Nov 2021 • Arijit Dasgupta, Jiafei Duan, Marcelo H. Ang Jr, Yi Lin, Su-hua Wang, Renée Baillargeon, Cheston Tan

Recent work in computer vision and cognitive reasoning has given rise to an increasing adoption of the Violation-of-Expectation (VoE) paradigm in synthetic datasets.

Automated Pulmonary Embolism Detection from CTPA Images Using an End-to-End Convolutional Neural Network

no code implementations • 10 Nov 2021 • Yi Lin, Jianchao Su, Xiang Wang, Xiang Li, Jingen Liu, Kwang-Ting Cheng, Xin Yang

We have evaluated our approach using the 20 CTPA test dataset from the PE challenge, achieving a sensitivity of 78.9%, 80.7% and 80.7% at 2 false positives per volume at 0mm, 2mm and 5mm localization error, which is superior to the state-of-the-art methods.

Pulmonary Embolism Detection

Speech recognition for air traffic control via feature learning and end-to-end training

no code implementations • 4 Nov 2021 • Peng Fan, Dongyue Guo, Yi Lin, Bo Yang, Jianwei Zhang

In this work, we propose a new automatic speech recognition (ASR) system based on feature learning and an end-to-end training procedure for air traffic control (ATC) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

A Comparative Study of Speaker Role Identification in Air Traffic Communication Using Deep Learning Approaches

no code implementations • 3 Nov 2021 • Dongyue Guo, Jianwei Zhang, Bo Yang, Yi Lin

Most importantly, a multi-modal speaker role identification network (MMSRINet) is designed to achieve the SRI task by considering both the speech and textual modality features.

Binary Classification

LENAS: Learning-based Neural Architecture Search and Ensemble for 3D Radiotherapy Dose Prediction

1 code implementation • 12 Jun 2021 • Yi Lin, Yanfei Liu, Hao Chen, Xin Yang, Kai Ma, Yefeng Zheng, Kwang-Ting Cheng

To mitigate the complexity introduced by the model ensemble, we adopt the teacher-student paradigm, leveraging the diverse outputs from multiple learned networks as supervisory signals to guide the training of the student network.

Diversity Ensemble Learning +2

ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems

no code implementations • 17 Feb 2021 • Yi Lin, Bo Yang, Linchao Li, Dongyue Guo, Jianwei Zhang, Hu Chen, Yi Zhang

Finally, by integrating the SRL with ASR, an end-to-end multilingual ASR framework is formulated in a supervised manner, which is able to translate the raw wave into text in one model, i.e., wave-to-text.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Improving speech recognition models with small samples for air traffic control systems

no code implementations • 16 Feb 2021 • Yi Lin, Qin Li, Bo Yang, Zhen Yan, Huachun Tan, Zhengmao Chen

By virtue of the common terminology used in the ATC domain, the transfer learning task can be regarded as a sub-domain adaption task, in which the transferred model is optimized using a joint corpus consisting of baseline samples and new transcribed samples from the target dataset.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

A Hierarchical Feature Constraint to Camouflage Medical Adversarial Attacks

1 code implementation • 17 Dec 2020 • Qingsong Yao, Zecheng He, Yi Lin, Kai Ma, Yefeng Zheng, S. Kevin Zhou

Deep neural networks (DNNs) for medical images are extremely vulnerable to adversarial examples (AEs), which poses security concerns on clinical decision making.

Adversarial Attack Decision Making

STELA: A Real-Time Scene Text Detector with Learned Anchor

1 code implementation • 17 Sep 2019 • Linjie Deng, Yanxiang Gong, Xinchen Lu, Yi Lin, Zheng Ma, Mei Xie

To achieve high coverage of target boxes, a normal strategy of conventional one-stage anchor-based detectors is to utilize multiple priors at each spatial position, especially in scene text detection tasks.

Scene Text Detection Text Detection

Semi-supervised mp-MRI Data Synthesis with StitchLayer and Auxiliary Distance Maximization

no code implementations • 17 Dec 2018 • Zhiwei Wang, Yi Lin, Kwang-Ting Cheng, Xin Yang

Experimental results show that our method can effectively synthesize a large variety of mpMRI images which contain meaningful CS PCa lesions, display a good visual quality and have the correct paired relationship.

Synthesizing Multi-Parameter Magnetic Resonance Imaging (Mp-Mri) Data
