Search Results for author: Lu Huang

Found 12 papers, 2 papers with code

A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery

no code implementations29 Mar 2025 PengYu Chen, Sicheng Wang, Cuizhen Wang, Senrong Wang, Beiao Huang, Lu Huang, Zhe Zang

Accurate rooftop detection from historical aerial imagery is vital for examining long-term urban development and human settlement patterns.

Colorization Image Colorization +4

Classifier Clustering and Feature Alignment for Federated Learning under Distributed Concept Drift

1 code implementation24 Oct 2024 Junbao Chen, Jingfeng Xue, Yong Wang, Zhenyan Liu, Lu Huang

Moreover, to address data heterogeneity, we study the feature alignment under distributed concept drift, and find two factors that are crucial for feature alignment: the conditional distribution $P(Y|X)$ and the degree of data heterogeneity.

Clustering Federated Learning

Deep-learning-based groupwise registration for motion correction of cardiac $T_1$ mapping

no code implementations18 Jun 2024 Yi Zhang, Yidong Zhao, Lu Huang, Liming Xia, Qian Tao

In this work, we propose a novel deep-learning-based groupwise registration framework, which omits the need for a template, and registers all baseline images simultaneously.

Test-time Adaptation

Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer

no code implementations15 Nov 2023 Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma

Deep biasing for the Transducer can improve the recognition performance of rare words or contextual entities, which is essential in practical applications, especially for streaming Automatic Speech Recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

HMM-Free Encoder Pre-Training for Streaming RNN Transducer

no code implementations2 Apr 2021 Lu Huang, Jingyu Sun, Yufeng Tang, JunFeng Hou, Jinkun Chen, Jun Zhang, Zejun Ma

This work describes an encoder pre-training procedure using frame-wise label to improve the training of streaming recurrent neural network transducer (RNN-T) model.

Speech Recognition

The Domain Shift Problem of Medical Image Segmentation and Vendor-Adaptation by Unet-GAN

no code implementations30 Oct 2019 Wenjun Yan, Yuanyuan Wang, Shengjia Gu, Lu Huang, Fuhua Yan, Liming Xia, Qian Tao

In this work, we proposed a generic framework to address this problem, consisting of (1) an unpaired generative adversarial network (GAN) for vendor-adaptation, and (2) a Unet for object segmentation.

Generative Adversarial Network Image Segmentation +3

An Improved Residual LSTM Architecture for Acoustic Modeling

no code implementations17 Aug 2017 Lu Huang, Jiasong Sun, Ji Xu, Yi Yang

Long Short-Term Memory (LSTM) is the primary recurrent neural networks architecture for acoustic modeling in automatic speech recognition systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Cannot find the paper you are looking for? You can Submit a new open access paper.