Search Results for author: Yi Luo

Found 34 papers, 11 papers with code

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

no code implementations NeurIPS 2021 Xiaolin Hu, Kai Li, Weiyi Zhang, Yi Luo, Jean-Marie Lemercier, Timo Gerkmann

Recent advances in the design of neural network architectures, in particular those specialized in modeling sequences, have provided significant improvements in speech separation performance.

Speech Separation

Cascadable all-optical NAND gates using diffractive networks

no code implementations 2 Nov 2021 Yi Luo, Deniz Mengu, Aydogan Ozcan

Based on this architecture, we numerically optimized the design of a diffractive neural network composed of 4 passive layers to all-optically perform NAND operation using the diffraction of light, and cascaded these diffractive NAND gates to perform complex logical functions by successively feeding the output of one diffractive NAND gate into another.

Distilling Self-Knowledge From Contrastive Links to Classify Graph Nodes Without Passing Messages

2 code implementations 16 Jun 2021 Yi Luo, Aiguo Chen, Ke Yan, Ling Tian

Nowadays, Graph Neural Networks (GNNs) following the Message Passing paradigm become the dominant way to learn on graphic data.

Node Classification

Dynamic imaging and characterization of volatile aerosols in e-cigarette emissions using deep learning-based holographic microscopy

no code implementations 31 Mar 2021 Yi Luo, Yichen Wu, Liqiao Li, Yuening Guo, Ege Cetintas, Yifang Zhu, Aydogan Ozcan

To evaluate the effects of e-liquid composition on aerosol dynamics, we measured the volatility of the particles generated by flavorless, nicotine-free e-liquids with various PG/VG volumetric ratios, revealing a negative correlation between the particles' volatility and the volumetric ratio of VG in the e-liquid.

Memory-Associated Differential Learning

2 code implementations 10 Feb 2021 Yi Luo, Aiguo Chen, Bei Hui, Ke Yan

Conventional Supervised Learning approaches focus on the mapping from input features to output labels.

Link Prediction

Group Communication with Context Codec for Lightweight Source Separation

1 code implementation 14 Dec 2020 Yi Luo, Cong Han, Nima Mesgarani

A context codec module, containing a context encoder and a context decoder, is designed as a learnable downsampling and upsampling module to decrease the length of a sequential feature processed by the separation module.

Speech Enhancement Speech Separation

Drone LAMS: A Drone-based Face Detection Dataset with Large Angles and Many Scenarios

no code implementations 16 Nov 2020 Yi Luo, Siyi Chen, X.-G. Ma

This work presented a new drone-based face detection dataset Drone LAMS in order to solve issues of low performance of drone-based face detection in scenarios such as large angles which was a predominant working condition when a drone flies high.

Face Detection

Integrated Gallium Nitride Nonlinear Photonics

no code implementations 30 Oct 2020 Yanzhen Zheng, Changzheng Sun, Bing Xiong, Lai Wang, Zhibiao Hao, Jian Wang, Yanjun Han, Hongtao Li, Jiadong Yu, Yi Luo

Thanks to its high nonlinearity and high refractive index contrast, GaN-on-insulator (GaNOI) is also a promising platform for nonlinear optical applications.

Optics Applied Physics

An End-to-end Architecture of Online Multi-channel Speech Separation

no code implementations 7 Sep 2020 Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie

Previously, we introduced a system, called unmixing, fixed-beamformer and extraction (UFE), that was shown to be effective in addressing the speech overlap problem in conversation transcription.

Speech Recognition Speech Separation

Deep learning-based holographic polarization microscopy

no code implementations 1 Jul 2020 Tairan Liu, Kevin de Haan, Bijie Bai, Yair Rivenson, Yi Luo, Hongda Wang, David Karalli, Hongxiang Fu, Yibo Zhang, John FitzGerald, Aydogan Ozcan

Our analysis shows that a trained deep neural network can extract the birefringence information using both the sample specific morphological features as well as the holographic amplitude and phase distribution.

Medical Diagnosis

Terahertz Pulse Shaping Using Diffractive Surfaces

no code implementations 30 Jun 2020 Muhammed Veli, Deniz Mengu, Nezih T. Yardimci, Yi Luo, Jingxi Li, Yair Rivenson, Mona Jarrahi, Aydogan Ozcan

Recent advances in deep learning have been providing non-intuitive solutions to various inverse problems in optics.

Object Classification Transfer Learning

Spectrally-Encoded Single-Pixel Machine Vision Using Diffractive Networks

no code implementations 15 May 2020 Jingxi Li, Deniz Mengu, Nezih T. Yardimci, Yi Luo, Xurong Li, Muhammed Veli, Yair Rivenson, Mona Jarrahi, Aydogan Ozcan

3D engineering of matter has opened up new avenues for designing systems that can perform various computational tasks through light-matter interaction.

General Classification Object Classification

Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss

no code implementations 27 Mar 2020 Yi Luo, Nima Mesgarani

Many recent source separation systems are designed to separate a fixed number of sources out of a mixture.

Continuous speech separation: dataset and analysis

1 code implementation 30 Jan 2020 Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li

In this paper, we define continuous speech separation (CSS) as a task of generating a set of non-overlapped speech signals from a continuous audio stream that contains multiple utterances that are partially overlapped by a varying degree.

automatic-speech-recognition Speech Recognition +1

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation

2 code implementations 30 Oct 2019 Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka

An important problem in ad-hoc microphone speech separation is how to guarantee the robustness of a system with respect to the locations and numbers of microphones.

Speech Separation

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation

6 code implementations 14 Oct 2019 Yi Luo, Zhuo Chen, Takuya Yoshioka

Recent studies in deep learning-based speech separation have proven the superiority of time-domain approaches to conventional time-frequency-based methods.

Speech Separation

Design of Task-Specific Optical Systems Using Broadband Diffractive Neural Networks

no code implementations 14 Sep 2019 Yi Luo, Deniz Mengu, Nezih T. Yardimci, Yair Rivenson, Muhammed Veli, Mona Jarrahi, Aydogan Ozcan

We report a broadband diffractive optical neural network design that simultaneously processes a continuum of wavelengths generated by a temporally-incoherent broadband source to all-optically perform a specific task learned using deep learning.

Class-specific Differential Detection in Diffractive Optical Neural Networks Improves Inference Accuracy

no code implementations 8 Jun 2019 Jingxi Li, Deniz Mengu, Yi Luo, Yair Rivenson, Aydogan Ozcan

Similar to ensemble methods practiced in machine learning, we also independently-optimized multiple differential diffractive networks that optically project their light onto a common detector plane, and achieved testing accuracies of 98.59%, 91.06% and 51.44% for MNIST, Fashion-MNIST and grayscale CIFAR-10, respectively.

General Classification Object Classification

Demand Prediction for Electric Vehicle Sharing

no code implementations 10 Mar 2019 Man Luo, Hongkai Wen, Yi Luo, Bowen Du, Konstantin Klemmer, Hong-Ming Zhu

Electric Vehicle (EV) sharing systems have recently experienced unprecedented growth across the globe.

Decision Making

Response to Comment on "All-optical machine learning using diffractive deep neural networks"

no code implementations 10 Oct 2018 Deniz Mengu, Yi Luo, Yair Rivenson, Xing Lin, Muhammed Veli, Aydogan Ozcan

In their Comment, Wei et al. (arXiv:1809.08360v1 [cs.LG]) claim that our original interpretation of Diffractive Deep Neural Networks (D2NN) represent a mischaracterization of the system due to linearity and passivity.

Analysis of Diffractive Optical Neural Networks and Their Integration with Electronic Neural Networks

no code implementations 3 Oct 2018 Deniz Mengu, Yi Luo, Yair Rivenson, Aydogan Ozcan

Furthermore, we report the integration of D2NNs with electronic neural networks to create hybrid-classifiers that significantly reduce the number of input pixels into an electronic network using an ultra-compact front-end D2NN with a layer-to-layer distance of a few wavelengths, also reducing the complexity of the successive electronic network.

General Classification

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation

13 code implementations 20 Sep 2018 Yi Luo, Nima Mesgarani

The majority of the previous methods have formulated the separation problem through the time-frequency representation of the mixed signal, which has several drawbacks, including the decoupling of the phase and magnitude of the signal, the suboptimality of time-frequency representation for speech separation, and the long latency in calculating the spectrograms.

Multi-task Audio Source Separation Music Source Separation +3

Real-time Single-channel Dereverberation and Separation with Time-domain Audio Separation Network

1 code implementation ISCA Interspeech 2018 Yi Luo, Nima Mesgarani

We investigate the recently proposed Time-domain Audio Separation Network (TasNet) in the task of real-time single-channel speech dereverberation.

Denoising Speech Dereverberation +1

TasNet: time-domain audio separation network for real-time, single-channel speech separation

2 code implementations 1 Nov 2017 Yi Luo, Nima Mesgarani

We directly model the signal in the time-domain using an encoder-decoder framework and perform the source separation on nonnegative encoder outputs.

Speech Separation

Point Set Registration With Global-Local Correspondence and Transformation Estimation

no code implementations ICCV 2017 Su Zhang, Yang Yang, Kun Yang, Yi Luo, Sim-Heng Ong

We present a new point set registration method with global-local correspondence and transformation estimation (GL-CATE).

Speaker-independent Speech Separation with Deep Attractor Network

no code implementations 12 Jul 2017 Yi Luo, Zhuo Chen, Nima Mesgarani

A reference point attractor is created in the embedding space to represent each speaker which is defined as the centroid of the speaker in the embedding space.

Speech Separation

Deep attractor network for single-microphone speaker separation

no code implementations 27 Nov 2016 Zhuo Chen, Yi Luo, Nima Mesgarani

We propose a novel deep learning framework for single channel speech separation by creating attractor points in high dimensional embedding space of the acoustic signals which pull together the time-frequency bins corresponding to each source.

Speaker Separation Speech Separation

Deep Clustering and Conventional Networks for Music Separation: Stronger Together

no code implementations 18 Nov 2016 Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani

Deep clustering is the first method to handle general audio separation scenarios with multiple sources of the same type and an arbitrary number of sources, performing impressively in speaker-independent speech separation tasks.

Deep Clustering Multi-Task Learning +2
