Search Results for author: Han Han

Found 10 papers, 5 papers with code

MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking

no code implementations • 18 Apr 2024 • Zhong Wang, Zengyu Wan, Han Han, Bohao Liao, Yuliang Wu, Wei Zhai, Yang Cao, Zheng-Jun Zha

Event-based eye tracking has shown great promise with the high temporal resolution and low redundancy provided by the event camera.

Traffic Sign Interpretation in Real Road Scene

no code implementations • 17 Nov 2023 • Chuang Yang, Kai Zhuang, Mulin Chen, Haozhao Ma, Xu Han, Tao Han, Changxing Guo, Han Han, Bingxuan Zhao, Qi Wang

Following the above issues, we propose a traffic sign interpretation (TSI) task, which aims to interpret global semantic interrelated traffic signs (e.g., driving instruction-related texts, symbols, and guide panels) into a natural language for providing accurate instruction support to autonomous or assistant driving.

Instruction Following • Multi-Task Learning

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis

1 code implementation • 24 Jan 2023 • Cyrus Vahidi, Han Han, Changhong Wang, Mathieu Lagrange, György Fazekas, Vincent Lostanlen

Computer musicians refer to mesostructures as the intermediate levels of articulation between the microstructure of waveshapes and the macrostructure of musical forms.

Perceptual-Neural-Physical Sound Matching

1 code implementation • 7 Jan 2023 • Han Han, Vincent Lostanlen, Mathieu Lagrange

On the other hand, mean square error in the spectrotemporal domain, known as "spectral loss", is perceptually motivated and serves in differentiable digital signal processing (DDSP).

Attribute • Audio Synthesis

Differentiable Time-Frequency Scattering on GPU

3 code implementations • 18 Apr 2022 • John Muradeli, Cyrus Vahidi, Changhong Wang, Han Han, Vincent Lostanlen, Mathieu Lagrange, George Fazekas

Joint time-frequency scattering (JTFS) is a convolutional operator in the time-frequency domain which extracts spectrotemporal modulations at various rates and scales.

Audio Generation • Resynthesis

Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading

no code implementations • CVPR 2022 • Ganchao Tan, Yang Wang, Han Han, Yang Cao, Feng Wu, Zheng-Jun Zha

To recognize words from the event data, we propose a novel Multi-grained Spatio-Temporal Features Perceived Network (MSTP) to perceive fine-grained spatio-temporal features from microsecond time-resolved event data.

Action Recognition • Lip Reading

Reconfigurable Intelligent Surface-induced Randomness for mmWave Key Generation

no code implementations • 31 Oct 2021 • Shubo Yang, Han Han, Yihong Liu, Weisi Guo, Zhibo Pang, Lei Zhang

In this paper, for mmWave secret key generation of physical layer security, we use a reconfigurable intelligent surface (RIS) to induce randomness directly in wireless environments, without adding complexity to transceivers.

Quantization

wav2shape: Hearing the Shape of a Drum Machine

1 code implementation • 20 Jul 2020 • Han Han, Vincent Lostanlen

Disentangling and recovering physical attributes, such as shape and material, from a few waveform examples is a challenging inverse problem in audio signal processing, with numerous applications in musical acoustics as well as structural engineering.

Audio Signal Processing
