Search Results for author: Han Han

Found 10 papers, 5 papers with code

MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking

no code implementations • 18 Apr 2024 • Zhong Wang, Zengyu Wan, Han Han, Bohao Liao, Yuliang Wu, Wei Zhai, Yang Cao, Zheng-Jun Zha

Event-based eye tracking has shown great promise with the high temporal resolution and low redundancy provided by the event camera.

Event-Based Eye Tracking. AIS 2024 Challenge Survey

no code implementations • 17 Apr 2024 • Zuowen Wang, Chang Gao, Zongwei Wu, Marcos V. Conde, Radu Timofte, Shih-Chii Liu, Qinyu Chen, Zheng-Jun Zha, Wei Zhai, Han Han, Bohao Liao, Yuliang Wu, Zengyu Wan, Zhong Wang, Yang Cao, Ganchao Tan, Jinze Chen, Yan Ru Pei, Sasskia Brüers, Sébastien Crouzet, Douglas McLelland, Oliver Coenen, Baoheng Zhang, Yizhao Gao, Jingyuan Li, Hayden Kwok-Hay So, Philippe Bich, Chiara Boretti, Luciano Prono, Mircea Lică, David Dinucu-Jianu, Cătălin Grîu, Xiaopeng Lin, Hongwei Ren, Bojun Cheng, Xinan Zhang, Valentin Vial, Anthony Yezzi, James Tsai

This survey reviews the AIS 2024 Event-Based Eye Tracking (EET) Challenge.

Traffic Sign Interpretation in Real Road Scene

no code implementations • 17 Nov 2023 • Chuang Yang, Kai Zhuang, Mulin Chen, Haozhao Ma, Xu Han, Tao Han, Changxing Guo, Han Han, Bingxuan Zhao, Qi Wang

Following the above issues, we propose a traffic sign interpretation (TSI) task, which aims to interpret global semantic interrelated traffic signs (e.g., driving instruction-related texts, symbols, and guide panels) into a natural language for providing accurate instruction support to autonomous or assistant driving.

Instruction Following • Multi-Task Learning

Fitting Auditory Filterbanks with Multiresolution Neural Networks

2 code implementations • 25 Jul 2023 • Vincent Lostanlen, Daniel Haider, Han Han, Mathieu Lagrange, Peter Balazs, Martin Ehler

Waveform-based deep learning faces a dilemma between nonparametric and parametric approaches.

Inductive Bias • Knowledge Distillation

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis

1 code implementation • 24 Jan 2023 • Cyrus Vahidi, Han Han, Changhong Wang, Mathieu Lagrange, György Fazekas, Vincent Lostanlen

Computer musicians refer to mesostructures as the intermediate levels of articulation between the microstructure of waveshapes and the macrostructure of musical forms.

Perceptual-Neural-Physical Sound Matching

1 code implementation • 7 Jan 2023 • Han Han, Vincent Lostanlen, Mathieu Lagrange

On the other hand, mean square error in the spectrotemporal domain, known as "spectral loss", is perceptually motivated and serves in differentiable digital signal processing (DDSP).

Attribute • Audio Synthesis

Differentiable Time-Frequency Scattering on GPU

3 code implementations • 18 Apr 2022 • John Muradeli, Cyrus Vahidi, Changhong Wang, Han Han, Vincent Lostanlen, Mathieu Lagrange, George Fazekas

Joint time-frequency scattering (JTFS) is a convolutional operator in the time-frequency domain which extracts spectrotemporal modulations at various rates and scales.

Audio Generation • Resynthesis

Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading

no code implementations • CVPR 2022 • Ganchao Tan, Yang Wang, Han Han, Yang Cao, Feng Wu, Zheng-Jun Zha

To recognize words from the event data, we propose a novel Multi-grained Spatio-Temporal Features Perceived Network (MSTP) to perceive fine-grained spatio-temporal features from microsecond time-resolved event data.

Action Recognition • Lip Reading

Reconfigurable Intelligent Surface-induced Randomness for mmWave Key Generation

no code implementations • 31 Oct 2021 • Shubo Yang, Han Han, Yihong Liu, Weisi Guo, Zhibo Pang, Lei Zhang

In this paper, for mmWave secret key generation of physical layer security, we use a reconfigurable intelligent surface (RIS) to induce randomness directly in wireless environments, without adding complexity to transceivers.

Quantization

wav2shape: Hearing the Shape of a Drum Machine

1 code implementation • 20 Jul 2020 • Han Han, Vincent Lostanlen

Disentangling and recovering physical attributes, such as shape and material, from a few waveform examples is a challenging inverse problem in audio signal processing, with numerous applications in musical acoustics as well as structural engineering.

Audio Signal Processing
