Search Results for author: Han Han

Found 16 papers, 10 papers with code

Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models

1 code implementation21 Mar 2025 Mengsong Wu, Tong Zhu, Han Han, Xiang Zhang, Wenbiao Shao, Wenliang Chen

However most of the existing methods either need to finetune that the model can only use tools seen in the training data, or add tool demonstrations into the prompt with lower efficiency.

GSM8K Question Answering

NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models

1 code implementation15 Oct 2024 Han Han, Tong Zhu, Xiang Zhang, Mengsong Wu, Hao Xiong, Wenliang Chen

Large language models (LLMs) combined with tool learning have gained impressive results in real-world applications.

Double-Side Polarization and Beamforming Alignment in Polarization Reconfigurable MISO System with Deep Neural Networks

no code implementations30 Sep 2024 Seungcheol Oh, Han Han, Joongheon Kim, Sean Kwon

Polarization reconfigurable (PR) antennas enhance spectrum and energy efficiency between next-generation node B(gNB) and user equipment (UE).

Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark

2 code implementations14 May 2024 Mengsong Wu, Tong Zhu, Han Han, Chuanyuan Tan, Xiang Zhang, Wenliang Chen

Therefore, Seal-Tools can serve as a new benchmark to evaluate the tool-calling ability of LLMs.

Active Sensing for Multiuser Beam Tracking with Reconfigurable Intelligent Surface

no code implementations6 May 2024 Han Han, Tao Jiang, Wei Yu

Specifically, the mobile UEs send uplink pilots to the AP periodically during the channel sensing intervals, the AP then adaptively configures the beamformers and the RIS reflection coefficients for subsequent data transmission based on the received pilots.

MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking

1 code implementation18 Apr 2024 Zhong Wang, Zengyu Wan, Han Han, Bohao Liao, Yuliang Wu, Wei Zhai, Yang Cao, Zheng-Jun Zha

Event-based eye tracking has shown great promise with the high temporal resolution and low redundancy provided by the event camera.

Data Augmentation Diversity

Traffic Sign Interpretation in Real Road Scene

no code implementations17 Nov 2023 Chuang Yang, Kai Zhuang, Mulin Chen, Haozhao Ma, Xu Han, Tao Han, Changxing Guo, Han Han, Bingxuan Zhao, Qi Wang

Following the above issues, we propose a traffic sign interpretation (TSI) task, which aims to interpret global semantic interrelated traffic signs (e. g.,~driving instruction-related texts, symbols, and guide panels) into a natural language for providing accurate instruction support to autonomous or assistant driving.

Instruction Following Multi-Task Learning

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis

1 code implementation24 Jan 2023 Cyrus Vahidi, Han Han, Changhong Wang, Mathieu Lagrange, György Fazekas, Vincent Lostanlen

Computer musicians refer to mesostructures as the intermediate levels of articulation between the microstructure of waveshapes and the macrostructure of musical forms.

Perceptual-Neural-Physical Sound Matching

1 code implementation7 Jan 2023 Han Han, Vincent Lostanlen, Mathieu Lagrange

On the other hand, mean square error in the spectrotemporal domain, known as "spectral loss", is perceptually motivated and serves in differentiable digital signal processing (DDSP).

Attribute Audio Synthesis +1

Differentiable Time-Frequency Scattering on GPU

3 code implementations18 Apr 2022 John Muradeli, Cyrus Vahidi, Changhong Wang, Han Han, Vincent Lostanlen, Mathieu Lagrange, George Fazekas

Joint time-frequency scattering (JTFS) is a convolutional operator in the time-frequency domain which extracts spectrotemporal modulations at various rates and scales.

Audio Generation Resynthesis

Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading

no code implementations CVPR 2022 Ganchao Tan, Yang Wang, Han Han, Yang Cao, Feng Wu, Zheng-Jun Zha

To recognize words from the event data, we propose a novel Multi-grained Spatio-Temporal Features Perceived Network (MSTP) to perceive fine-grained spatio-temporal features from microsecond time-resolved event data.

Action Recognition Lip Reading

Reconfigurable Intelligent Surface-induced Randomness for mmWave Key Generation

no code implementations31 Oct 2021 Shubo Yang, Han Han, Yihong Liu, Weisi Guo, Zhibo Pang, Lei Zhang

In this paper, for mmWave secret key generation of physical layer security, we use a reconfigurable intelligent surface (RIS) to induce randomness directly in wireless environments, without adding complexity to transceivers.

Quantization

wav2shape: Hearing the Shape of a Drum Machine

1 code implementation20 Jul 2020 Han Han, Vincent Lostanlen

Disentangling and recovering physical attributes, such as shape and material, from a few waveform examples is a challenging inverse problem in audio signal processing, with numerous applications in musical acoustics as well as structural engineering.

Audio Signal Processing

Cannot find the paper you are looking for? You can Submit a new open access paper.