no code implementations • 14 Apr 2025 • Hao Yin, Guangzong Si, Zilei Wang
Contrastive decoding strategies are widely used to reduce hallucinations in multimodal large language models (MLLMs).
no code implementations • 3 Apr 2025 • Hao Yin, Shi Guo, Xu Jia, Xudong Xu, Lu Zhang, Si Liu, Dong Wang, Huchuan Lu, Tianfan Xue
In this work, we propose a novel pipeline for non-contact sound recovery, fully utilizing spatial-temporal information from the event stream.
1 code implementation • 17 Mar 2025 • Hao Yin, Guangzong Si, Zilei Wang
Multimodal large language models (MLLMs) improve performance on vision-language tasks by integrating visual features from pre-trained vision encoders into large language models (LLMs).
1 code implementation • 17 Mar 2025 • Hao Yin, Guangzong Si, Zilei Wang
However, these methods present two main limitations: (1) bluntly suppressing language priors can compromise coherence and accuracy of generated content, and (2) processing contrastive inputs adds computational load, significantly slowing inference speed.
no code implementations • 5 Feb 2025 • Hao Yin, Paritosh Parmar, Daoliang Xu, Yang Zhang, Tianyou Zheng, Weiwei Fu
Action Quality Assessment (AQA) -- the ability to quantify the quality of human motion, actions, or skill levels and provide feedback -- has far-reaching implications in areas such as low-cost physiotherapy, sports training, and workforce development.
no code implementations • 27 Nov 2024 • Yichen Wang, Jie Wang, Fulin Wang, Xiang Li, Hao Yin, Bhiksha Raj
In recent years, graph representation learning has undergone a paradigm shift, driven by the emergence and proliferation of graph neural networks (GNNs) and their heterogeneous counterparts.
no code implementations • 26 Nov 2024 • Yichen Wang, Hao Yin, Yifan Yang, Chenyang Zhao, Siqin Wang
Freight truck-related crashes pose significant challenges, leading to substantial economic losses, injuries, and fatalities, with pronounced spatial disparities across different regions.
no code implementations • 17 Jul 2024 • Pengyu Zhang, Hao Yin, Zeren Wang, Wenyue Chen, Shengming Li, Dong Wang, Huchuan Lu, Xu Jia
Sign language is one of the most effective communication tools for people with hearing difficulties.
no code implementations • 11 Jun 2024 • Hanzhao Li, Liumeng Xue, Haohan Guo, Xinfa Zhu, YuanJun Lv, Lei Xie, Yunlin Chen, Hao Yin, Zhifei Li
The multi-codebook speech codec enables the application of large language models (LLM) in TTS but bottlenecks efficiency and robustness due to multi-sequence prediction.
no code implementations • 11 Dec 2023 • Hao Yin, Bayu Jayawardhana, Stephan Trenn
The first result pertains to the equivalence of the contraction of a DAE system and the uniform global exponential stability (UGES) of its variational DAE system.
no code implementations • 11 Dec 2023 • Hao Yin, Bayu Jayawardhana, Stephan Trenn
This paper introduce the notion of output contraction that expands the contraction notion to the time-varying nonlinear systems with output.
no code implementations • 30 Nov 2023 • Xiangyu Gao, Yaping Sun, Dongyu Wei, Xiaodong Xu, Hao Chen, Hao Yin, Shuguang Cui
In this context, we address the problem of efficient remote object recognition by optimizing feature transmission between mobile devices and edge servers.
no code implementations • 28 Jul 2023 • Huan Wu, Huan-Feng Duan, Wallace W. L. Lai, Kun Zhu, Xin Cheng, Hao Yin, Bin Zhou, Chun-Cheung Lai, Chao Lu, Xiaoli Ding
Detecting leaks in water networks is a costly challenge.
1 code implementation • 10 May 2023 • Lei Yuan, Zi-Qian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Li-He Li, Chao Qian, Yang Yu
Concretely, to avoid the ego-system overfitting to a specific attacker, we maintain a set of attackers, which is optimized to guarantee the attackers high attacking quality and behavior diversity.
no code implementations • 8 Sep 2022 • Zeyu Liu, Yi Wang, Jing Wen, Yong Zhang, Hao Yin, Chao Guo, Zhongyu Wang
In addition, in order to improve the segmentation performance, we adopt multi-view and multi-window level method, at the same time we employ a fine-tune strategy to mitigate the impact of inconsistent labeling.
no code implementations • 22 Jun 2022 • Lyutianyang Zhang, Hao Yin, Sumit Roy, Liu Cao
Meanwhile, a deep reinforcement learning channel access (DLCA) protocol is developed to replace the binary exponential backoff mechanism in DCF to enhance the network throughput by enabling the coordination of APs.
no code implementations • 1 Jun 2022 • Chengxing Jia, Hao Yin, Chenxiao Gao, Tian Xu, Lei Yuan, Zongzhang Zhang, Yang Yu
Model-based offline optimization with dynamics-aware policy provides a new perspective for policy learning and out-of-distribution generalization, where the learned policy could adapt to different dynamics enumerated at the training stage.
no code implementations • 2 Mar 2021 • Anwen Liao, Zhen Gao, Dongming Wang, Hua Wang, Hao Yin, Derrick Wing Kwan Ng, Mohamed-Slim Alouini
According to the proposed prior-aided iterative angle estimation algorithm, azimuth/elevation angles can be estimated, and these angles are adopted to achieve precise beam-alignment and refine GTTDU module for further eliminating delay-beam squint.
Information Theory Signal Processing Information Theory
no code implementations • 14 Aug 2020 • Zijie Ji, Phee Lep Yeoh, Deyou Zhang, Gaojie Chen, Yan Zhang, Zunwen He, Hao Yin, Yonghui Li
We propose and analyze secret key generation using intelligent reflecting surface (IRS) assisted wireless communication networks.
no code implementations • 12 Apr 2017 • Hao Yin, Austin R. Benson, Jure Leskovec
Here we introduce higher-order clustering coefficients that measure the closure probability of higher-order network cliques and provide a more comprehensive view of how the edges of complex networks cluster.