Search Results for author: Yu Hu

Found 50 papers, 17 papers with code

Dynamic Compressing Prompts for Efficient Inference of Large Language Models

no code implementations15 Apr 2025 Jinwu Hu, Wei zhang, Yufeng Wang, Yu Hu, Bin Xiao, Mingkui Tan, Qing Du

We model prompt compression as a Markov Decision Process (MDP), enabling the DCP-Agent to sequentially remove redundant tokens by adapting to dynamic contexts and retaining crucial content.

Token Reduction

Skeleton and Font Generation Network for Zero-shot Chinese Character Generation

no code implementations14 Jan 2025 Mobai Xue, Jun Du, Zhenrong Zhang, Jiefeng Ma, Qikai Chang, Pengfei Hu, Jianshu Zhang, Yu Hu

We used generated misspelled characters as data augmentation in Chinese character error correction tasks, simulating the scenario where students learn handwritten Chinese characters with the help of misspelled characters.

Data Augmentation Font Generation

Generating Long-form Story Using Dynamic Hierarchical Outlining with Memory-Enhancement

1 code implementation18 Dec 2024 Qianyue Wang, Jinwu Hu, ZhengPing Li, Yufeng Wang, daiyuan li, Yu Hu, Mingkui Tan

Long-form story generation task aims to produce coherent and sufficiently lengthy text, essential for applications such as novel writingand interactive storytelling.

Form Knowledge Graphs +1

Dynamic Ensemble Reasoning for LLM Experts

no code implementations10 Dec 2024 Jinwu Hu, Yufeng Wang, Shuhai Zhang, Kai Zhou, Guohao Chen, Yu Hu, Bin Xiao, Mingkui Tan

Ensemble reasoning for the strengths of different LLM experts is critical to achieving consistent and satisfactory performance on diverse inputs across a wide range of tasks.

Transfer Learning

DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions

no code implementations11 Nov 2024 Shu-Tong Niu, Jun Du, Ruo-Yu Wang, Gao-Bin Yang, Tian Gao, Jia Pan, Yu Hu

First, we sequentially integrate the NSD and SS modules within a joint training framework, enabling the separation module to leverage speaker time boundaries from the diarization module effectively.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction

no code implementations21 Oct 2024 Heng Zhai, Jilin Mei, Chen Min, Liang Chen, Fangzhou Zhao, Yu Hu

3D semantic occupancy prediction is an essential part of autonomous driving, focusing on capturing the geometric details of scenes.

3D Semantic Occupancy Prediction Autonomous Driving +1

Proto-OOD: Enhancing OOD Object Detection with Prototype Feature Similarity

no code implementations9 Sep 2024 Junkun Chen, Jilin Mei, Liang Chen, Fangzhou Zhao, Yan Xing, Yu Hu

We observe that features of the same category are more tightly clustered in feature space, while those of different categories are more dispersed.

Few-Shot Learning object-detection +1

TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation

1 code implementation28 Aug 2024 Junbao Zhou, Jilin Mei, Pengze Wu, Liang Chen, Fangzhou Zhao, Xijun Zhao, Yu Hu

However, this approach introduces a data imbalance biased to novel data that presents a new challenge of catastrophic forgetting.

Autonomous Driving Few-Shot Semantic Segmentation +3

PID: Physics-Informed Diffusion Model for Infrared Image Generation

1 code implementation12 Jul 2024 Fangyuan Mao, Jilin Mei, Shun Lu, Fuyang Liu, Liang Chen, Fangzhou Zhao, Yu Hu

Infrared imaging technology has gained significant attention for its reliable sensing ability in low visibility conditions, prompting many studies to convert the abundant RGB images to infrared images.

Image Generation

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

no code implementations13 Jun 2024 Jiefeng Ma, Yan Wang, Chenyu Liu, Jun Du, Yu Hu, Zhenrong Zhang, Pengfei Hu, Qing Wang, Jianshu Zhang

Accurately identifying and organizing textual content is crucial for the automation of document processing in the field of form understanding.

Form Relation Prediction

Disturbance Rejection-Guarded Learning for Vibration Suppression of Two-Inertia Systems

no code implementations16 Apr 2024 Fan Zhang, Jinfeng Chen, Yu Hu, Zhiqiang Gao, Ge Lv, Qin Lin

On the other hand, machine learning benefits from an additional assurance layer provided by the ESO, as any imperfections in the machine learning model can be compensated for by the ESO.

FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining

1 code implementation15 Apr 2024 Zou Zhen, Yu Hu, Zhao Feng

Recent studies have witnessed the effectiveness and efficiency of Mamba for perceiving global and local information based on its exploiting local correlation among patches, however, rarely attempts have been explored to extend it with frequency analysis for image deraining, limiting its ability to perceive global degradation that is relevant to frequency modeling (e. g. Fourier transform).

Mamba Rain Removal

PA&DA: Jointly Sampling PAth and DAta for Consistent NAS

1 code implementation CVPR 2023 Shun Lu, Yu Hu, Longxing Yang, Zihao Sun, Jilin Mei, Jianchao Tan, Chengru Song

Our method only requires negligible computation cost for optimizing the sampling distributions of path and data, but achieves lower gradient variance during supernet training and better generalization performance for the supernet, resulting in a more consistent NAS.

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback

no code implementations24 Feb 2023 Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao

Large language models (LLMs), such as ChatGPT, are able to generate human-like, fluent responses for many downstream tasks, e. g., task-oriented dialog and question answering.

Informativeness Open-Domain Question Answering

Few-shot 3D LiDAR Semantic Segmentation for Autonomous Driving

no code implementations17 Feb 2023 Jilin Mei, Junbao Zhou, Yu Hu

Thus, we propose a few-shot 3D LiDAR semantic segmentation method that predicts both novel classes and base classes simultaneously.

Autonomous Driving Generalized Few-Shot Semantic Segmentation +4

Uniform tensor clustering by jointly exploring sample affinities of various orders

no code implementations3 Feb 2023 Hongmin Cai, Fei Qi, Junyu Li, Yu Hu, Yue Zhang, Yiu-ming Cheung, Bin Hu

Conventional clustering methods based on pairwise affinity usually suffer from the concentration effect while processing huge dimensional features yet low sample sizes data, resulting in inaccuracy to encode the sample proximity and suboptimal performance in clustering.

Clustering

Unleashing the Power of Gradient Signal-to-Noise Ratio for Zero-Shot NAS

1 code implementation ICCV 2023 Zihao Sun, Yu Sun, Longxing Yang, Shun Lu, Jilin Mei, Wenxiao Zhao, Yu Hu

Neural Architecture Search (NAS) aims to automatically find optimal neural network architectures in an efficient way.

Neural Architecture Search

Mapping effective connectivity by virtually perturbing a surrogate brain

1 code implementation31 Dec 2022 Zixiang Luo, Kaining Peng, Zhichao Liang, Shengyuan Cai, Chenyu Xu, Dan Li, Yu Hu, Changsong Zhou, Quanying Liu

Effective connectivity (EC), indicative of the causal interactions between brain regions, is fundamental to understanding information processing in the brain.

CloudBrain-ReconAI: An Online Platform for MRI Reconstruction and Image Quality Evaluation

no code implementations4 Dec 2022 Yirong Zhou, Chen Qian, Jiayu Li, Zi Wang, Yu Hu, Biao Qu, Liuhong Zhu, Jianjun Zhou, Taishan Kang, Jianzhong Lin, Qing Hong, Jiyang Dong, Di Guo, Xiaobo Qu

Efficient collaboration between engineers and radiologists is important for image reconstruction algorithm development and image quality evaluation in magnetic resonance imaging (MRI).

Cloud Computing compressed sensing +1

A General Model-Based Extended State Observer with Built-In Zero Dynamics

no code implementations25 Aug 2022 Jinfeng Chen, Zhiqiang Gao, Yu Hu, Sally Shao

A general model-based extended state observer (GMB-ESO) is proposed for single-input single-output linear time-invariant systems with a given state space model, where the total disturbance, a lump sum of model uncertainties and external disturbances, is defined as an extended state in the same manner as in the original formulation of ESO.

AGNAS: Attention-Guided Micro- and Macro-Architecture Search

1 code implementation International Conference on Machine Learning 2022 Zihao Sun, Yu Hu, Shun Lu, Longxing Yang, Jilin Mei, Yinhe Han, Xiaowei Li

We utilize the attention weights to represent the importance of the relevant operations for the micro search or the importance of the relevant blocks for the macro search.

Neural Architecture Search

Potential utilization of Battery Energy Storage Systems (BESS) in the major European electricity markets

no code implementations18 Dec 2021 Yu Hu, Miguel Armada, Maria Jesus Sanchez

The result shows that under the current empirical estimation of the battery cost and lifetime, BESS is not feasible for energy arbitrage in most of the European electricity markets.

Learning Linear Polytree Structural Equation Models

1 code implementation22 Jul 2021 Xingmei Lou, Yu Hu, XiaoDong Li

We are interested in the problem of learning the directed acyclic graph (DAG) when data are generated from a linear structural equation model (SEM) and the causal structure can be characterized by a polytree.

Deep learning based low-dose synchrotron radiation CT reconstruction

no code implementations9 Jun 2021 Ling Li, Yu Hu

Synchrotron radiation sources are widely used in various fields, among which computed tomography (CT) is one of the most important.

Computed Tomography (CT) CT Reconstruction +1

Quantization of Deep Neural Networks for Accurate Edge Computing

no code implementations25 Apr 2021 Wentao Chen, Hailong Qiu, Jian Zhuang, Chutong Zhang, Yu Hu, Qing Lu, Tianchen Wang, Yiyu Shi, Meiping Huang, Xiaowe Xu

Deep neural networks (DNNs) have demonstrated their great potential in recent years, exceeding the per-formance of human experts in a wide range of applications.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

DPointNet: A Density-Oriented PointNet for 3D Object Detection in Point Clouds

no code implementations7 Feb 2021 Jie Li, Yu Hu

In this paper, we put forward a novel density-oriented PointNet (DPointNet) for 3D object detection in point clouds, in which the density of points increases layer by layer.

3D Object Detection Object +1

Model-based cellular kinetic analysis of SARS-CoV-2 infection: different immune response modes and treatment strategies

no code implementations12 Jan 2021 Zhengqing Zhou, Zhiheng Zhao, Shuyu Shi, Jianghua Wu, Dianjie Li, Jianwei Li, Jingpeng Zhang, Ke Gui, Yu Zhang, Heng Mei, Yu Hu, Qi Ouyang, Fangting Li

Integrating theoretical results with clinical COVID-19 patients' data, we classified the COVID-19 development processes into three typical modes of immune responses, correlated with the clinical classification of mild & moderate, severe and critical patients.

ALFA: Adversarial Feature Augmentation for Enhanced Image Recognition

no code implementations1 Jan 2021 Tianlong Chen, Yu Cheng, Zhe Gan, Yu Hu, Zhangyang Wang, Jingjing Liu

Adversarial training is an effective method to combat adversarial attacks in order to create robust neural networks.

Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention

no code implementations28 Dec 2020 Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin

In this paper, we propose a novel deep learning architecture to improving word-level lip-reading.

Lip Reading

Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing

1 code implementation9 Dec 2020 Run-Ze Wang, Zhen-Hua Ling, Jing-Bo Zhou, Yu Hu

The dynamic schema-state and SQL-state representations are then utilized to decode the SQL query corresponding to current utterance.

Graph Neural Network Text-To-SQL

Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement

no code implementations21 Sep 2020 Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee

We first extract visual embedding from lip frames using a pre-trained phone or articulation place recognizer for visual-only EASE (VEASE).

Speech Enhancement

A Density-Aware PointRCNN for 3D Object Detection in Point Clouds

no code implementations11 Sep 2020 Jie Li, Yu Hu

We present an improved version of PointRCNN for 3D object detection, in which a multi-branch backbone network is adopted to handle the non-uniform density of point clouds.

3D Object Detection object-detection

Barriers to grid-connected battery systems: Evidence from the Spanish electricity market

no code implementations28 Jun 2020 Yu Hu, David Soler Soneira, María Jesús Sánchez

The concept of "potentially profitable utilization time" is proposed and introduced to identify and evaluate future potential grid applications for battery systems.

Exploring Spatial-Temporal Multi-Frequency Analysis for High-Fidelity and Temporal-Consistency Video Prediction

1 code implementation CVPR 2020 Beibei Jin, Yu Hu, Qiankun Tang, Jingyu Niu, Zhiping Shi, Yinhe Han, Xiaowei Li

Inspired by the frequency band decomposition characteristic of Human Vision System (HVS), we propose a video prediction network based on multi-level wavelet analysis to deal with spatial and temporal information in a unified manner.

 Ranked #1 on Video Prediction on KTH (PSNR metric)

Prediction Video Generation +1

PosNeg-Balanced Anchors with Aligned Features for Single-Shot Object Detection

no code implementations9 Aug 2019 Qiankun Tang, Shice Liu, Jie Li, Yu Hu

We introduce a novel single-shot object detector to ease the imbalance of foreground-background class by suppressing the easy negatives while increasing the positives.

Decoder object-detection +1

Integrating Tensor Similarity to Enhance Clustering Performance

no code implementations10 May 2019 Hong Peng, Yu Hu, Jiazhou Chen, Hai-Yan Wang, Yang Li, Hongmin Cai

The performance of most the clustering methods hinges on the used pairwise affinity, which is usually denoted by a similarity matrix.

Clustering

See and Think: Disentangling Semantic Scene Completion

1 code implementation NeurIPS 2018 Shice Liu, Yu Hu, Yiming Zeng, Qiankun Tang, Beibei Jin, Yinhe Han, Xiaowei Li

Semantic scene completion predicts volumetric occupancy and object category of a 3D scene, which helps intelligent agents to understand and interact with the surroundings.

2D Semantic Segmentation 3D Semantic Scene Completion +2

Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation

no code implementations CVPR 2018 Xiaowei Xu, Qing Lu, Yu Hu, Lin Yang, Sharon Hu, Danny Chen, Yiyu Shi

Unlike existing litera- ture on quantization which primarily targets memory and computation complexity reduction, we apply quan- tization as a method to reduce over tting in FCNs for better accuracy.

Image Segmentation Quantization +2

Part-of-Speech Relevance Weights for Learning Word Embeddings

no code implementations24 Mar 2016 Quan Liu, Zhen-Hua Ling, Hui Jiang, Yu Hu

The model proposed in this paper paper jointly optimizes word vectors and the POS relevance matrices.

Learning Word Embeddings POS +2

Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency

no code implementations28 Dec 2015 Shiliang Zhang, Cong Liu, Hui Jiang, Si Wei, Li-Rong Dai, Yu Hu

In this paper, we propose a novel neural network structure, namely \emph{feedforward sequential memory networks (FSMN)}, to model long-term dependency in time series without using recurrent feedback.

Language Modelling speech-recognition +3

Cannot find the paper you are looking for? You can Submit a new open access paper.