Search Results for author: Yang Hu

Found 71 papers, 18 papers with code

Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors

no code implementations26 Nov 2024 Ziang Xu, Bin Li, Yang Hu, Chenyu Zhang, James East, Sharib Ali, Jens Rittscher

Accurate 3D mapping in endoscopy enables quantitative, holistic lesion characterization within the gastrointestinal (GI) tract, requiring reliable depth and pose estimation.

Pose Estimation

Primal-Dual Spectral Representation for Off-policy Evaluation

no code implementations23 Oct 2024 Yang Hu, Tianyi Chen, Na Li, Kai Wang, Bo Dai

We highlight that our algorithm, SpectralDICE, is the first to leverage the linear representation of primal-dual variables that is both computation and sample efficient, the performance of which is supported by a rigorous theoretical sample complexity guarantee and a thorough empirical evaluation on various benchmarks.

Off-policy evaluation

Risk-sensitive Affine Control Synthesis for Stationary LTI Systems

no code implementations23 Oct 2024 Yang Hu, Shahriar Talebi, Na Li

To address deviations from expected performance in stochastic systems, we propose a risk-sensitive control synthesis method to minimize certain risk measures over the limiting stationary distribution.

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

no code implementations7 Oct 2024 Junming Wang, Xingyu Zhang, Zebin Xing, Songen Gu, Xiaoyang Guo, Yang Hu, Ziying Song, Qian Zhang, Xiaoxiao Long, Wei Yin

In this paper, we propose HE-Drive: the first human-like-centric end-to-end autonomous driving system to generate trajectories that are both temporally consistent and comfortable.

Autonomous Driving Denoising +1

IFNet: Deep Imaging and Focusing for Handheld SAR with Millimeter-wave Signals

no code implementations3 May 2024 Yadong Li, Dongheng Zhang, Ruixu Geng, Jincheng Wu, Yang Hu, Qibin Sun, Yan Chen

Recent advancements have showcased the potential of handheld millimeter-wave (mmWave) imaging, which applies synthetic aperture radar (SAR) principles in portable settings.

SSIM

Efficient Duple Perturbation Robustness in Low-rank MDPs

no code implementations11 Apr 2024 Yang Hu, Haitong Ma, Bo Dai, Na Li

The pursuit of robustness has recently been a popular topic in reinforcement learning (RL) research, yet the existing methods generally suffer from efficiency issues that obstruct their real-world implementation.

Reinforcement Learning (RL)

Leveraging Intelligent Recommender system as a first step resilience measure -- A data-driven supply chain disruption response framework

no code implementations30 Mar 2024 Yang Hu

Interests in the value of digital technologies for its potential uses to increase supply chain resilience (SCRes) are increasing in light to the industry 4. 0 and the global pandemic.

Recommendation Systems

Emulating Complex Synapses Using Interlinked Proton Conductors

no code implementations26 Jan 2024 Lifu Zhang, Ji-An Li, Yang Hu, Jie Jiang, Rongjie Lai, Marcus K. Benna, Jian Shi

The memory consolidation from coupled storage components is revealed by both numerical simulations and experimental observations.

Continual Learning

Passive Non-Line-of-Sight Imaging with Light Transport Modulation

no code implementations26 Dec 2023 Jiarui Zhang, Ruixu Geng, Xiaolong Du, Yan Chen, Houqiang Li, Yang Hu

In this work, we propose NLOS-LTM, a novel passive NLOS imaging method that effectively handles multiple light transport conditions with a single network.

Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation

1 code implementation11 Dec 2023 Dong Zhao, Ruizhi Yang, Shuang Wang, Qi Zang, Yang Hu, Licheng Jiao, Nicu Sebe, Zhun Zhong

This approach formulates pseudo-labels at the connectivity level and thus can facilitate learning structured and low-noise semantics.

Domain Adaptation Semantic Segmentation

DREAM-PCD: Deep Reconstruction and Enhancement of mmWave Radar Pointcloud

no code implementations27 Sep 2023 Ruixu Geng, Yadong Li, Dongheng Zhang, Jincheng Wu, Yating Gao, Yang Hu, Yan Chen

However, existing methods failed to simultaneously address the three main challenges in mmWave radar pointcloud reconstruction: specular information lost, low angular resolution, and strong interference and noise.

Beyond attention: deriving biologically interpretable insights from weakly-supervised multiple-instance learning models

no code implementations7 Sep 2023 Willem Bonnaffé, CRUK ICGC Prostate Group, Freddie Hamdy, Yang Hu, Ian Mills, Jens Rittscher, Clare Verrill, Dan J. Woodcock

Recent advances in attention-based multiple instance learning (MIL) have improved our insights into the tissue regions that models rely on to make predictions in digital pathology.

Multiple Instance Learning

Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity

1 code implementation20 Jun 2023 Runyu Zhang, Yang Hu, Na Li

This paper introduces a new formulation for risk-sensitive MDPs, which assesses risk in a slightly different manner compared to the classical Markov risk measure (Ruszczy\'nski 2010), and establishes its equivalence with a class of soft robust MDP (RMDP) problems, including the standard RMDP as a special case.

Perspectives in closed-loop supply chains network design considering risk and uncertainty factors

no code implementations7 Jun 2023 Yang Hu

Risk and uncertainty in each stage of CLSC have greatly increased the complexity and reduced process efficiency of the closed-loop networks, impeding the sustainable and resilient development of industries and the circular economy.

Catch-Up Distillation: You Only Need to Train Once for Accelerating Sampling

1 code implementation18 May 2023 Shitong Shao, Xu Dai, Shouyi Yin, Lujun Li, Huanran Chen, Yang Hu

On CIFAR-10, we obtain a FID of 2. 80 by sampling in 15 steps under one-session training and the new state-of-the-art FID of 3. 37 by sampling in one step with additional training.

Knowledge Distillation

A Dynamic-Neighbor Particle Swarm Optimizer for Accurate Latent Factor Analysis

no code implementations23 Feb 2023 Jia Chen, Yixian Chun, Yuanyi Liu, Renyu Zhang, Yang Hu

High-Dimensional and Incomplete matrices, which usually contain a large amount of valuable latent information, can be well represented by a Latent Factor Analysis model.

DLBD: A Self-Supervised Direct-Learned Binary Descriptor

1 code implementation CVPR 2023 Bin Xiao, Yang Hu, Bo Liu, Xiuli Bi, Weisheng Li, Xinbo Gao

Since their binarization processes are not a component of the network, the learning-based binary descriptor cannot fully utilize the advances of deep learning.

Binarization Image Retrieval +1

RFPose-OT: RF-Based 3D Human Pose Estimation via Optimal Transport Theory

no code implementations26 Dec 2022 Cong Yu, Dongheng Zhang, Zhi Wu, Zhi Lu, Chunyang Xie, Yang Hu, Yan Chen

Different from existing methods that predict human poses from RF signals on the signal level directly, we consider the structure difference between the RF signals and the human poses, propose to transform the RF signals to the pose domain on the feature level based on Optimal Transport (OT) theory, and generate human poses from the transformed features.

3D Human Pose Estimation

Modeling Hierarchical Structural Distance for Unsupervised Domain Adaptation

no code implementations21 Nov 2022 Yingxue Xu, Guihua Wen, Yang Hu, Pei Yang

Compared with the ground distance of the conventional domain-level OT, the image-level OT captures structural associations among local regions of images that are beneficial to classification.

Image Classification Unsupervised Domain Adaptation

Two-Dimensional Weisfeiler-Lehman Graph Neural Networks for Link Prediction

1 code implementation20 Jun 2022 Yang Hu, Xiyuan Wang, Zhouchen Lin, Pan Li, Muhan Zhang

As pointed out by previous works, this two-step procedure results in low discriminating power, as 1-WL-GNNs by nature learn node-level representations instead of link-level.

Link Prediction Vocal Bursts Valence Prediction

NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

no code implementations25 May 2022 Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, Jin Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang, Javen Qinfeng Shi, Dong Gong, Dan Zhu, Mengdi Sun, Guannan Chen, Yang Hu, Haowei Li, Baozhu Zou, Zhen Liu, Wenjie Lin, Ting Jiang, Chengzhi Jiang, Xinpeng Li, Mingyan Han, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Juan Marín-Vega, Michael Sloth, Peter Schneider-Kamp, Richard Röttger, Chunyang Li, Long Bao, Gang He, Ziyao Xu, Li Xu, Gen Zhan, Ming Sun, Xing Wen, Junlin Li, Shuang Feng, Fei Lei, Rui Liu, Junxiang Ruan, Tianhong Dai, Wei Li, Zhan Lu, Hengyan Liu, Peian Huang, Guangyu Ren, Yonglin Luo, Chang Liu, Qiang Tu, Fangya Li, Ruipeng Gang, Chenghua Li, Jinjing Li, Sai Ma, Chenming Liu, Yizhen Cao, Steven Tel, Barthelemy Heyrman, Dominique Ginhac, Chul Lee, Gahyeon Kim, Seonghyun Park, An Gia Vien, Truong Thanh Nhat Mai, Howoon Yoon, Tu Vo, Alexander Holston, Sheir Zaheer, Chan Y. Park

The challenge is composed of two tracks with an emphasis on fidelity and complexity constraints: In Track 1, participants are asked to optimize objective fidelity scores while imposing a low-complexity constraint (i. e. solutions can not exceed a given number of operations).

Image Restoration Vocal Bursts Intensity Prediction

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

1 code implementation6 Apr 2022 Zhuangwei Shi, Yang Hu, Guangliang Mo, Jian Wu

Due to the complex volatility of the stock market, the research and prediction on the change of the stock price, can avoid the risk for the investors.

Stock Prediction Time Series +1

Enabling Efficient Deep Convolutional Neural Network-based Sensor Fusion for Autonomous Driving

no code implementations22 Feb 2022 Xiaoming Zeng, Zhendong Wang, Yang Hu

We also propose a Layer-sharing technique in the deep layer that can achieve better accuracy with less computational overhead.

Autonomous Driving Decision Making +1

SapientML: Synthesizing Machine Learning Pipelines by Learning from Human-Written Solutions

no code implementations18 Feb 2022 Ripon K. Saha, Akira Ura, Sonal Mahajan, Chenguang Zhu, Linyi Li, Yang Hu, Hiroaki Yoshida, Sarfraz Khurshid, Mukul R. Prasad

In this work we propose an AutoML technique SapientML, that can learn from a corpus of existing datasets and their human-written pipelines, and efficiently generate a high-quality pipeline for a predictive task on a new dataset.

AutoML BIG-bench Machine Learning +1

RFMask: A Simple Baseline for Human Silhouette Segmentation with Radio Signals

no code implementations25 Jan 2022 Zhi Wu, Dongheng Zhang, Chunyang Xie, Cong Yu, Jinbo Chen, Yang Hu, Yan Chen

To overcome such limitations, in this paper, we propose to utilize the radio signals, which can traverse obstacles and are unaffected by the lighting conditions to achieve silhouette segmentation.

Human Detection Segmentation

Radio-Assisted Human Detection

no code implementations16 Dec 2021 Chengrun Qiu, Dongheng Zhang, Yang Hu, Houqiang Li, Qibin Sun, Yan Chen

In this paper, we propose a radio-assisted human detection framework by incorporating radio information into the state-of-the-art detection methods, including anchor-based onestage detectors and two-stage detectors.

Human Detection Region Proposal

Learning Token-based Representation for Image Retrieval

1 code implementation12 Dec 2021 Hui Wu, Min Wang, Wengang Zhou, Yang Hu, Houqiang Li

Next, a refinement block is introduced to enhance the visual tokens with self-attention and cross-attention.

Image Retrieval Retrieval

RFGAN: RF-Based Human Synthesis

no code implementations7 Dec 2021 Cong Yu, Zhi Wu, Dongheng Zhang, Zhi Lu, Yang Hu, Yan Chen

To the best of our knowledge, this is the first work to generate optical images based on RF signals.

Unsupervised Domain Adaptation for RF-based Gesture Recognition

no code implementations20 Nov 2021 Bin-Bin Zhang, Dongheng Zhang, Yadong Li, Yang Hu, Yan Chen

Then we propose a confidence constraint loss to enhance the effectiveness of pseudo-labeling, and design two corresponding data augmentation methods based on the characteristic of the RF signals to strengthen the performance of the consistency regularization, which can make the framework more effective and robust.

Data Augmentation RF-based Gesture Recognition +1

Calculus of Consent via MARL: Legitimating the Collaborative Governance Supplying Public Goods

no code implementations20 Nov 2021 Yang Hu, Zhui Zhu, Sirui Song, Xue Liu, Yang Yu

Experimental results in an exemplary environment show that our MARL approach is able to demonstrate the effectiveness and necessity of restrictions on individual liberty for collaborative supply of public goods.

Multi-agent Reinforcement Learning

Towards Domain-Independent and Real-Time Gesture Recognition Using mmWave Signal

2 code implementations11 Nov 2021 Yadong Li, Dongheng Zhang, Jinbo Chen, Jinwei Wan, Dong Zhang, Yang Hu, Qibin Sun, Yan Chen

To enhance the robustness of the system and reduce data collecting efforts, we design a data augmentation framework for mmWave signals based on correlations between signal patterns and gesture variations.

Data Augmentation Gesture Recognition

Pipeline Parallelism for Inference on Heterogeneous Edge Computing

no code implementations28 Oct 2021 Yang Hu, Connor Imes, Xuanang Zhao, Souvik Kundu, Peter A. Beerel, Stephen P. Crago, John Paul N. Walters

We propose EdgePipe, a distributed framework for edge systems that uses pipeline parallelism to both speed up inference and enable running larger (and more accurate) models that otherwise cannot fit on single edge devices.

Edge-computing

NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks

1 code implementation26 Oct 2021 Wenxi Wang, Yang Hu, Mohit Tiwari, Sarfraz Khurshid, Kenneth McMillan, Risto Miikkulainen

Aiming to make GNN improvements practical, this paper proposes an approach called NeuroBack, which builds on two insights: (1) predicting phases (i. e., values) of variables appearing in the majority (or even all) of the satisfying assignments are essential for CDCL SAT solving, and (2) it is sufficient to query the neural model only once for the predictions before the SAT solving starts.

Intervention Adversarial Auto-Encoder

no code implementations29 Sep 2021 Yang Hu, Cheng Zhang

In this paper we propose a new method to stabilize the training process of the latent variables of adversarial auto-encoders, which we name Intervention Adversarial auto-encoder (IVAAE).

Heredity-aware Child Face Image Generation with Latent Space Disentanglement

no code implementations25 Aug 2021 Xiao Cui, Wengang Zhou, Yang Hu, Weilun Wang, Houqiang Li

The main idea is to disentangle the latent space of a pre-trained generation model and precisely control the face attributes of child images with clear semantics.

Disentanglement Image Generation

Personalized Outfit Recommendation With Learnable Anchors

no code implementations CVPR 2021 Zhi Lu, Yang Hu, Yan Chen, Bing Zeng

To accommodate the variety of users' preferences, we characterize each user with a set of anchors, i. e. a group of learnable latent vectors in the outfit space that are the representatives of the outfits the user likes.

Graph-MLP: Node Classification without Message Passing in Graph

1 code implementation8 Jun 2021 Yang Hu, Haoxuan You, Zhecan Wang, Zhicheng Wang, Erjin Zhou, Yue Gao

Graph Neural Network (GNN) has been demonstrated its effectiveness in dealing with non-Euclidean structural data.

Classification Graph Neural Network +1

Recent Advances on Non-Line-of-Sight Imaging: Conventional Physical Models, Deep Learning, and New Scenes

no code implementations28 Apr 2021 Ruixu Geng, Yang Hu, Yan Chen

As an emerging technology that has attracted huge attention, non-line-of-sight (NLOS) imaging can reconstruct hidden objects by analyzing the diffuse reflection on a relay surface, with broad application prospects in the fields of autonomous driving, medical imaging, and defense.

Autonomous Driving Deep Learning +1

Tackling Variabilities in Autonomous Driving

no code implementations21 Apr 2021 Yuqiong Qi, Yang Hu, Haibin Wu, Shen Li, Haiyu Mao, Xiaochun Ye, Dongrui Fan, Ninghui Sun

In this work, we aim to extensively explore the above system design challenges and these challenges motivate us to propose a comprehensive framework that synergistically handles the heterogeneous hardware accelerator design principles, system design criteria, and task scheduling mechanism.

Autonomous Driving Deep Reinforcement Learning +2

Neural Architecture Search For Fault Diagnosis

no code implementations19 Feb 2020 Xudong Li, Yang Hu, Jianhua Zheng, Mingtao Li

In this paper, we proposed a NAS method for fault diagnosis using reinforcement learning.

Deep Learning Neural Architecture Search +1

Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture

1 code implementation21 Sep 2019 Pawel Ladosz, Eseoghene Ben-Iwhiwhu, Jeffery Dick, Yang Hu, Nicholas Ketz, Soheil Kolouri, Jeffrey L. Krichmar, Praveen Pilly, Andrea Soltoggio

This paper presents a new neural architecture that combines a modulated Hebbian network (MOHN) with DQN, which we call modulated Hebbian plus Q network architecture (MOHQA).

Decision Making Deep Reinforcement Learning +2

Skill Transfer in Deep Reinforcement Learning under Morphological Heterogeneity

no code implementations14 Aug 2019 Yang Hu, Giovanni Montana

We demonstrate its performance compared to a state-of-the-art approach and several ablation cases, visualize and interpret the hidden factors, and identify avenues for future improvements.

Decoder Deep Reinforcement Learning +3

A Multi-Scale Mapping Approach Based on a Deep Learning CNN Model for Reconstructing High-Resolution Urban DEMs

no code implementations19 Jul 2019 Ling Jiang, Yang Hu, Xilin Xia, Qiuhua Liang, Andrea Soltoggio

Few attempts have been made for urban topography which is typically an integration of complex man-made and natural features.

Stochastic Region Pooling: Make Attention More Expressive

no code implementations22 Apr 2019 Mingnan Luo, Guihua Wen, Yang Hu, Dan Dai, Yingxue Xu

Global Average Pooling (GAP) is used by default on the channel-wise attention mechanism to extract channel descriptors.

Diversity

Inner-Imaging Networks: Put Lenses into Convolutional Structure

1 code implementation22 Apr 2019 Yang Hu, Guihua Wen, Mingnan Luo, Dan Dai, Wenming Cao, Zhiwen Yu, Wendy Hall

To deal with these problems, a novel Inner-Imaging architecture is proposed in this paper, which allows relationships between channels to meet the above requirement.

Chinese Herbal Recognition based on Competitive Attentional Fusion of Multi-hierarchies Pyramid Features

no code implementations23 Dec 2018 Yingxue Xu, Guihua Wen, Yang Hu, Mingnan Luo, Dan Dai, Yishan Zhuang

According to the characteristics of herbal images, we proposed the competitive attentional fusion pyramid networks to model the features of herbal image, which mdoels the relationship of feature maps from different levels, and re-weights multi-level channels with channel-wise attention mechanism.

Convolutional herbal prescription building method from multi-scale facial features

no code implementations17 Dec 2018 Huiqiang Liao, Guihua Wen, Yang Hu, Changjun Wang

In order to mine features from different granularities of faces, we design a multi-scale convolutional neural network based on three-grained face, which mines the patient's face information from the organs, local regions, and the entire face.

Feature Learning for Fault Detection in High-Dimensional Condition-Monitoring Signals

1 code implementation12 Oct 2018 Gabriel Michau, Yang Hu, Thomas Palmé, Olga Fink

The performance is first evaluated on a synthetic dataset that encompasses typical characteristics of condition monitoring data.

Dimensionality Reduction Fault Detection +3

FashionNet: Personalized Outfit Recommendation with Deep Neural Network

no code implementations4 Oct 2018 Tong He, Yang Hu

Our system, dubbed FashionNet, consists of two components, a feature network for feature extraction and a matching network for compatibility computation.

Recommendation Systems

Metabolize Neural Network

no code implementations4 Sep 2018 Dan Dai, Zhiwen Yu, Yang Hu, Wenming Cao, Mingnan Luo

It is self-evident that the significance of metabolize neuronal network(MetaNet) in model construction.

Competitive Inner-Imaging Squeeze and Excitation for Residual Network

1 code implementation24 Jul 2018 Yang Hu, Guihua Wen, Mingnan Luo, Dan Dai, Jiajiong Ma, Zhiwen Yu

In this work, we propose a competitive squeeze-excitation (SE) mechanism for the residual network.

A Fully Convolutional Two-Stream Fusion Network for Interactive Image Segmentation

1 code implementation6 Jul 2018 Yang Hu, Andrea Soltoggio, Russell Lock, Steve Carter

The proposed network includes two sub-networks: a two-stream late fusion network (TSLFN) that predicts the foreground at a reduced resolution, and a multi-scale refining network (MSRN) that refines the foreground at full resolution.

Image Segmentation Segmentation +2

Tongue image constitution recognition based on Complexity Perception method

no code implementations1 Mar 2018 Jiajiong Ma, Guihua Wen, Yang Hu, Tianyuan Chang, Haibin Zeng, Lijun Jiang, Jianzeng Qin

To evaluate the performance of our proposed method, we conduct experiments on three sizes of tongue datasets, in which deep convolutional neural network method and traditional digital image analysis method are respectively applied to extract features for tongue images.

Classification General Classification

Automatic construction of Chinese herbal prescription from tongue image via CNNs and auxiliary latent therapy topics

no code implementations23 Jan 2018 Yang Hu, Guihua Wen, Huiqiang Liao, Changjun Wang, Dan Dai, Zhiwen Yu

In order to adapt to the tongue image in a variety of photographic environments and construct herbal prescriptions, a neural network framework for prescription construction is designed.

Multi-stage Suture Detection for Robot Assisted Anastomosis based on Deep Learning

no code implementations8 Nov 2017 Yang Hu, Yun Gu, Jie Yang, Guang-Zhong Yang

In robotic surgery, task automation and learning from demonstration combined with human supervision is an emerging trend for many new surgical robot platforms.

Endoscopic Depth Measurement and Super-Spectral-Resolution Imaging

no code implementations19 Jun 2017 Jianyu Lin, Neil T. Clancy, Yang Hu, Ji Qi, Taran Tatla, Danail Stoyanov, Lena Maier-Hein, Daniel S. Elson

Intra-operative measurements of tissue shape and multi/ hyperspectral information have the potential to provide surgical guidance and decision making support.

Decision Making Optical Flow Estimation

Open-set Person Re-identification

no code implementations5 Aug 2014 Shengcai Liao, Zhipeng Mo, Jianqing Zhu, Yang Hu, Stan Z. Li

Person re-identification is becoming a hot research for developing both machine learning algorithms and video surveillance applications.

Metric Learning Person Re-Identification

Person Re-identification by Local Maximal Occurrence Representation and Metric Learning

1 code implementation CVPR 2015 Shengcai Liao, Yang Hu, Xiangyu Zhu, Stan Z. Li

In this paper, we propose an effective feature representation called Local Maximal Occurrence (LOMO), and a subspace and metric learning method called Cross-view Quadratic Discriminant Analysis (XQDA).

Metric Learning Person Re-Identification

Fast Matching by 2 Lines of Code for Large Scale Face Recognition Systems

no code implementations28 Feb 2013 Dong Yi, Zhen Lei, Yang Hu, Stan Z. Li

However, the use of this method is very generic and not limited in face recognition, which can be easily generalized to other biometrics as a post-processing module.

Computational Efficiency Face Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.