Search Results for author: Hao Shen

Found 41 papers, 12 papers with code

An Adaptive Dictionary Learning Approach for Modeling Dynamical Textures

no code implementations19 Dec 2013 Xian Wei, Hao Shen, Martin Kleinsteuber

Video representation is an important and challenging task in the computer vision community.

Dictionary Learning

Texture Retrieval via the Scattering Transform

no code implementations12 Jan 2015 Alexander Sagel, Dominik Meyer, Hao Shen

Our approach employs a recently developed method, the so-called Scattering transform, for the process of feature extraction in texture retrieval.

Content-Based Image Retrieval Retrieval +1

$\ell_1$ Regularized Gradient Temporal-Difference Learning

no code implementations5 Oct 2016 Dominik Meyer, Hao Shen, Klaus Diepold

In this paper, we study the Temporal Difference (TD) learning with linear value function approximation.

Reinforcement Learning in Conflicting Environments for Autonomous Vehicles

no code implementations22 Oct 2016 Dominik Meyer, Johannes Feldmaier, Hao Shen

In this work, we investigate the application of Reinforcement Learning to two well known decision dilemmas, namely Newcomb's Problem and Prisoner's Dilemma.

Autonomous Vehicles reinforcement-learning +1

Towards a Mathematical Understanding of the Difficulty in Learning with Feedforward Neural Networks

no code implementations CVPR 2018 Hao Shen

Training deep neural networks for solving machine learning problems is one great challenge in the field, mainly due to its associated optimisation problem being highly non-convex.

Dynamic Variational Autoencoders for Visual Process Modeling

1 code implementation20 Mar 2018 Alexander Sagel, Hao Shen

This work studies the problem of modeling visual processes by leveraging deep generative architectures for learning linear, Gaussian representations from observed sequences.

Trace Quotient with Sparsity Priors for Learning Low Dimensional Image Representations

no code implementations8 Oct 2018 Xian Wei, Hao Shen, Martin Kleinsteuber

We propose a generic algorithmic framework, which leverages two classic representation learning paradigms, i. e., sparse representation and the trace quotient criterion.

Data Visualization Dimensionality Reduction +1

A Differential Topological View of Challenges in Learning with Feedforward Neural Networks

no code implementations26 Nov 2018 Hao Shen

Among many unsolved puzzles in theories of Deep Neural Networks (DNNs), there are three most fundamental challenges that highly demand solutions, namely, expressibility, optimisability, and generalisability.

Representation Learning

A Generative Map for Image-based Camera Localization

1 code implementation18 Feb 2019 Mingpan Guo, Stefan Matthes, Jiaojiao Ye, Hao Shen

For localization, we show that Generative Map achieves comparable performance with current regression models.

Camera Localization regression +2

Grand Challenge of 106-Point Facial Landmark Localization

no code implementations9 May 2019 Yinglu Liu, Hao Shen, Yue Si, Xiaobo Wang, Xiangyu Zhu, Hailin Shi, Zhibin Hong, Hanqi Guo, Ziyuan Guo, Yanqin Chen, Bi Li, Teng Xi, Jun Yu, Haonian Xie, Guochen Xie, Mengyan Li, Qing Lu, Zengfu Wang, Shenqi Lai, Zhenhua Chai, Xiaoming Wei

However, previous competitions on facial landmark localization (i. e., the 300-W, 300-VW and Menpo challenges) aim to predict 68-point landmarks, which are incompetent to depict the structure of facial components.

Face Alignment Face Recognition +2

A High-Efficiency Framework for Constructing Large-Scale Face Parsing Benchmark

no code implementations13 May 2019 Yinglu Liu, Hailin Shi, Yue Si, Hao Shen, Xiaobo Wang, Tao Mei

Each image is provided with accurate annotation of a 11-category pixel-level label map along with coordinates of 106-point landmarks.

Face Alignment Face Detection +3

A New Dataset and Boundary-Attention Semantic Segmentation for Face Parsing

no code implementations Proceedings of the AAAI Conference on Artificial Intelligence 2020 Yinglu Liu, Hailin Shi, Hao Shen, Yue Si, Xiaobo Wang, Tao Mei

The dataset is publicly accessible to the community for boosting the advance of face parsing. 1 Second, a simple yet effective Boundary-Attention Semantic Segmentation (BASS) method is proposed for face parsing, which contains a three-branch network with elaborately developed loss functions to fully exploit the boundary information.

Face Parsing Image Generation +1

CenterMask: single shot instance segmentation with point representation

no code implementations CVPR 2020 Yuqing Wang, Zhaoliang Xu, Hao Shen, Baoshan Cheng, Lirong Yang

Accordingly, we decompose the instance segmentation into two parallel subtasks: Local Shape prediction that separates instances even in overlapping conditions, and Global Saliency generation that segments the whole image in a pixel-to-pixel manner.

Instance Segmentation Object +2

3D Scene Geometry-Aware Constraint for Camera Localization with Deep Learning

no code implementations13 May 2020 Mi Tian, Qiong Nie, Hao Shen

Camera localization is a fundamental and key component of autonomous driving vehicles and mobile robots to localize themselves globally for further environment perception, path planning and motion control.

Autonomous Driving Camera Localization

Metapath- and Entity-aware Graph Neural Network for Recommendation

1 code implementation22 Oct 2020 Muhammad Umer Anwaar, Zhiwei Han, Shyam Arumugaswamy, Rayyan Ahmad Khan, Thomas Weber, Tianming Qiu, Hao Shen, Yuanting Liu, Martin Kleinsteuber

In this paper, we employ collaborative subgraphs (CSGs) and metapaths to form metapath-aware subgraphs, which explicitly capture sequential semantics in graph structures.

Link Prediction Recommendation Systems

Attentional Separation-and-Aggregation Network for Self-supervised Depth-Pose Learning in Dynamic Scenes

no code implementations18 Nov 2020 Feng Gao, Jincheng Yu, Hao Shen, Yu Wang, Huazhong Yang

Learning depth and ego-motion from unlabeled videos via self-supervision from epipolar projection can improve the robustness and accuracy of the 3D perception and localization of vision-based robots.

End-to-End Video Instance Segmentation with Transformers

2 code implementations CVPR 2021 Yuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, Huaxia Xia

Here, we propose a new video instance segmentation framework built upon Transformers, termed VisTR, which views the VIS task as a direct end-to-end parallel sequence decoding/prediction problem.

Instance Segmentation Segmentation +3

Dynamic Texture Recognition via Nuclear Distances on Kernelized Scattering Histogram Spaces

1 code implementation1 Feb 2021 Alexander Sagel, Julian Wörmann, Hao Shen

Distance-based dynamic texture recognition is an important research field in multimedia processing with applications ranging from retrieval to segmentation of video data.

Classification Dynamic Texture Recognition +2

Large $N$ limit of the $O(N)$ linear sigma model in 3D

no code implementations4 Feb 2021 Hao Shen, Rongchan Zhu, Xiangchan Zhu

We prove tightness of the invariant measures in the large N limit.

Quantization Probability Mathematical Physics Analysis of PDEs Mathematical Physics

Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation

no code implementations16 Jun 2021 Martin Gottwald, Sven Gronauer, Hao Shen, Klaus Diepold

First, we conduct a critical point analysis of the error function and provide technical insights on optimisation and design choices for neural networks.

Continuous Control

Identifying Influential Users in Unknown Social Networks for Adaptive Incentive Allocation Under Budget Restriction

no code implementations13 Jul 2021 Shiqing Wu, Weihua Li, Hao Shen, Quan Bai

To tackle the aforementioned challenges, in this paper, we propose a novel algorithm for exploring influential users in unknown networks, which can estimate the influential relationships among users based on their historical behaviors and without knowing the topology of the network.

Recommendation Systems

Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning

no code implementations NeurIPS 2021 Jinxin Liu, Hao Shen, Donglin Wang, Yachen Kang, Qiangxing Tian

Unsupervised reinforcement learning aims to acquire skills without prior goal representations, where an agent automatically explores an open-ended environment to represent goals and learn the goal-conditioned policy.

reinforcement-learning Reinforcement Learning (RL) +2

HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction

1 code implementation CVPR 2022 Yunze Liu, Yun Liu, Che Jiang, Kangbo Lyu, Weikang Wan, Hao Shen, Boqiang Liang, Zhoujie Fu, He Wang, Li Yi

We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze the research of category-level human-object interaction.

Action Segmentation Benchmarking +6

Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations

2 code implementations4 Mar 2022 Hao Shen, Weikang Wan, He Wang

Generalizable object manipulation skills are critical for intelligent and multi-functional robots to work in real-world complex scenes.

Imitation Learning

Learning from Attacks: Attacking Variational Autoencoder for Improving Image Classification

no code implementations11 Mar 2022 Jianzhang Zheng, Fan Yang, Hao Shen, Xuan Tang, Mingsong Chen, Liang Song, Xian Wei

We propose an algorithmic framework that leverages the advantages of the DNNs for data self-expression and task-specific predictions, to improve image classification.

Classification Image Classification

3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection

1 code implementation CVPR 2022 Junyu Luo, Jiahui Fu, Xianghao Kong, Chen Gao, Haibing Ren, Hao Shen, Huaxia Xia, Si Liu

3D visual grounding aims to locate the referred target object in 3D point cloud scenes according to a free-form language description.

Visual Grounding

ISA-Net: Improved spatial attention network for PET-CT tumor segmentation

no code implementations4 Nov 2022 Zhengyong Huang, Sijuan Zou, Guoshuai Wang, Zixiang Chen, Hao Shen, HaiYan Wang, Na Zhang, Lu Zhang, Fan Yang, Haining Wangg, Dong Liang, Tianye Niu, Xiaohua Zhuc, Zhanli Hua

In this paper, we propose a deep learning segmentation method based on multimodal positron emission tomography-computed tomography (PET-CT), which combines the high sensitivity of PET and the precise anatomical information of CT. We design an improved spatial attention network(ISA-Net) to increase the accuracy of PET or CT in detecting tumors, which uses multi-scale convolution operation to extract feature information and can highlight the tumor region location information and suppress the non-tumor region location information.

Segmentation STS +1

Adaptive Dynamic Filtering Network for Image Denoising

1 code implementation22 Nov 2022 Hao Shen, Zhong-Qiu Zhao, Wandi Zhang

To alleviate these issues, we propose to employ dynamic convolution to improve the learning of high-frequency and multi-scale features.

Image Denoising

UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy

no code implementations CVPR 2023 Yinzhen Xu, Weikang Wan, Jialiang Zhang, Haoran Liu, Zikang Shan, Hao Shen, Ruicheng Wang, Haoran Geng, Yijia Weng, Jiayi Chen, Tengyu Liu, Li Yi, He Wang

Trained on our synthesized large-scale dexterous grasp dataset, this model enables us to sample diverse and high-quality dexterous grasp poses for the object point cloud. For the second stage, we propose to replace the motion planning used in parallel gripper grasping with a goal-conditioned grasp policy, due to the complexity involved in dexterous grasping execution.

Motion Planning

Autoencoders with Intrinsic Dimension Constraints for Learning Low Dimensional Image Representations

no code implementations16 Apr 2023 Jianzhang Zheng, Hao Shen, Jian Yang, Xuan Tang, Mingsong Chen, Hui Yu, Jielong Guo, Xian Wei

Motivated by the important role of ID, in this paper, we propose a novel deep representation learning approach with autoencoder, which incorporates regularization of the global and local ID constraints into the reconstruction of data representations.

Image Classification Representation Learning

SPSQL: Step-by-step Parsing Based Framework for Text-to-SQL Generation

no code implementations10 May 2023 Ran Shen, Gang Sun, Hao Shen, Yiling Li, Liangfeng Jin, Han Jiang

Then, we construct data formats of different subtasks based on existing data and improve the accuracy of the overall model by improving the accuracy of each submodel.

Data Augmentation Marketing +6

Potential-based Credit Assignment for Cooperative RL-based Testing of Autonomous Vehicles

no code implementations28 May 2023 Utku Ayvaz, Chih-Hong Cheng, Hao Shen

While autonomous vehicles (AVs) may perform remarkably well in generic real-life cases, their irrational action in some unforeseen cases leads to critical safety concerns.

Autonomous Vehicles counterfactual +2

Mutual Information-driven Triple Interaction Network for Efficient Image Dehazing

1 code implementation14 Aug 2023 Hao Shen, Zhong-Qiu Zhao, Yulun Zhang, Zhao Zhang

Multi-stage architectures have exhibited efficacy in image dehazing, which usually decomposes a challenging task into multiple more tractable sub-tasks and progressively estimates latent hazy-free images.

Image Dehazing

Agent Group Chat: An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior

1 code implementation20 Mar 2024 Zhouhong Gu, Xiaoxuan Zhu, Haoran Guo, Lin Zhang, Yin Cai, Hao Shen, Jiangjie Chen, Zheyu Ye, Yifei Dai, Yan Gao, Yao Hu, Hongwei Feng, Yanghua Xiao

By configuring specific environmental settings within Agent Group Chat, we are able to assess whether agents exhibit behaviors that align with human expectations.

Cannot find the paper you are looking for? You can Submit a new open access paper.