Search Results for author: Weiming Hu

Found 45 papers, 14 papers with code

Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos

1 code implementation5 Nov 2021 Minglang Qiao, Yufan Liu, Mai Xu, Xin Deng, Bing Li, Weiming Hu, Ali Borji

In this paper, we propose a multitask learning method for visual-audio saliency prediction and sound source localization on multi-face video by leveraging visual, audio and face information.

Eye Tracking Saliency Prediction

SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction

no code implementations18 Sep 2021 Zekun Li, Yufan Liu, Bing Li, Weiming Hu, Kebin Wu, Pei Wang

CDI builds the global attention and interaction among different levels in decoupled space which also solves the problem of heavy computation.

Differentiable Convolution Search for Point Cloud Processing

no code implementations ICCV 2021 Xing Nie, Yongcheng Liu, Shaohong Chen, Jianlong Chang, Chunlei Huo, Gaofeng Meng, Qi Tian, Weiming Hu, Chunhong Pan

It can work in a purely data-driven manner and thus is capable of auto-creating a group of suitable convolutions for geometric shape modeling.

Learn to Match: Automatic Matching Network Design for Visual Tracking

1 code implementation ICCV 2021 Zhipeng Zhang, Yihao Liu, Xiao Wang, Bing Li, Weiming Hu

Siamese tracking has achieved groundbreaking performance in recent years, where the essence is the efficient matching operator cross-correlation and its variants.

Visual Tracking

GasHisSDB: A New Gastric Histopathology Image Dataset for Computer Aided Diagnosis of Gastric Cancer

no code implementations4 Jun 2021 Weiming Hu, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Jiquan Ma, Yong Zhang, HaoYuan Chen, Wanli Liu, Changhao Sun, YuDong Yao, Hongzan Sun, Marcin Grzegorzek

In order to prove that the methods of different periods in the field of image classification have discrepancies on GasHisSDB, we select a variety of classifiers for evaluation.

Classification Image Classification

A Comparison for Anti-noise Robustness of Deep Learning Classification Methods on a Tiny Object Image Dataset: from Convolutional Neural Network to Visual Transformer and Performer

no code implementations3 Jun 2021 Ao Chen, Chen Li, HaoYuan Chen, Hechen Yang, Peng Zhao, Weiming Hu, Wanli Liu, Shuojia Zou, Marcin Grzegorzek

In this paper, we first briefly review the development of Convolutional Neural Network and Visual Transformer in deep learning, and introduce the sources and development of conventional noises and adversarial attacks.

Classification Image Classification

A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking

no code implementations6 May 2021 Zhenbang Li, Yaya Shi, Jin Gao, Shaoru Wang, Bing Li, Pengpeng Liang, Weiming Hu

In this paper, we show the existence of universal perturbations that can enable the targeted attack, e. g., forcing a tracker to follow the ground-truth trajectory with specified offsets, to be video-agnostic and free from inference in a network.

Visual Tracking

GasHis-Transformer: A Multi-scale Visual Transformer Approach for Gastric Histopathology Image Classification

no code implementations29 Apr 2021 HaoYuan Chen, Chen Li, Xiaoyan Li, Ge Wang, Weiming Hu, Yixin Li, Wanli Liu, Changhao Sun, YuDong Yao, Yueyang Teng, Marcin Grzegorzek

Finally, a comparative study is performed to test the generalizability with both H&E and immunohistochemical stained images on a lymphoma image dataset and a breast cancer dataset, producing comparable F1-scores (85. 6% and 82. 8%) and accuracies (83. 9% and 89. 4%), respectively.

Adversarial Attack General Classification +2

PDNet: Towards Better One-stage Object Detection with Prediction Decoupling

no code implementations28 Apr 2021 Li Yang, Yan Xu, Shaoru Wang, Chunfeng Yuan, Ziqi Zhang, Bing Li, Weiming Hu

However, the most suitable positions for inferring different targets, i. e., the object category and boundaries, are generally different.

Object Detection

One More Check: Making "Fake Background" Be Tracked Again

1 code implementation19 Apr 2021 Chao Liang, Zhipeng Zhang, Xue Zhou, Bing Li, Yi Lu, Weiming Hu

The one-shot multi-object tracking, which integrates object detection and ID embedding extraction into a unified network, has achieved groundbreaking results in recent years.

Multi-Object Tracking Object Detection

Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model

1 code implementation ECCV 2020 Yufan Liu, Minglang Qiao, Mai Xu, Bing Li, Weiming Hu, Ali Borji

Inspired by the findings of our investigation, we propose a novel multi-modal video saliency model consisting of three branches: visual, audio and face.

Eye Tracking Saliency Prediction

Open-book Video Captioning with Retrieve-Copy-Generate Network

no code implementations CVPR 2021 Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Ying Deng, Weiming Hu

Due to the rapid emergence of short videos and the requirement for content understanding and creation, the video captioning task has received increasing attention in recent years.

Video Captioning

Weather Analogs with a Machine Learning Similarity Metric for Renewable Resource Forecasting

no code implementations8 Mar 2021 Weiming Hu, Guido Cervone, George Young, Luca Delle Monache

The central core of the AnEn technique is a similarity metric that sorts historical forecasts with respect to a new target prediction.

Feature Selection

Using Long Short-Term Memory (LSTM) and Internet of Things (IoT) for localized surface temperature forecasting in an urban environment

no code implementations4 Feb 2021 Manzhu Yu, Fangcao Xu, Weiming Hu, Jian Sun, Guido Cervone

Meanwhile, by using IoT observations, the spatial resolution of air temperature predictions is significantly improved.

DSIC: Dynamic Sample-Individualized Connector for Multi-Scale Object Detection

no code implementations16 Nov 2020 Zekun Li, Yufan Liu, Bing Li, Weiming Hu

Furthermore, these two components are both plug-and-play and can be embedded in any backbone.

Object Detection

Towards Accurate Pixel-wise Object Tracking by Attention Retrieval

1 code implementation6 Aug 2020 Zhipeng Zhang, Bing Li, Weiming Hu, Houwen Peng

We first build a look-up-table (LUT) with the ground-truth mask in the starting frame, and then retrieves the LUT to obtain an attention map for spatial constraints.

Object Tracking

Object Relational Graph with Teacher-Recommended Learning for Video Captioning

no code implementations CVPR 2020 Ziqi Zhang, Yaya Shi, Chunfeng Yuan, Bing Li, Peijin Wang, Weiming Hu, Zheng-Jun Zha

In this paper, we propose a complete video captioning system including both a novel model and an effective training strategy.

 Ranked #1 on Video Captioning on MSR-VTT (using extra training data)

Language Modelling Video Captioning

RDSNet: A New Deep Architecture for Reciprocal Object Detection and Instance Segmentation

1 code implementation11 Dec 2019 Shaoru Wang, Yongchao Gong, Junliang Xing, Lichao Huang, Chang Huang, Weiming Hu

To reciprocate these two tasks, we design a two-stream structure to learn features on both the object level (i. e., bounding boxes) and the pixel level (i. e., instance masks) jointly.

Instance Segmentation Object Detection +2

Probabilistic Forecasting using Deep Generative Models

no code implementations26 Sep 2019 Alessandro Fanfarillo, Behrooz Roozitalab, Weiming Hu, Guido Cervone

In order to provide a meaningful probabilistic forecast, the AnEn method requires storing a historical set of past predictions and observations in memory for a period of at least several months and spanning the seasons relevant for the prediction of interest.

Multimodal Semantic Attention Network for Video Captioning

no code implementations8 May 2019 Liang Sun, Bing Li, Chunfeng Yuan, Zheng-Jun Zha, Weiming Hu

Inspired by the fact that different modalities in videos carry complementary information, we propose a Multimodal Semantic Attention Network(MSAN), which is a new encoder-decoder framework incorporating multimodal semantic attributes for video captioning.

General Classification Multi-Label Classification +1

Fast Online Object Tracking and Segmentation: A Unifying Approach

3 code implementations CVPR 2019 Qiang Wang, Li Zhang, Luca Bertinetto, Weiming Hu, Philip H. S. Torr

In this paper we illustrate how to perform both visual object tracking and semi-supervised video object segmentation, in real-time, with a single simple approach.

Real-Time Visual Tracking Semi-Supervised Semantic Segmentation +2

Visual Tracking via Spatially Aligned Correlation Filters Network

no code implementations ECCV 2018 Mengdan Zhang, Qiang Wang, Junliang Xing, Jin Gao, Peixi Peng, Weiming Hu, Steve Maybank

Correlation filters based trackers rely on a periodic assumption of the search sample to efficiently distinguish the target from the background.

Visual Tracking

Distractor-aware Siamese Networks for Visual Object Tracking

1 code implementation ECCV 2018 Zheng Zhu, Qiang Wang, Bo Li, Wei Wu, Junjie Yan, Weiming Hu

During the off-line training phase, an effective sampling strategy is introduced to control this distribution and make the model focus on the semantic distractors.

Incremental Learning Visual Object Tracking +1

Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification

no code implementations ECCV 2018 Yang Du, Chunfeng Yuan, Bing Li, Lili Zhao, Yangxi Li, Weiming Hu

Furthermore, since different layers in a deep network capture feature maps of different scales, we use these feature maps to construct a spatial pyramid and then utilize multi-scale information to obtain more accurate attention scores, which are used to weight the local features in all spatial positions of feature maps to calculate attention maps.

Action Classification Classification +1

Deep Cost-Sensitive and Order-Preserving Feature Learning for Cross-Population Age Estimation

no code implementations CVPR 2018 Kai Li, Junliang Xing, Chi Su, Weiming Hu, Yundong Zhang, Stephen Maybank

First, a novel cost-sensitive multi-task loss function is designed to learn transferable aging features by training on the source population.

Age Estimation

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking

2 code implementations CVPR 2018 Qiang Wang, Zhu Teng, Junliang Xing, Jin Gao, Weiming Hu, Stephen Maybank

The RASNet model reformulates the correlation filter within a Siamese tracking framework, and introduces different kinds of the attention mechanisms to adapt the model without updating the model online.

Object Tracking Representation Learning +1

Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection From Videos

no code implementations CVPR 2017 Yang Du, Chunfeng Yuan, Bing Li, Weiming Hu, Stephen Maybank

In dynamic object detection, it is challenging to construct an effective model to sufficiently characterize the spatial-temporal properties of the background.

Object Detection

DCFNet: Discriminant Correlation Filters Network for Visual Tracking

5 code implementations13 Apr 2017 Qiang Wang, Jin Gao, Junliang Xing, Mengdan Zhang, Weiming Hu

In this work, we present an end-to-end lightweight network architecture, namely DCFNet, to learn the convolutional features and perform the correlation tracking process simultaneously.

Object Tracking Visual Tracking

Tensor Power Iteration for Multi-Graph Matching

no code implementations CVPR 2016 Xinchu Shi, Haibin Ling, Weiming Hu, Junliang Xing, Yanning Zhang

Due to its wide range of applications, matching between two graphs has been extensively studied and remains an active topic.

Graph Matching

Local Subspace Collaborative Tracking

no code implementations ICCV 2015 Lin Ma, Xiaoqin Zhang, Weiming Hu, Junliang Xing, Jiwen Lu, Jie zhou

To address this, this paper presents a local subspace collaborative tracking method for robust visual tracking, where multiple linear and nonlinear subspaces are learned to better model the nonlinear relationship of object appearances.

Object Tracking Visual Tracking

Towards Multi-view and Partially-Occluded Face Alignment

no code implementations CVPR 2014 Junliang Xing, Zhiheng Niu, Junshi Huang, Weiming Hu, Shuicheng Yan

During each training stage, the SRD model learns a relational dictionary to capture consistent relationships between face appearance and shape, which are respectively modeled by the pose-indexed image features and the shape displacements for current estimated landmarks.

Face Alignment

Human Action Recognition Based on Context-Dependent Graph Kernels

no code implementations CVPR 2014 Baoxin Wu, Chunfeng Yuan, Weiming Hu

Then, the proposed CGKs are applied to measure the similarity between actions represented by the two-graph model.

Action Recognition

Multi-target Tracking with Motion Context in Tensor Power Iteration

no code implementations CVPR 2014 Xinchu Shi, Haibin Ling, Weiming Hu, Chunfeng Yuan, Junliang Xing

In this paper, we model interactions between neighbor targets by pair-wise motion context, and further encode such context into the global association optimization.

Illumination Estimation Based on Bilayer Sparse Coding

no code implementations CVPR 2013 Bing Li, Weihua Xiong, Weiming Hu, Houwen Peng

In this paper, we propose a novel bilayer sparse coding model for illumination estimation that considers image similarity in terms of both low level color distribution and high level image scene content simultaneously.

Color Constancy

Multi-target Tracking by Rank-1 Tensor Approximation

no code implementations CVPR 2013 Xinchu Shi, Haibin Ling, Junling Xing, Weiming Hu

In this paper we formulate multi-target tracking (MTT) as a rank-1 tensor approximation problem and propose an 1 norm tensor power iteration solution.

Multi-task Sparse Learning with Beta Process Prior for Action Recognition

no code implementations CVPR 2013 Chunfeng Yuan, Weiming Hu, Guodong Tian, Shuang Yang, Haoran Wang

In this paper, we formulate human action recognition as a novel Multi-Task Sparse Learning(MTSL) framework which aims to construct a test sample with multiple features from as few bases as possible.

Action Recognition Sparse Learning

3D R Transform on Spatio-temporal Interest Points for Action Recognition

no code implementations CVPR 2013 Chunfeng Yuan, Xi Li, Weiming Hu, Haibin Ling, Stephen Maybank

In this paper, we propose a new global feature to capture the detailed geometrical distribution of interest points.

Action Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.