Search Results for author: Weiyao Lin

Found 70 papers, 29 papers with code

A new network-based algorithm for human activity recognition in video

no code implementations 21 Feb 2015 Weiyao Lin, Yuanzhe Chen, Jianxin Wu, Hanli Wang, Bin Sheng, Hongxiang Li

Based on this network, we further model people in the scene as packages while human activities can be modeled as the process of package transmission in the network.

Activity Detection Activity Recognition In Videos +2

Intra-and-Inter-Constraint-based Video Enhancement based on Piecewise Tone Mapping

no code implementations 21 Feb 2015 Yuanzhe Chen, Weiyao Lin, Chongyang Zhang, Zhenzhong Chen, Ning Xu, Jun Xie

In this paper, we propose a new intra-and-inter-constraint-based video enhancement approach aiming to 1) achieve high intra-frame quality of the entire picture where multiple region-of-interests (ROIs) can be adaptively and simultaneously enhanced, and 2) guarantee the inter-frame quality consistencies among video frames.

Tone Mapping Video Enhancement

A Heat-Map-based Algorithm for Recognizing Group Activities in Videos

no code implementations 21 Feb 2015 Weiyao Lin, Hang Chu, Jianxin Wu, Bin Sheng, Zhenzhong Chen

In this paper, a new heat-map-based (HMB) algorithm is proposed for group activity recognition.

Group Activity Recognition

Macroblock Classification Method for Video Applications Involving Motions

no code implementations 28 Feb 2015 Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou

We demonstrate that this low-computation-complexity method can efficiently catch the characteristics of the frame.

Change Detection Classification +2

Activity Recognition Using A Combination of Category Components And Local Models for Video Surveillance

no code implementations 28 Feb 2015 Weiyao Lin, Ming-Ting Sun, Radha Poovendran, Zhengyou Zhang

This paper presents a novel approach for automatic recognition of human activities for video surveillance applications.

Activity Recognition

Group Event Detection with a Varying Number of Group Members for Video Surveillance

no code implementations 28 Feb 2015 Weiyao Lin, Ming-Ting Sun, Radha Poovendran, Zhengyou Zhang

This paper presents a novel approach for automatic recognition of group activities for video surveillance applications.

Action Detection Activity Detection +1

Deep Spatial Pyramid: The Devil is Once Again in the Details

no code implementations 21 Apr 2015 Bin-Bin Gao, Xiu-Shen Wei, Jianxin Wu, Weiyao Lin

In this paper we show that by carefully making good choices for various detailed but important factors in a visual recognition framework using deep learning features, one can achieve a simple, efficient, yet highly accurate image classification system.

General Classification Image Classification

Person Re-identification with Correspondence Structure Learning

1 code implementation ICCV 2015 Yang Shen, Weiyao Lin, Junchi Yan, Mingliang Xu, Jianxin Wu, Jingdong Wang

This paper addresses the problem of handling spatial misalignments due to camera-view changes or human-pose variations in person re-identification.

Patch Matching Person Re-Identification

Tree-based Visualization and Optimization for Image Collection

no code implementations 17 Jul 2015 Xintong Han, Chongyang Zhang, Weiyao Lin, Mingliang Xu, Bin Sheng, Tao Mei

The visualization of an image collection is the process of displaying a collection of images on a screen under some specific layout requirements.

Unsupervised Trajectory Clustering via Adaptive Multi-Kernel-Based Shrinkage

no code implementations ICCV 2015 Hongteng Xu, Yang Zhou, Weiyao Lin, Hongyuan Zha

Facing the challenges of trajectory clustering, e.g., large variations within a cluster and ambiguities across clusters, we first introduce an adaptive multi-kernel-based estimation process to estimate the 'shrunk' positions and speeds of trajectories' points.

Anomaly Detection Clustering +1

RIDE: Reversal Invariant Descriptor Enhancement

no code implementations ICCV 2015 Lingxi Xie, Jingdong Wang, Weiyao Lin, Bo Zhang, Qi Tian

In many fine-grained object recognition datasets, image orientation (left/right) might vary from sample to sample.

Object Recognition

Fractal Dimension Invariant Filtering and Its CNN-based Implementation

no code implementations CVPR 2017 Hongteng Xu, Junchi Yan, Nils Persson, Weiyao Lin, Hongyuan Zha

By adding a nonlinear post-processing step behind anisotropic filter banks, we demonstrate that the proposed filtering method is capable of preserving the local invariance of the fractal dimension of image.

Texture Classification

Picking Deep Filter Responses for Fine-Grained Image Recognition

no code implementations CVPR 2016 Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, Qi Tian

Recognizing fine-grained sub-categories such as birds and dogs is extremely challenging due to the highly localized and subtle differences in some specific parts.

Fine-Grained Image Recognition

A Tube-and-Droplet-based Approach for Representing and Analyzing Motion Trajectories

no code implementations 10 Sep 2016 Weiyao Lin, Yang Zhou, Hongteng Xu, Junchi Yan, Mingliang Xu, Jianxin Wu, Zicheng Liu

Our approach first leverages the complete information from given trajectories to construct a thermal transfer field which provides a context-rich way to describe the global motion pattern in a scene.

3D Action Recognition Anomaly Detection +2

Motion Segmentation via Global and Local Sparse Subspace Optimization

no code implementations 24 Jan 2017 Michael Ying Yang, Hanno Ackermann, Weiyao Lin, Sitong Feng, Bodo Rosenhahn

In this paper, we propose a new framework for segmenting feature-based moving objects under affine subspace model.

Clustering Motion Segmentation +1

Learning Correspondence Structures for Person Re-identification

no code implementations 20 Mar 2017 Weiyao Lin, Yang Shen, Junchi Yan, Mingliang Xu, Jianxin Wu, Jingdong Wang, Ke Lu

We first introduce a boosting-based approach to learn a correspondence structure which indicates the patch-wise matching probabilities between images from a target camera pair.

Patch Matching Person Re-Identification

ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression

no code implementations ICCV 2017 Jian-Hao Luo, Jianxin Wu, Weiyao Lin

Similar experiments with ResNet-50 reveal that even for a compact network, ThiNet can also reduce more than half of the parameters and FLOPs, at the cost of roughly 1% top-5 accuracy drop.

Neural Network Compression

Kill Two Birds With One Stone: Boosting Both Object Detection Accuracy and Speed With adaptive Patch-of-Interest Composition

no code implementations 12 Aug 2017 Shihao Zhang, Weiyao Lin, Ping Lu, Weihua Li, Shuo Deng

Object detection is an important yet challenging task in video understanding & analysis, where one major challenge lies in the proper balance between two contradictive factors: detection accuracy and detection speed.

Object object-detection +2

Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion

no code implementations 20 Nov 2017 Weiyao Lin, Yang Mi, Jianxin Wu, Ke Lu, Hongkai Xiong

In this paper, we propose a novel deep-based framework for action recognition, which improves the recognition accuracy by: 1) deriving more precise features for representing actions, and 2) reducing the asynchrony between different information streams.

Action Recognition Temporal Action Localization

Unsupervised Deep Domain Adaptation for Pedestrian Detection

no code implementations 9 Feb 2018 Lihang Liu, Weiyao Lin, Lisheng Wu, Yong Yu, Michael Ying Yang

This paper addresses the problem of unsupervised domain adaptation on the task of pedestrian detection in crowded scenes.

Pedestrian Detection Unsupervised Domain Adaptation

Deep Neural Network Compression with Single and Multiple Level Quantization

1 code implementation 6 Mar 2018 Yuhui Xu, Yongzhuang Wang, Aojun Zhou, Weiyao Lin, Hongkai Xiong

In this paper, we propose two novel network quantization approaches, single-level network quantization (SLQ) for high-bit quantization and multi-level network quantization (MLQ) for extremely low-bit quantization (ternary). We are the first to consider the network quantization from both width and depth level.

Neural Network Compression Quantization

Enhancing HEVC Compressed Videos with a Partition-masked Convolutional Neural Network

no code implementations 10 May 2018 Xiaoyi He, Qiang Hu, Xintong Han, Xiaoyun Zhang, Chongyang Zhang, Weiyao Lin

In this paper, we propose a partition-masked Convolution Neural Network (CNN) to achieve compressed-video enhancement for the state-of-the-art coding standard, High Efficiency Video Coding (HEVC).

Multimedia

Enriched Long-term Recurrent Convolutional Network for Facial Micro-Expression Recognition

2 code implementations 22 May 2018 Huai-Qian Khor, John See, Raphael C.-W. Phan, Weiyao Lin

Facial micro-expression (ME) recognition has posed a huge challenge to researchers for its subtlety in motion and limited databases.

Data Augmentation Micro Expression Recognition +2

Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages

1 code implementation 29 Jul 2018 Yuxi Li, Jiuwei Li, Weiyao Lin, Jianguo Li

Based on the deeply supervised object detection (DSOD) framework, we propose Tiny-DSOD, dedicated to resource-restricted usages.

Object object-detection +1

Network Decoupling: From Regular to Depthwise Separable Convolutions

1 code implementation 16 Aug 2018 Jianbo Guo, Yuxi Li, Weiyao Lin, Yurong Chen, Jianguo Li

Depthwise separable convolution has shown great efficiency in network design, but requires a time-consuming training procedure with the full training set available.

object-detection Object Detection

Trained Rank Pruning for Efficient Deep Neural Networks

1 code implementation 6 Dec 2018 Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

We propose Trained Rank Pruning (TRP), which iterates low rank approximation and training.

Quantization

DNQ: Dynamic Network Quantization

no code implementations 6 Dec 2018 Yuhui Xu, Shuai Zhang, Yingyong Qi, Jiaxian Guo, Weiyao Lin, Hongkai Xiong

Network quantization is an effective method for the deployment of neural networks on memory and energy constrained mobile devices.

Quantization

Towards Accurate One-Stage Object Detection with AP-Loss

1 code implementation CVPR 2019 Kean Chen, Jianguo Li, Weiyao Lin, John See, Ji Wang, Ling-Yu Duan, Zhibo Chen, Changwei He, Junni Zou

For this purpose, we develop a novel optimization algorithm, which seamlessly combines the error-driven update scheme in perceptron learning and backpropagation algorithm in deep networks.

Classification General Classification +3

Group Re-Identification with Multi-grained Matching and Integration

no code implementations 17 May 2019 Weiyao Lin, Yuxi Li, Hao Xiao, John See, Junni Zou, Hongkai Xiong, Jingdong Wang, Tao Mei

The task of re-identifying groups of people under different camera views is an important yet less-studied problem. Group re-identification (Re-ID) is a very challenging task since it is not only adversely affected by common issues in traditional single object Re-ID problems such as viewpoint and human pose variations, but it also suffers from changes in group layout and group membership.

ATRW: A Benchmark for Amur Tiger Re-identification in the Wild

1 code implementation 13 Jun 2019 Shuyuan Li, Jianguo Li, Hanlin Tang, Rui Qian, Weiyao Lin

This paper tries to fill the gap by introducing a novel large-scale dataset, the Amur Tiger Re-identification in the Wild (ATRW) dataset.

Trained Rank Pruning for Efficient Deep Neural Networks

1 code implementation 9 Oct 2019 Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Wenrui Dai, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

To accelerate DNNs inference, low-rank approximation has been widely adopted because of its solid theoretical rationale and efficient implementations.

Spatial-Temporal Transformer Networks for Traffic Flow Forecasting

1 code implementation 9 Jan 2020 Mingxing Xu, Wenrui Dai, Chunmiao Liu, Xing Gao, Weiyao Lin, Guo-Jun Qi, Hongkai Xiong

In this paper, we propose a novel paradigm of Spatial-Temporal Transformer Networks (STTNs) that leverages dynamical directed spatial dependencies and long-range temporal dependencies to improve the accuracy of long-term traffic forecasting.

Traffic Prediction

TRP: Trained Rank Pruning for Efficient Deep Neural Networks

1 code implementation 30 Apr 2020 Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

The TRP trained network inherently has a low-rank structure, and is approximated with negligible performance loss, thus eliminating the fine-tuning process after low rank decomposition.

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events

no code implementations 9 May 2020 Weiyao Lin, Huabin Liu, Shizhan Liu, Yuxi Li, Rui Qian, Tao Wang, Ning Xu, Hongkai Xiong, Guo-Jun Qi, Nicu Sebe

To this end, we present a new large-scale dataset with comprehensive annotations, named Human-in-Events or HiEve (Human-centric video analysis in complex Events), for the understanding of human motions, poses, and actions in a variety of realistic events, especially in crowd & complex events.

Action Recognition Pose Estimation

Multiple Sound Sources Localization from Coarse to Fine

1 code implementation ECCV 2020 Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin

How to visually localize multiple sound sources in unconstrained videos is a formidable problem, especially when pairwise sound-object annotations are lacking.

PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments

1 code implementation ECCV 2020 Zhiming Chen, Kean Chen, Weiyao Lin, John See, Hui Yu, Yan Ke, Cong Yang

The experimental results show that PIoU loss can dramatically improve the performance of OBB detectors, particularly on objects with high aspect ratios and complex backgrounds.

object-detection Object Detection In Aerial Images +2

AP-Loss for Accurate One-Stage Object Detection

1 code implementation 17 Aug 2020 Kean Chen, Weiyao Lin, Jianguo Li, John See, Ji Wang, Junni Zou

This paper alleviates this issue by proposing a novel framework to replace the classification task in one-stage detectors with a ranking task, and adopting the Average-Precision loss (AP-loss) for the ranking problem.

Classification General Classification +3

CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization

no code implementations ECCV 2020 Yuxi Li, Weiyao Lin, John See, Ning Xu, Shugong Xu, Ke Yan, Cong Yang

Most current pipelines for spatio-temporal action localization connect frame-wise or clip-wise detection results to generate action proposals, where only local information is exploited and the efficiency is hindered by dense per-frame localization.

Action Detection Spatio-Temporal Action Localization +1

Finding Action Tubes with a Sparse-to-Dense Framework

no code implementations 30 Aug 2020 Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Li-Min Wang, Shugong Xu

The task of spatial-temporal action detection has attracted increasing attention among researchers.

Ranked #3 on Action Detection on UCF Sports (Video-mAP 0.2 metric)

Action Detection

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

1 code implementation NeurIPS 2020 Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou

First, we propose to learn robust object representations by aggregating the candidate sound localization results in the single source scenes.

Object Object Localization

Multi-Level Curriculum for Training a Distortion-Aware Barrel Distortion Rectification Model

no code implementations ICCV 2021 Kang Liao, Chunyu Lin, Lixin Liao, Yao Zhao, Weiyao Lin

In this paper, inspired by the curriculum learning, we analyze the barrel distortion rectification task in a progressive and meaningful manner.

Variational Pedestrian Detection

no code implementations CVPR 2021 Yuang Zhang, Huanyu He, Jianguo Li, Yuxi Li, John See, Weiyao Lin

Pedestrian detection in a crowd is a challenging task due to a high number of mutually-occluding human instances, which brings ambiguity and optimization difficulties to the current IoU-based ground truth assignment procedure in classical object detection methods.

object-detection Object Detection +2

SiamRCR: Reciprocal Classification and Regression for Visual Object Tracking

no code implementations 24 May 2021 Jinlong Peng, Zhengkai Jiang, Yueyang Gu, Yang Wu, Yabiao Wang, Ying Tai, Chengjie Wang, Weiyao Lin

In addition, we add a localization branch to predict the localization accuracy, so that it can work as the replacement of the regression assistance link during inference.

Classification Object +2

TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition

1 code implementation 10 Jul 2021 Shuyuan Li, Huabin Liu, Rui Qian, Yuxi Li, John See, Mengjuan Fei, Xiaoyuan Yu, Weiyao Lin

The first stage locates the action by learning a temporal affine transform, which warps each video feature to its action duration while dismissing the action-irrelevant feature (e.g., background).

Few-Shot action recognition Few Shot Action Recognition +2

Pyraformer: Low-Complexity Pyramidal Attention for Long-Range Time Series Modeling and Forecasting

2 code implementations ICLR 2022 Shizhan Liu, Hang Yu, Cong Liao, Jianguo Li, Weiyao Lin, Alex X. Liu, Schahram Dustdar

Accurate prediction of the future given the past based on time series data is of paramount importance, since it opens the door for decision making and risk management ahead of time.

Decision Making Management +2

Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic Perspective

1 code implementation 2 Nov 2021 Yuxi Li, Ning Xu, Wenjie Yang, John See, Weiyao Lin

We conduct comprehensive comparison and detailed analysis on challenging benchmarks of DAVIS16, DAVIS17 and Youtube-VOS, demonstrating that the cyclic mechanism is helpful to enhance segmentation quality, improve the robustness of VOS systems, and further provide qualitative comparison and interpretation on how different VOS algorithms work.

Segmentation Semantic Segmentation +2

Class-aware Sounding Objects Localization via Audiovisual Correspondence

1 code implementation 22 Dec 2021 Di Hu, Yake Wei, Rui Qian, Weiyao Lin, Ruihua Song, Ji-Rong Wen

To address this problem, we propose a two-stage step-by-step learning framework to localize and recognize sounding objects in complex audiovisual scenarios using only the correspondence between audio and vision.

Object object-detection +3

Speed Up Object Detection on Gigapixel-Level Images With Patch Arrangement

no code implementations CVPR 2022 Jiahao Fan, Huabin Liu, Wenjie Yang, John See, Aixin Zhang, Weiyao Lin

With the appearance of super high-resolution (e.g., gigapixel-level) images, performing efficient object detection on such images becomes an important issue.

Object object-detection +1

Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training

no code implementations 11 Jan 2022 Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei

Vision-language pre-training has been an emerging and fast-developing research topic, which transfers multi-modal knowledge from rich-resource pre-training task to limited-resource downstream tasks.

Image Captioning Language Modelling +3

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing

1 code implementation 13 Feb 2022 Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou

Specifically, we observe that the previous practice of learning only a single audio representation is insufficient due to the additive nature of audio signals.

End-to-end video instance segmentation via spatial-temporal graph neural networks

1 code implementation ICCV 2021 Tao Wang, Ning Xu, Kean Chen, Weiyao Lin

Specifically, graph nodes representing instance features are used for detection and segmentation while graph edges representing instance relations are used for tracking.

Instance Segmentation Segmentation +2

Controllable Augmentations for Video Representation Learning

no code implementations 30 Mar 2022 Rui Qian, Weiyao Lin, John See, Dian Li

The major reason is that the positive pairs, i.e., different clips sampled from the same video, have limited temporal receptive field, and usually share similar background but differ in motions.

Action Recognition Contrastive Learning +3

FRIH: Fine-grained Region-aware Image Harmonization

no code implementations 13 May 2022 Jinlong Peng, Zekun Luo, Liang Liu, Boshen Zhang, Tao Wang, Yabiao Wang, Ying Tai, Chengjie Wang, Weiyao Lin

Image harmonization aims to generate a more realistic appearance of foreground and background for a composite image.

Image Harmonization

Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition

1 code implementation 20 Jul 2022 Huabin Liu, Weixian Lv, John See, Weiyao Lin

In this paper, we propose a novel video frame sampler for few-shot action recognition to address this issue, where task-specific spatial-temporal frame sampling is achieved via a temporal selector (TS) and a spatial amplifier (SA).

Few-Shot action recognition Few Shot Action Recognition

Low-Rank Winograd Transformation for 3D Convolutional Neural Networks

no code implementations 26 Jan 2023 Ziran Qin, Mingbao Lin, Weiyao Lin

This paper focuses on Winograd transformation in 3D convolutional neural networks (CNNs) that are more over-parameterized compared with the 2D version.

Spatio-Temporal Point Process for Multiple Object Tracking

no code implementations 5 Feb 2023 Tao Wang, Kean Chen, Weiyao Lin, John See, Zenghui Zhang, Qian Xu, Xia Jia

As such, we propose a novel framework that can effectively predict and mask-out the noisy and confusing detection results before associating the objects into trajectories.

Multiple Object Tracking Object

Few-shot Action Recognition via Intra- and Inter-Video Information Maximization

no code implementations 10 May 2023 Huabin Liu, Weiyao Lin, Tieyuan Chen, Yuxi Li, Shuyuan Li, John See

The alignment model performs temporal and spatial action alignment sequentially at the feature level, leading to more precise measurements of inter-video similarity.

Few-Shot action recognition Few Shot Action Recognition +2

BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable Basis

1 code implementation NeurIPS 2023 Zelin Ni, Hang Yu, Shizhan Liu, Jianguo Li, Weiyao Lin

Bases have become an integral part of modern deep learning-based models for time series forecasting due to their ability to act as feature extractors or future references.

Contrastive Learning Self-Supervised Learning +2

Density Matters: Improved Core-set for Active Domain Adaptive Segmentation

no code implementations 15 Dec 2023 Shizhan Liu, Zhengkai Jiang, Yuxi Li, Jinlong Peng, Yabiao Wang, Weiyao Lin

Active domain adaptation has emerged as a solution to balance the expensive annotation cost and the performance of trained models in semantic segmentation.

Domain Adaptation Semantic Segmentation

HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

1 code implementation 21 Mar 2024 Yihang Chen, Qianyi Wu, Jianfei Cai, Mehrtash Harandi, Weiyao Lin

3D Gaussian Splatting (3DGS) has emerged as a promising framework for novel view synthesis, boasting rapid rendering speed with high fidelity.

Attribute Novel View Synthesis +1
