Search Results for author: Weiyao Lin

Found 70 papers, 29 papers with code

Towards Good Practices for Action Video Encoding

no code implementations • CVPR 2014 • Jianxin Wu, Yu Zhang, Weiyao Lin

High dimensional representations such as VLAD or FV have shown excellent accuracy in action recognition.

Action Recognition Temporal Action Localization

Paper
Add Code

A new network-based algorithm for human activity recognition in video

no code implementations • 21 Feb 2015 • Weiyao Lin, Yuanzhe Chen, Jianxin Wu, Hanli Wang, Bin Sheng, Hongxiang Li

Based on this network, we further model people in the scene as packages while human activities can be modeled as the process of package transmission in the network.

Activity Detection Activity Recognition In Videos +2

Paper
Add Code

Intra-and-Inter-Constraint-based Video Enhancement based on Piecewise Tone Mapping

no code implementations • 21 Feb 2015 • Yuanzhe Chen, Weiyao Lin, Chongyang Zhang, Zhenzhong Chen, Ning Xu, Jun Xie

In this paper, we propose a new intra-and-inter-constraint-based video enhancement approach aiming to 1) achieve high intra-frame quality of the entire picture where multiple region-of-interests (ROIs) can be adaptively and simultaneously enhanced, and 2) guarantee the inter-frame quality consistencies among video frames.

Tone Mapping Video Enhancement

Paper
Add Code

A Heat-Map-based Algorithm for Recognizing Group Activities in Videos

no code implementations • 21 Feb 2015 • Weiyao Lin, Hang Chu, Jianxin Wu, Bin Sheng, Zhenzhong Chen

In this paper, a new heat-map-based (HMB) algorithm is proposed for group activity recognition.

Group Activity Recognition

Paper
Add Code

Improved Image Deblurring based on Salient-region Segmentation

no code implementations • 28 Feb 2015 • Chongyang Zhang, Weiyao Lin, Wei Li, Bing Zhou, Jun Xie, Jijia Li

Image deblurring techniques play important roles in many image processing applications.

Deblurring Image Deblurring +1

Paper
Add Code

Macroblock Classification Method for Video Applications Involving Motions

no code implementations • 28 Feb 2015 • Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou

We demonstrate that this low-computation-complexity method can efficiently catch the characteristics of the frame.

Change Detection Classification +2

Paper
Add Code

Activity Recognition Using A Combination of Category Components And Local Models for Video Surveillance

no code implementations • 28 Feb 2015 • Weiyao Lin, Ming-Ting Sun, Radha Poovendran, Zhengyou Zhang

This paper presents a novel approach for automatic recognition of human activities for video surveillance applications.

Activity Recognition

Paper
Add Code

Group Event Detection with a Varying Number of Group Members for Video Surveillance

no code implementations • 28 Feb 2015 • Weiyao Lin, Ming-Ting Sun, Radha Poovendran, Zhengyou Zhang

This paper presents a novel approach for automatic recognition of group activities for video surveillance applications.

Action Detection Activity Detection +1

Paper
Add Code

Deep Spatial Pyramid: The Devil is Once Again in the Details

no code implementations • 21 Apr 2015 • Bin-Bin Gao, Xiu-Shen Wei, Jianxin Wu, Weiyao Lin

In this paper we show that by carefully making good choices for various detailed but important factors in a visual recognition framework using deep learning features, one can achieve a simple, efficient, yet highly accurate image classification system.

General Classification Image Classification

Paper
Add Code

Person Re-identification with Correspondence Structure Learning

1 code implementation • ICCV 2015 • Yang Shen, Weiyao Lin, Junchi Yan, Mingliang Xu, Jianxin Wu, Jingdong Wang

This paper addresses the problem of handling spatial misalignments due to camera-view changes or human-pose variations in person re-identification.

Patch Matching Person Re-Identification

Paper
Code

Tree-based Visualization and Optimization for Image Collection

no code implementations • 17 Jul 2015 • Xintong Han, Chongyang Zhang, Weiyao Lin, Mingliang Xu, Bin Sheng, Tao Mei

The visualization of an image collection is the process of displaying a collection of images on a screen under some specific layout requirements.

Paper
Add Code

Unsupervised Trajectory Clustering via Adaptive Multi-Kernel-Based Shrinkage

no code implementations • ICCV 2015 • Hongteng Xu, Yang Zhou, Weiyao Lin, Hongyuan Zha

Facing to the challenges of trajectory clustering, e. g., large variations within a cluster and ambiguities across clusters, we first introduce an adaptive multi-kernel-based estimation process to estimate the `shrunk' positions and speeds of trajectories' points.

Anomaly Detection Clustering +1

Paper
Add Code

RIDE: Reversal Invariant Descriptor Enhancement

no code implementations • ICCV 2015 • Lingxi Xie, Jingdong Wang, Weiyao Lin, Bo Zhang, Qi Tian

In many fine-grained object recognition datasets, image orientation (left/right) might vary from sample to sample.

Object Recognition

Paper
Add Code

A diffusion and clustering-based approach for finding coherent motions and understanding crowd scenes

no code implementations • 16 Feb 2016 • Weiyao Lin, Yang Mi, Weiyue Wang, Jianxin Wu, Jingdong Wang, Tao Mei

These semantic regions can be used to recognize pre-defined activities in crowd scenes.

Clustering Optical Flow Estimation +1

Paper
Add Code

Fractal Dimension Invariant Filtering and Its CNN-based Implementation

no code implementations • CVPR 2017 • Hongteng Xu, Junchi Yan, Nils Persson, Weiyao Lin, Hongyuan Zha

By adding a nonlinear post-processing step behind anisotropic filter banks, we demonstrate that the proposed filtering method is capable of preserving the local invariance of the fractal dimension of image.

Texture Classification

Paper
Add Code

Picking Deep Filter Responses for Fine-Grained Image Recognition

no code implementations • CVPR 2016 • Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, Qi Tian

Recognizing fine-grained sub-categories such as birds and dogs is extremely challenging due to the highly localized and subtle differences in some specific parts.

Fine-Grained Image Recognition

Paper
Add Code

A Tube-and-Droplet-based Approach for Representing and Analyzing Motion Trajectories

no code implementations • 10 Sep 2016 • Weiyao Lin, Yang Zhou, Hongteng Xu, Junchi Yan, Mingliang Xu, Jianxin Wu, Zicheng Liu

Our approach first leverages the complete information from given trajectories to construct a thermal transfer field which provides a context-rich way to describe the global motion pattern in a scene.

3D Action Recognition Anomaly Detection +2

Paper
Add Code

Motion Segmentation via Global and Local Sparse Subspace Optimization

no code implementations • 24 Jan 2017 • Michael Ying Yang, Hanno Ackermann, Weiyao Lin, Sitong Feng, Bodo Rosenhahn

In this paper, we propose a new framework for segmenting feature-based moving objects under affine subspace model.

Clustering Motion Segmentation +1

Paper
Add Code

Learning Correspondence Structures for Person Re-identification

no code implementations • 20 Mar 2017 • Weiyao Lin, Yang shen, Junchi Yan, Mingliang Xu, Jianxin Wu, Jingdong Wang, Ke Lu

We first introduce a boosting-based approach to learn a correspondence structure which indicates the patch-wise matching probabilities between images from a target camera pair.

Patch Matching Person Re-Identification

Paper
Add Code

Ensemble of Part Detectors for Simultaneous Classification and Localization

no code implementations • 29 May 2017 • Xiaopeng Zhang, Hongkai Xiong, Weiyao Lin, Qi Tian

Part-based representation has been proven to be effective for a variety of visual applications.

Classification Clustering +4

Paper
Add Code

ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression

no code implementations • ICCV 2017 • Jian-Hao Luo, Jianxin Wu, Weiyao Lin

Similar experiments with ResNet-50 reveal that even for a compact network, ThiNet can also reduce more than half of the parameters and FLOPs, at the cost of roughly 1$\%$ top-5 accuracy drop.

Neural Network Compression

Paper
Add Code

Kill Two Birds With One Stone: Boosting Both Object Detection Accuracy and Speed With adaptive Patch-of-Interest Composition

no code implementations • 12 Aug 2017 • Shihao Zhang, Weiyao Lin, Ping Lu, Weihua Li, Shuo Deng

Object detection is an important yet challenging task in video understanding & analysis, where one major challenge lies in the proper balance between two contradictive factors: detection accuracy and detection speed.

Object object-detection +2

Paper
Add Code

Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion

no code implementations • 20 Nov 2017 • Weiyao Lin, Yang Mi, Jianxin Wu, Ke Lu, Hongkai Xiong

In this paper, we propose a novel deep-based framework for action recognition, which improves the recognition accuracy by: 1) deriving more precise features for representing actions, and 2) reducing the asynchrony between different information streams.

Action Recognition Temporal Action Localization

Paper
Add Code

Unsupervised Deep Domain Adaptation for Pedestrian Detection

no code implementations • 9 Feb 2018 • Lihang Liu, Weiyao Lin, Lisheng Wu, Yong Yu, Michael Ying Yang

This paper addresses the problem of unsupervised domain adaptation on the task of pedestrian detection in crowded scenes.

Pedestrian Detection Unsupervised Domain Adaptation

Paper
Add Code

Deep Neural Network Compression with Single and Multiple Level Quantization

1 code implementation • 6 Mar 2018 • Yuhui Xu, Yongzhuang Wang, Aojun Zhou, Weiyao Lin, Hongkai Xiong

In this paper, we propose two novel network quantization approaches, single-level network quantization (SLQ) for high-bit quantization and multi-level network quantization (MLQ) for extremely low-bit quantization (ternary). We are the first to consider the network quantization from both width and depth level.

Neural Network Compression Quantization

Paper
Code

Enhancing HEVC Compressed Videos with a Partition-masked Convolutional Neural Network

no code implementations • 10 May 2018 • Xiaoyi He, Qiang Hu, Xintong Han, Xiaoyun Zhang, Chongyang Zhang, Weiyao Lin

In this paper, we propose a partition-masked Convolution Neural Network (CNN) to achieve compressed-video enhancement for the state-of-the-art coding standard, High Efficiency Video Coding (HECV).

Multimedia

Paper
Add Code

Enriched Long-term Recurrent Convolutional Network for Facial Micro-Expression Recognition

2 code implementations • 22 May 2018 • Huai-Qian Khor, John See, Raphael C. -W. Phan, Weiyao Lin

Facial micro-expression (ME) recognition has posed a huge challenge to researchers for its subtlety in motion and limited databases.

Data Augmentation Micro Expression Recognition +2

260

Paper
Code

Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages

1 code implementation • 29 Jul 2018 • Yuxi Li, Jiuwei Li, Weiyao Lin, Jianguo Li

Based on the deeply supervised object detection (DSOD) framework, we propose Tiny-DSOD dedicating to resource-restricted usages.

Object object-detection +1

229

Paper
Code

Network Decoupling: From Regular to Depthwise Separable Convolutions

1 code implementation • 16 Aug 2018 • Jianbo Guo, Yuxi Li, Weiyao Lin, Yurong Chen, Jianguo Li

Depthwise separable convolution has shown great efficiency in network design, but requires time-consuming training procedure with full training-set available.

object-detection Object Detection

Paper
Code

Trained Rank Pruning for Efficient Deep Neural Networks

1 code implementation • 6 Dec 2018 • Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

We propose Trained Rank Pruning (TRP), which iterates low rank approximation and training.

Quantization

Paper
Code

DNQ: Dynamic Network Quantization

no code implementations • 6 Dec 2018 • Yuhui Xu, Shuai Zhang, Yingyong Qi, Jiaxian Guo, Weiyao Lin, Hongkai Xiong

Network quantization is an effective method for the deployment of neural networks on memory and energy constrained mobile devices.

Quantization

Paper
Add Code

Towards Accurate One-Stage Object Detection with AP-Loss

1 code implementation • CVPR 2019 • Kean Chen, Jianguo Li, Weiyao Lin, John See, Ji Wang, Ling-Yu Duan, Zhibo Chen, Changwei He, Junni Zou

For this purpose, we develop a novel optimization algorithm, which seamlessly combines the error-driven update scheme in perceptron learning and backpropagation algorithm in deep networks.

Classification General Classification +3

173

Paper
Code

Group Re-Identification with Multi-grained Matching and Integration

no code implementations • 17 May 2019 • Weiyao Lin, Yuxi Li, Hao Xiao, John See, Junni Zou, Hongkai Xiong, Jingdong Wang, Tao Mei

The task of re-identifying groups of people underdifferent camera views is an important yet less-studied problem. Group re-identification (Re-ID) is a very challenging task sinceit is not only adversely affected by common issues in traditionalsingle object Re-ID problems such as viewpoint and human posevariations, but it also suffers from changes in group layout andgroup membership.

Paper
Add Code

ATRW: A Benchmark for Amur Tiger Re-identification in the Wild

1 code implementation • 13 Jun 2019 • Shuyuan Li, Jianguo Li, Hanlin Tang, Rui Qian, Weiyao Lin

This paper tries to fill the gap by introducing a novel large-scale dataset, the Amur Tiger Re-identification in the Wild (ATRW) dataset.

Paper
Code

Dual-stream shallow networks for facial micro-expression recognition

2 code implementations • 2019 IEEE International Conference on Image Processing (ICIP) 2019 • Huai-Qian Khor, John See, Sze-Teng Liong, Raphael C. -W. Phan, Weiyao Lin

Micro-expressions are spontaneous, brief and subtle facial muscle movements that exposes underlying emotions.

Micro Expression Recognition Micro-Expression Recognition

Paper
Code

Trained Rank Pruning for Efficient Deep Neural Networks

1 code implementation • 9 Oct 2019 • Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Wenrui Dai, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

To accelerate DNNs inference, low-rank approximation has been widely adopted because of its solid theoretical rationale and efficient implementations.

Paper
Code

Spatial-Temporal Transformer Networks for Traffic Flow Forecasting

1 code implementation • 9 Jan 2020 • Mingxing Xu, Wenrui Dai, Chunmiao Liu, Xing Gao, Weiyao Lin, Guo-Jun Qi, Hongkai Xiong

In this paper, we propose a novel paradigm of Spatial-Temporal Transformer Networks (STTNs) that leverages dynamical directed spatial dependencies and long-range temporal dependencies to improve the accuracy of long-term traffic forecasting.

Traffic Prediction

Paper
Code

TRP: Trained Rank Pruning for Efficient Deep Neural Networks

1 code implementation • 30 Apr 2020 • Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

The TRP trained network inherently has a low-rank structure, and is approximated with negligible performance loss, thus eliminating the fine-tuning process after low rank decomposition.

Paper
Code

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events

no code implementations • 9 May 2020 • Weiyao Lin, Huabin Liu, Shizhan Liu, Yuxi Li, Rui Qian, Tao Wang, Ning Xu, Hongkai Xiong, Guo-Jun Qi, Nicu Sebe

To this end, we present a new large-scale dataset with comprehensive annotations, named Human-in-Events or HiEve (Human-centric video analysis in complex Events), for the understanding of human motions, poses, and actions in a variety of realistic events, especially in crowd & complex events.

Action Recognition Pose Estimation

Paper
Add Code

Multiple Sound Sources Localization from Coarse to Fine

1 code implementation • ECCV 2020 • Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin

How to visually localize multiple sound sources in unconstrained videos is a formidable problem, especially when lack of the pairwise sound-object annotations.

Paper
Code

PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments

1 code implementation • ECCV 2020 • Zhiming Chen, Kean Chen, Weiyao Lin, John See, Hui Yu, Yan Ke, Cong Yang

The experimental results show that PIoU loss can dramatically improve the performance of OBB detectors, particularly on objects with high aspect ratios and complex backgrounds.

Ranked #2 on One-stage Anchor-free Oriented Object Detection on HRSC2016

object-detection Object Detection In Aerial Images +2

191

Paper
Code

AP-Loss for Accurate One-Stage Object Detection

1 code implementation • 17 Aug 2020 • Kean Chen, Weiyao Lin, Jianguo Li, John See, Ji Wang, Junni Zou

This paper alleviates this issue by proposing a novel framework to replace the classification task in one-stage detectors with a ranking task, and adopting the Average-Precision loss (AP-loss) for the ranking problem.

Classification General Classification +3

173

Paper
Code

CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization

no code implementations • ECCV 2020 • Yuxi Li, Weiyao Lin, John See, Ning Xu, Shugong Xu, Ke Yan, Cong Yang

Most current pipelines for spatio-temporal action localization connect frame-wise or clip-wise detection results to generate action proposals, where only local information is exploited and the efficiency is hindered by dense per-frame localization.

Action Detection Spatio-Temporal Action Localization +1

Paper
Add Code

Finding Action Tubes with a Sparse-to-Dense Framework

no code implementations • 30 Aug 2020 • Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Li-Min Wang, Shugong Xu

The task of spatial-temporal action detection has attracted increasing attention among researchers.

Ranked #3 on Action Detection on UCF Sports (Video-mAP 0.2 metric)

Action Detection

Paper
Add Code

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

1 code implementation • NeurIPS 2020 • Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou

First, we propose to learn robust object representations by aggregating the candidate sound localization results in the single source scenes.

Object Object Localization

Paper
Code

Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation

1 code implementation • NeurIPS 2020 • Yuxi Li, Ning Xu, Jinlong Peng, John See, Weiyao Lin

In this paper, we address several inadequacies of current video object segmentation pipelines.

Ranked #44 on Semi-Supervised Video Object Segmentation on YouTube-VOS 2018

Object One-shot visual object segmentation +3

113

Paper
Code

Multi-Level Curriculum for Training a Distortion-Aware Barrel Distortion Rectification Model

no code implementations • ICCV 2021 • Kang Liao, Chunyu Lin, Lixin Liao, Yao Zhao, Weiyao Lin

In this paper, inspired by the curriculum learning, we analyze the barrel distortion rectification task in a progressive and meaningful manner.

Paper
Add Code

Variational Pedestrian Detection

no code implementations • CVPR 2021 • Yuang Zhang, Huanyu He, Jianguo Li, Yuxi Li, John See, Weiyao Lin

Pedestrian detection in a crowd is a challenging task due to a high number of mutually-occluding human instances, which brings ambiguity and optimization difficulties to the current IoU-based ground truth assignment procedure in classical object detection methods.

object-detection Object Detection +2

Paper
Add Code

SiamRCR: Reciprocal Classification and Regression for Visual Object Tracking

no code implementations • 24 May 2021 • Jinlong Peng, Zhengkai Jiang, Yueyang Gu, Yang Wu, Yabiao Wang, Ying Tai, Chengjie Wang, Weiyao Lin

In addition, we add a localization branch to predict the localization accuracy, so that it can work as the replacement of the regression assistance link during inference.

Classification Object +2

Paper
Add Code

TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition

1 code implementation • 10 Jul 2021 • Shuyuan Li, Huabin Liu, Rui Qian, Yuxi Li, John See, Mengjuan Fei, Xiaoyuan Yu, Weiyao Lin

The first stage locates the action by learning a temporal affine transform, which warps each video feature to its action duration while dismissing the action-irrelevant feature (e. g. background).

Few-Shot action recognition Few Shot Action Recognition +2

Paper
Code

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization

1 code implementation • ICCV 2021 • Rui Qian, Yuxi Li, Huabin Liu, John See, Shuangrui Ding, Xian Liu, Dian Li, Weiyao Lin

The crux of self-supervised video representation learning is to build general features from unlabeled videos.

Contrastive Learning Representation Learning +1

Paper
Code

Pyraformer: Low-Complexity Pyramidal Attention for Long-Range Time Series Modeling and Forecasting

2 code implementations • ICLR 2022 • Shizhan Liu, Hang Yu, Cong Liao, Jianguo Li, Weiyao Lin, Alex X. Liu, Schahram Dustdar

Accurate prediction of the future given the past based on time series data is of paramount importance, since it opens the door for decision making and risk management ahead of time.

Decision Making Management +2

215

Paper
Code

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

1 code implementation • 19 Oct 2021 • Yuxi Li, Boshen Zhang, Jian Li, Yabiao Wang, Weiyao Lin, Chengjie Wang, Jilin Li, Feiyue Huang

We demonstrate that both temporal grains are beneficial to atomic action recognition.

Action Detection Atomic action recognition

Paper
Code

Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic Perspective

1 code implementation • 2 Nov 2021 • Yuxi Li, Ning Xu, Wenjie Yang, John See, Weiyao Lin

We conduct comprehensive comparison and detailed analysis on challenging benchmarks of DAVIS16, DAVIS17 and Youtube-VOS, demonstrating that the cyclic mechanism is helpful to enhance segmentation quality, improve the robustness of VOS systems, and further provide qualitative comparison and interpretation on how different VOS algorithms work.

Segmentation Semantic Segmentation +2

113

Paper
Code

Class-aware Sounding Objects Localization via Audiovisual Correspondence

1 code implementation • 22 Dec 2021 • Di Hu, Yake Wei, Rui Qian, Weiyao Lin, Ruihua Song, Ji-Rong Wen

To address this problem, we propose a two-stage step-by-step learning framework to localize and recognize sounding objects in complex audiovisual scenarios using only the correspondence between audio and vision.

Object object-detection +3

Paper
Code

Speed Up Object Detection on Gigapixel-Level Images With Patch Arrangement

no code implementations • CVPR 2022 • Jiahao Fan, Huabin Liu, Wenjie Yang, John See, Aixin Zhang, Weiyao Lin

With the appearance of super high-resolution (e. g., gigapixel-level) images, performing efficient object detection on such images becomes an important issue.

Object object-detection +1

Paper
Add Code

Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training

no code implementations • 11 Jan 2022 • Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei

Vision-language pre-training has been an emerging and fast-developing research topic, which transfers multi-modal knowledge from rich-resource pre-training task to limited-resource downstream tasks.

Image Captioning Language Modelling +3

Paper
Add Code

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing

1 code implementation • 13 Feb 2022 • Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou

Specifically, we observe that the previous practice of learning only a single audio representation is insufficient due to the additive nature of audio signals.

Paper
Code

End-to-end video instance segmentation via spatial-temporal graph neural networks

1 code implementation • ICCV 2021 • Tao Wang, Ning Xu, Kean Chen, Weiyao Lin

Specifically, graph nodes representing instance features are used for detection and segmentation while graph edges representing instance relations are used for tracking.

Instance Segmentation Segmentation +2

Paper
Code

Controllable Augmentations for Video Representation Learning

no code implementations • 30 Mar 2022 • Rui Qian, Weiyao Lin, John See, Dian Li

The major reason is that the positive pairs, i. e., different clips sampled from the same video, have limited temporal receptive field, and usually share similar background but differ in motions.

Action Recognition Contrastive Learning +3

Paper
Add Code

FRIH: Fine-grained Region-aware Image Harmonization

no code implementations • 13 May 2022 • Jinlong Peng, Zekun Luo, Liang Liu, Boshen Zhang, Tao Wang, Yabiao Wang, Ying Tai, Chengjie Wang, Weiyao Lin

Image harmonization aims to generate a more realistic appearance of foreground and background for a composite image.

Image Harmonization

Paper
Add Code

Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition

1 code implementation • 20 Jul 2022 • Huabin Liu, Weixian Lv, John See, Weiyao Lin

In this paper, we propose a novel video frame sampler for few-shot action recognition to address this issue, where task-specific spatial-temporal frame sampling is achieved via a temporal selector (TS) and a spatial amplifier (SA).

Few-Shot action recognition Few Shot Action Recognition

Paper
Code

The 1st-place Solution for ECCV 2022 Multiple People Tracking in Group Dance Challenge

3 code implementations • 27 Oct 2022 • Yuang Zhang, Tiancai Wang, Weiyao Lin, Xiangyu Zhang

We present our 1st place solution to the Group Dance Multiple People Tracking Challenge.

Multi-Object Tracking Multiple Object Tracking with Transformer +1

326

Paper
Code

Low-Rank Winograd Transformation for 3D Convolutional Neural Networks

no code implementations • 26 Jan 2023 • Ziran Qin, Mingbao Lin, Weiyao Lin

This paper focuses on Winograd transformation in 3D convolutional neural networks (CNNs) that are more over-parameterized compared with the 2D version.

Paper
Add Code

Spatio-Temporal Point Process for Multiple Object Tracking

no code implementations • 5 Feb 2023 • Tao Wang, Kean Chen, Weiyao Lin, John See, Zenghui Zhang, Qian Xu, Xia Jia

As such, we propose a novel framework that can effectively predict and mask-out the noisy and confusing detection results before associating the objects into trajectories.

Multiple Object Tracking Object

Paper
Add Code

Few-shot Action Recognition via Intra- and Inter-Video Information Maximization

no code implementations • 10 May 2023 • Huabin Liu, Weiyao Lin, Tieyuan Chen, Yuxi Li, Shuyuan Li, John See

The alignment model performs temporal and spatial action alignment sequentially at the feature level, leading to more precise measurements of inter-video similarity.

Few-Shot action recognition Few Shot Action Recognition +2

Paper
Add Code

BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable Basis

1 code implementation • NeurIPS 2023 • Zelin Ni, Hang Yu, Shizhan Liu, Jianguo Li, Weiyao Lin

Bases have become an integral part of modern deep learning-based models for time series forecasting due to their ability to act as feature extractors or future references.

Contrastive Learning Self-Supervised Learning +2

Paper
Code

Density Matters: Improved Core-set for Active Domain Adaptive Segmentation

no code implementations • 15 Dec 2023 • Shizhan Liu, Zhengkai Jiang, Yuxi Li, Jinlong Peng, Yabiao Wang, Weiyao Lin

Active domain adaptation has emerged as a solution to balance the expensive annotation cost and the performance of trained models in semantic segmentation.

Domain Adaptation Semantic Segmentation

Paper
Add Code

Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video Analysis

no code implementations • 18 Dec 2023 • Tianyao He, Huabin Liu, Yuxi Li, Xiao Ma, Cheng Zhong, Yang Zhang, Weiyao Lin

Our framework comprises two core modules: collaborative step mining and frame-to-step alignment.

Action Quality Assessment Procedure Learning

Paper
Add Code

HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

1 code implementation • 21 Mar 2024 • Yihang Chen, Qianyi Wu, Jianfei Cai, Mehrtash Harandi, Weiyao Lin

3D Gaussian Splatting (3DGS) has emerged as a promising framework for novel view synthesis, boasting rapid rendering speed with high fidelity.

Attribute Novel View Synthesis +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.