Search Results for author: Xinyu Li

Found 26 papers, 5 papers with code

A new neighborhood structure for job shop scheduling problems

no code implementations 7 Sep 2021 Jin Xie, Xinyu Li, Liang Gao, Lin Gui

According to the above finding, this paper proposes a new N8 neighborhood structure considering the movement of critical operations within a critical block and the movement of critical operations outside the critical block.

Combinatorial Optimization

Object Wake-up: 3-D Object Reconstruction, Animation, and in-situ Rendering from a Single Image

no code implementations 5 Aug 2021 Xinxin Zuo, Ji Yang, Sen Wang, Zhenbo Yu, Xinyu Li, Bingbing Ni, Minglun Gong, Li Cheng

The pipeline of our approach starts by reconstructing and refining a 3-D mesh representation of the object of interest from an input image; its control joints are predicted by exploiting the semantic part segmentation information; the obtained object 3-D mesh is then rigged and animated by non-rigid deformation, and rendered to perform in-situ motions in its original image space.

Object Reconstruction

Long Short-Term Transformer for Online Action Detection

1 code implementation NeurIPS 2021 Mingze Xu, Yuanjun Xiong, Hao Chen, Xinyu Li, Wei Xia, Zhuowen Tu, Stefano Soatto

We present Long Short-term TRansformer (LSTR), a temporal modeling algorithm for online action detection, which employs a long- and short-term memory mechanism to model prolonged sequence data.

Action Detection

SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation

no code implementations 29 May 2021 Zhe Wang, Hao Chen, Xinyu Li, Chunhui Liu, Yuanjun Xiong, Joseph Tighe, Charless Fowlkes

However, it is quite expensive to annotate every frame in a large corpus of videos to construct a comprehensive supervised training dataset.

Action Parsing, Action Segmentation, +2

VidTr: Video Transformer Without Convolutions

no code implementations ICCV 2021 Yanyi Zhang, Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Biagio Brattoli, Hao Chen, Ivan Marsic, Joseph Tighe

We first introduce the vanilla video transformer and show that the transformer module is able to perform spatio-temporal modeling from raw pixels, but with heavy memory usage.

Action Classification, Action Recognition

TubeR: Tube-Transformer for Action Detection

no code implementations 2 Apr 2021 Jiaojiao Zhao, Xinyu Li, Chunhui Liu, Shuai Bing, Hao Chen, Cees G. M. Snoek, Joseph Tighe

In this paper, we propose TubeR: the first transformer-based network for end-to-end action detection, with an encoder and decoder optimized for modeling action tubes with variable lengths and aspect ratios.

Action Detection, Video Understanding

Selective Feature Compression for Efficient Activity Recognition Inference

no code implementations ICCV 2021 Chunhui Liu, Xinyu Li, Hao Chen, Davide Modolo, Joseph Tighe

In this work, we focus on improving the inference efficiency of current action recognition backbones on trimmed videos, and illustrate that one action model can also cover the informative region by dropping non-informative features.

Action Recognition

Learning-based Prediction and Uplink Retransmission for Wireless Virtual Reality (VR) Network

no code implementations 16 Dec 2020 Xiaonan Liu, Xinyu Li, Yansha Deng

For the online learning algorithm, we compare the VR user's actual viewpoint, delivered through uplink transmission, with the predicted viewpoint and update the parameters of the online learning algorithm to further improve the prediction accuracy.

Virtual Reality

NUTA: Non-uniform Temporal Aggregation for Action Recognition

no code implementations 15 Dec 2020 Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Hao Chen, Joseph Tighe

In the world of action recognition research, one primary focus has been on how to construct and train networks to model the spatial-temporal volume of an input video.

Action Recognition

Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations

no code implementations CVPR 2021 Yanyi Zhang, Xinyu Li, Ivan Marsic

Multi-label activity recognition is designed for recognizing multiple activities that are performed simultaneously or sequentially in each video.

Activity Recognition, Video Classification

Directional Temporal Modeling for Action Recognition

no code implementations ECCV 2020 Xinyu Li, Bing Shuai, Joseph Tighe

Many current activity recognition models use 3D convolutional neural networks (e.g., I3D, I3D-NL) to generate local spatial-temporal features.

Action Recognition

Vortices and waves in light dark matter

no code implementations 2 Apr 2020 Lam Hui, Austin Joyce, Michael J. Landry, Xinyu Li

(4) The density near a vortex scales as $r^2$ while the velocity goes as $1/r$, where $r$ is the distance to the vortex.

Cosmology and Nongalactic Astrophysics, Astrophysics of Galaxies, General Relativity and Quantum Cosmology, High Energy Physics - Phenomenology, High Energy Physics - Theory

BERTSel: Answer Selection with Pre-trained Models

1 code implementation 18 May 2019 Dongfang Li, Yifei Yu, Qingcai Chen, Xinyu Li

We are the first to explore the performance of fine-tuning BERT for answer selection.

Answer Selection, Fine-tuning, +1

Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks

no code implementations 20 Apr 2019 Xinyu Li, Wei Zhang, Tong Shen, Tao Mei

Selfies and cartoons are two popular artistic forms that are widely present in our daily lives.

Translation

Differential Evolution with Better and Nearest Option for Function Optimization

no code implementations 29 Oct 2018 Haozhen Dong, Liang Gao, Xinyu Li, Haoran Zhong, Bing Zeng

Differential evolution (DE) is a conventional algorithm with fast convergence speed.

Hybrid Attention based Multimodal Network for Spoken Language Classification

no code implementations COLING 2018 Yue Gu, Kangning Yang, Shiyu Fu, Shuhong Chen, Xinyu Li, Ivan Marsic

The proposed hybrid attention architecture helps the system focus on learning informative representations for both modality-specific feature extraction and model fusion.

Classification, Emotion Recognition, +5

Multimodal Affective Analysis Using Hierarchical Attention Strategy with Word-Level Alignment

no code implementations ACL 2018 Yue Gu, Kangning Yang, Shiyu Fu, Shuhong Chen, Xinyu Li, Ivan Marsic

Multimodal affective computing, learning to recognize and interpret human affects and subjective information from multiple data sources, is still challenging because: (i) it is hard to extract informative features to represent human affects from heterogeneous inputs; (ii) current fusion strategies only fuse different modalities at an abstract level, ignoring time-dependent interactions between modalities.

Whale swarm algorithm with the mechanism of identifying and escaping from extreme points for multimodal function optimization

no code implementations 9 Apr 2018 Bing Zeng, Xinyu Li, Liang Gao, Yuyan Zhang, Haozhen Dong

However, two difficulties urgently need to be solved for most existing niching metaheuristic algorithms: how to set the optimal values of niching parameters for different optimization problems, and how to jump out of local optima efficiently.

Progress Estimation and Phase Detection for Sequential Processes

no code implementations 28 Feb 2017 Xinyu Li, Yanyi Zhang, Jianyu Zhang, Yueyang Chen, Shuhong Chen, Yue Gu, Moliang Zhou, Richard A. Farneth, Ivan Marsic, Randall S. Burd

For the Olympic swimming dataset, our system achieved an accuracy of 88%, an F1-score of 0.58, a completeness estimation error of 6.3% and a remaining-time estimation error of 2.9 minutes.

Activity Recognition, Multimodal Deep Learning

Whale swarm algorithm for function optimization

no code implementations 11 Feb 2017 Bing Zeng, Liang Gao, Xinyu Li

An increasing number of nature-inspired metaheuristic algorithms are applied to solving real-world optimization problems, as they have some advantages over the classical methods of numerical optimization.

Online People Tracking and Identification with RFID and Kinect

no code implementations 10 Feb 2017 Xinyu Li, Yanyi Zhang, Ivan Marsic, Randall S. Burd

We introduce a novel, accurate and practical system for real-time people tracking and identification.

Concurrent Activity Recognition with Multimodal CNN-LSTM Structure

no code implementations 6 Feb 2017 Xinyu Li, Yanyi Zhang, Jianyu Zhang, Shuhong Chen, Ivan Marsic, Richard A. Farneth, Randall S. Burd

Our system is the first to address concurrent activity recognition with multisensory data using a single model, which is scalable, simple to train and easy to deploy.

Concurrent Activity Recognition, Decision Making
