Search Results for author: Guo-Jun Qi

Found 76 papers, 27 papers with code

PoseAnimate: Zero-shot high fidelity pose controllable character animation

no code implementations21 Apr 2024 Bingwen Zhu, Fanyi Wang, Tianyi Lu, Peng Liu, Jingwen Su, Jinxiu Liu, Yanhao Zhang, Zuxuan Wu, Yu-Gang Jiang, Guo-Jun Qi

Image-to-video(I2V) generation aims to create a video sequence from a single image, which requires high temporal coherence and visual fidelity with the source image. However, existing approaches suffer from character appearance inconsistency and poor preservation of fine details.

BARET : Balanced Attention based Real image Editing driven by Target-text Inversion

no code implementations9 Dec 2023 Yuming Qiao, Fanyi Wang, Jingwen Su, Yanhao Zhang, Yunjie Yu, Siyu Wu, Guo-Jun Qi

Image editing approaches with diffusion models have been rapidly developed, yet their applicability are subject to requirements such as specific editing types (e. g., foreground or background object editing, style transfer), multiple conditions (e. g., mask, sketch, caption), and time consuming fine-tuning of diffusion models.

Image Reconstruction Style Transfer

OmniMotionGPT: Animal Motion Generation with Limited Data

no code implementations30 Nov 2023 Zhangsihao Yang, Mingyuan Zhou, Mengyi Shan, Bingbing Wen, Ziwei Xuan, Mitch Hill, Junjie Bai, Guo-Jun Qi, Yalin Wang

Our paper aims to generate diverse and realistic animal motion sequences from textual descriptions, without a large-scale animal text-motion dataset.

Motion Synthesis

Exploring the Robustness of Human Parsers Towards Common Corruptions

no code implementations2 Sep 2023 Sanyi Zhang, Xiaochun Cao, Rui Wang, Guo-Jun Qi, Jie zhou

The experimental results show that the proposed method demonstrates good universality which can improve the robustness of the human parsing models and even the semantic segmentation models when facing various image common corruptions.

Data Augmentation Human Parsing +1

Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver

no code implementations ICCV 2023 Xianpeng Liu, Ce Zheng, Kelvin Cheng, Nan Xue, Guo-Jun Qi, Tianfu Wu

Motivated by a new and strong observation that this challenge can be remedied by a 3D-space local-grid search scheme in an ideal case, we propose a stage-wise approach, which combines the information flow from 2D-to-3D (3D bounding box proposal generation with a single 2D image) and 3D-to-2D (proposal verification by denoising with 3D-to-2D contexts) in a top-down manner.

Denoising Monocular 3D Object Detection +1

POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery

1 code implementation CVPR 2023 Ce Zheng, Xianpeng Liu, Guo-Jun Qi, Chen Chen

In this paper, we propose a pure transformer architecture named POoling aTtention TransformER (POTTER) for the HMR task from single images.

3D Human Pose Estimation Human Mesh Recovery

DiffMesh: A Motion-aware Diffusion-like Framework for Human Mesh Recovery from Videos

no code implementations23 Mar 2023 Ce Zheng, Xianpeng Liu, Mengyuan Liu, Tianfu Wu, Guo-Jun Qi, Chen Chen

While image-based HMR methods have achieved impressive results, they often struggle to recover humans in dynamic scenarios, leading to temporal inconsistencies and non-smooth 3D motion predictions due to the absence of human motion.

3D Human Pose Estimation Human Mesh Recovery

AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+

1 code implementation14 Mar 2023 Xiao Wang, Ying Wang, Ziwei Xuan, Guo-Jun Qi

A criterion in unsupervised pretraining is the pretext task needs to be sufficiently hard to prevent the transformer encoder from learning trivial low-level features not generalizable well to downstream tasks.

Transfer Learning

Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search

no code implementations ICCV 2023 Benzhi Wang, Yang Yang, Jinlin Wu, Guo-Jun Qi, Zhen Lei

On the other hand, the similarity of cross-scale images is often smaller than that of images with the same scale for a person, which will increase the difficulty of matching.

Person Search

MorphGANFormer: Transformer-based Face Morphing and De-Morphing

no code implementations18 Feb 2023 Na Zhang, Xudong Liu, Xin Li, Guo-Jun Qi

Semantic face image manipulation has received increasing attention in recent years.

Image Manipulation

Efficient Image Super-Resolution with Feature Interaction Weighted Hybrid Network

no code implementations29 Dec 2022 Wenjie Li, Juncheng Li, Guangwei Gao, Weihong Deng, Jian Yang, Guo-Jun Qi, Chia-Wen Lin

Recently, great progress has been made in single-image super-resolution (SISR) based on deep learning technology.

Image Super-Resolution

Adversarial Pretraining of Self-Supervised Deep Networks: Past, Present and Future

no code implementations23 Oct 2022 Guo-Jun Qi, Mubarak Shah

In this paper, we review adversarial pretraining of self-supervised deep networks including both convolutional neural networks and vision transformers.

Contrastive Learning Miscellaneous

Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection

1 code implementation5 Aug 2022 Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Renrui Zhang, Zenghui Zhang, Tatsuya Harada

Image restoration algorithms such as super resolution (SR) are indispensable pre-processing modules for object detection in low quality images.

Image Restoration Object +4

AIParsing: Anchor-free Instance-level Human Parsing

no code implementations14 Jul 2022 Sanyi Zhang, Xiaochun Cao, Guo-Jun Qi, Zhanjie Song, Jie zhou

Most state-of-the-art instance-level human parsing models adopt two-stage anchor-based detectors and, therefore, cannot avoid the heuristic anchor box design and the lack of analysis on a pixel level.

Human Parsing object-detection +1

Cross-receptive Focused Inference Network for Lightweight Image Super-Resolution

1 code implementation6 Jul 2022 Wenjie Li, Juncheng Li, Guangwei Gao, Jiantao Zhou, Jian Yang, Guo-Jun Qi

Recently, Transformer-based methods have shown impressive performance in single image super-resolution (SISR) tasks due to the ability of global feature extraction.

Image Super-Resolution

FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER

1 code implementation CVPR 2023 Ce Zheng, Matias Mendieta, Taojiannan Yang, Guo-Jun Qi, Chen Chen

Recently, vision transformers have shown great success in a set of human reconstruction tasks such as 2D human pose estimation (2D HPE), 3D human pose estimation (3D HPE), and human mesh reconstruction (HMR) tasks.

2D Human Pose Estimation 3D Human Pose Estimation

Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection

2 code implementations ICCV 2021 Ziteng Cui, Guo-Jun Qi, Lin Gu, ShaoDi You, Zenghui Zhang, Tatsuya Harada

To enhance object detection in a dark environment, we propose a novel multitask auto encoding transformation (MAET) model which is able to explore the intrinsic pattern behind illumination translation.

Object object-detection +1

Dual-Flattening Transformers through Decomposed Row and Column Queries for Semantic Segmentation

no code implementations22 Jan 2022 Ying Wang, Chiuman Ho, Wenju Xu, Ziwei Xuan, Xudong Liu, Guo-Jun Qi

We propose a Dual-Flattening Transformer (DFlatFormer) to enable high-resolution output by reducing complexity to $\mathcal{O}(hw(H+W))$ that is multiple orders of magnitude smaller than the naive dense transformer.

Semantic Segmentation

RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution Images

no code implementations7 Jan 2022 Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Peng Gao, Zenghui Zhang, Tatsuya Harada

Image restoration algorithms such as super resolution (SR) are indispensable pre-processing modules for object detection in degraded images.

Image Restoration Object +4

Cross-domain Contrastive Learning for Unsupervised Domain Adaptation

no code implementations10 Jun 2021 Rui Wang, Zuxuan Wu, Zejia Weng, Jingjing Chen, Guo-Jun Qi, Yu-Gang Jiang

Unsupervised domain adaptation (UDA) aims to transfer knowledge learned from a fully-labeled source domain to a different unlabeled target domain.

Clustering Contrastive Learning +3

Self-Supervised Graph Representation Learning via Topology Transformations

1 code implementation25 May 2021 Xiang Gao, Wei Hu, Guo-Jun Qi

We formalize the proposed model from an information-theoretic perspective, by maximizing the mutual information between topology transformations and node representations before and after the transformations.

Graph Classification Graph Representation Learning +3

Contrastive Learning with Stronger Augmentations

1 code implementation15 Apr 2021 Xiao Wang, Guo-Jun Qi

Thus, we propose a general framework called Contrastive Learning with Stronger Augmentations~(CLSA) to complement current contrastive learning approaches.

Contrastive Learning Representation Learning +3

Hierarchical Deep CNN Feature Set-Based Representation Learning for Robust Cross-Resolution Face Recognition

no code implementations25 Mar 2021 Guangwei Gao, Yi Yu, Jian Yang, Guo-Jun Qi, Meng Yang

(i) To learn more robust and discriminative features, we desire to adaptively fuse the contextual features from different layers.

Face Recognition Representation Learning

Self-Supervised Multi-View Learning via Auto-Encoding 3D Transformations

no code implementations1 Mar 2021 Xiang Gao, Wei Hu, Guo-Jun Qi

Then, we self-train a representation to capture the intrinsic 3D object representation by decoding 3D transformation parameters from the fused feature representations of multiple views before and after the transformation.

3D Object Classification 3D Object Recognition +5

A Bayesian Federated Learning Framework with Online Laplace Approximation

no code implementations3 Feb 2021 Liangxi Liu, Xi Jiang, Feng Zheng, Hong Chen, Guo-Jun Qi, Heng Huang, Ling Shao

On the client side, a prior loss that uses the global posterior probabilistic parameters delivered from the server is designed to guide the local training.

Federated Learning

TopoTER: Unsupervised Learning of Topology Transformation Equivariant Representations

no code implementations1 Jan 2021 Xiang Gao, Wei Hu, Guo-Jun Qi

We formalize the TopoTER from an information-theoretic perspective, by maximizing the mutual information between topology transformations and node representations before and after the transformations.

Graph Classification

AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations from Self-Trained Negative Adversaries

2 code implementations CVPR 2021 Qianjiang Hu, Xiao Wang, Wei Hu, Guo-Jun Qi

Contrastive learning relies on constructing a collection of negative examples that are sufficiently hard to discriminate against positive queries when their representations are self-trained.

Contrastive Learning

Knowledge-Enriched Distributional Model Inversion Attacks

2 code implementations ICCV 2021 Si Chen, Mostafa Kahla, Ruoxi Jia, Guo-Jun Qi

We present a novel inversion-specific GAN that can better distill knowledge useful for performing attacks on private models from public data.

K-Shot Contrastive Learning of Visual Features with Multiple Instance Augmentations

no code implementations27 Jul 2020 Haohang Xu, Hongkai Xiong, Guo-Jun Qi

In this paper, we propose the $K$-Shot Contrastive Learning (KSCL) of visual features by applying multiple augmentations to investigate the sample variations within individual instances.

Contrastive Learning

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events

no code implementations9 May 2020 Weiyao Lin, Huabin Liu, Shizhan Liu, Yuxi Li, Rui Qian, Tao Wang, Ning Xu, Hongkai Xiong, Guo-Jun Qi, Nicu Sebe

To this end, we present a new large-scale dataset with comprehensive annotations, named Human-in-Events or HiEve (Human-centric video analysis in complex Events), for the understanding of human motions, poses, and actions in a variety of realistic events, especially in crowd & complex events.

Action Recognition Pose Estimation

Spatial-Temporal Transformer Networks for Traffic Flow Forecasting

1 code implementation9 Jan 2020 Mingxing Xu, Wenrui Dai, Chunmiao Liu, Xing Gao, Weiyao Lin, Guo-Jun Qi, Hongkai Xiong

In this paper, we propose a novel paradigm of Spatial-Temporal Transformer Networks (STTNs) that leverages dynamical directed spatial dependencies and long-range temporal dependencies to improve the accuracy of long-term traffic forecasting.

Traffic Prediction

FLAT: Few-Shot Learning via Autoencoding Transformation Regularizers

no code implementations29 Dec 2019 Haohang Xu, Hongkai Xiong, Guo-Jun Qi

To this end, we present a novel regularization mechanism by learning the change of feature representations induced by a distribution of transformations without using the labels of data examples.

Data Augmentation Few-Shot Learning +1

GraphTER: Unsupervised Learning of Graph Transformation Equivariant Representations via Auto-Encoding Node-wise Transformations

1 code implementation CVPR 2020 Xiang Gao, Wei Hu, Guo-Jun Qi

Recent advances in Graph Convolutional Neural Networks (GCNNs) have shown their efficiency for non-Euclidean data on graphs, which often require a large amount of labeled data with high cost.

Point Cloud Segmentation

AETv2: AutoEncoding Transformations for Self-Supervised Representation Learning by Minimizing Geodesic Distances in Lie Groups

no code implementations16 Nov 2019 Feng Lin, Haohang Xu, Houqiang Li, Hongkai Xiong, Guo-Jun Qi

For this reason, we should use the geodesic to characterize how an image transform along the manifold of a transformation group, and adopt its length to measure the deviation between transformations.

Representation Learning Self-Supervised Learning

An End-to-End Foreground-Aware Network for Person Re-Identification

no code implementations25 Oct 2019 Yiheng Liu, Wengang Zhou, Jianzhuang Liu, Guo-Jun Qi, Qi Tian, Houqiang Li

By presenting a target attention loss, the pedestrian features extracted from the foreground branch become more insensitive to the backgrounds, which greatly reduces the negative impacts of changing backgrounds on matching an identical across different camera views.

Person Re-Identification

Spatiotemporal Co-attention Recurrent Neural Networks for Human-Skeleton Motion Prediction

no code implementations29 Sep 2019 Xiangbo Shu, Liyan Zhang, Guo-Jun Qi, Wei Liu, Jinhui Tang

To this end, we propose a novel Skeleton-joint Co-attention Recurrent Neural Networks (SC-RNN) to capture the spatial coherence among joints, and the temporal evolution among skeletons simultaneously on a skeleton-joint co-attention feature map in spatiotemporal space.

Human motion prediction motion prediction

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

8 code implementations ICLR 2020 Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong

Differentiable architecture search (DARTS) provided a fast solution in finding effective network architectures, but suffered from large memory and computing overheads in jointly training a super-network and searching for an optimal architecture.

Neural Architecture Search

Learning Generalized Transformation Equivariant Representations via Autoencoding Transformations

no code implementations19 Jun 2019 Guo-Jun Qi, Liheng Zhang, Xiao Wang

Transformation Equivariant Representations (TERs) aim to capture the intrinsic visual structures that equivary to various transformations by expanding the notion of {\em translation} equivariance underlying the success of Convolutional Neural Networks (CNNs).

Translation

Differential Recurrent Neural Network and its Application for Human Activity Recognition

no code implementations9 May 2019 Naifan Zhuang, Guo-Jun Qi, The Duc Kieu, Kien A. Hua

The Long Short-Term Memory (LSTM) recurrent neural network is capable of processing complex sequential information since it utilizes special gating schemes for learning representations from long input sequences.

Human Activity Recognition Time Series +1

Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods

no code implementations27 Mar 2019 Guo-Jun Qi, Jiebo Luo

Representation learning with small labeled data have emerged in many problems, since the success of deep neural networks often relies on the availability of a huge amount of labeled data that is expensive to collect.

Domain Adaptation Representation Learning +1

AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations

1 code implementation ICCV 2019 Guo-Jun Qi, Liheng Zhang, Chang Wen Chen, Qi Tian

This ensures the resultant TERs of individual images contain the {\em intrinsic} information about their visual structures that would equivary {\em extricably} under various transformations in a generalized {\em nonlinear} case.

Learning to Adaptively Scale Recurrent Neural Networks

no code implementations15 Feb 2019 Hao Hu, Liqiang Wang, Guo-Jun Qi

Recent advancements in recurrent neural network (RNN) research have demonstrated the superiority of utilizing multiscale structures in learning temporal representations of time series.

Time Series Time Series Analysis

AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations rather than Data

1 code implementation CVPR 2019 Liheng Zhang, Guo-Jun Qi, Liqiang Wang, Jiebo Luo

The success of deep neural networks often relies on a large amount of labeled examples, which can be difficult to obtain in many real scenarios.

Representation Learning

Prior-Knowledge and Attention-based Meta-Learning for Few-Shot Learning

no code implementations11 Dec 2018 Yunxiao Qin, WeiGuo Zhang, Chenxu Zhao, Zezheng Wang, Xiangyu Zhu, Guo-Jun Qi, Jingping Shi, Zhen Lei

In this paper, inspired by the human cognition process which utilizes both prior-knowledge and vision attention in learning new knowledge, we present a novel paradigm of meta-learning approach with three developments to introduce attention mechanism and prior-knowledge for meta-learning.

Few-Shot Learning

Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition

no code implementations1 Nov 2018 Xiangbo Shu, Jinhui Tang, Guo-Jun Qi, Wei Liu, Jian Yang

In a Co-LSTM unit, each sub-memory unit stores individual motion information, while this Co-LSTM unit selectively integrates and stores inter-related motion information between multiple interacting persons from multiple sub-memory units via the cell gate and co-memory cell, respectively.

Action Recognition Human Interaction Recognition +1

An Adversarial Approach to Hard Triplet Generation

no code implementations ECCV 2018 Yiru Zhao, Zhongming Jin, Guo-Jun Qi, Hongtao Lu, Xian-Sheng Hua

While deep neural networks have demonstrated competitive results for many visual recognition and image retrieval tasks, the major challenge lies in distinguishing similar images from different categories (i. e., hard negative examples) while clustering images with large variations from the same category (i. e., hard positive examples).

Clustering Image Retrieval +1

Generalized Loss-Sensitive Adversarial Learning with Manifold Margins

no code implementations ECCV 2018 Marzieh Edraki, Guo-Jun Qi

Such a manifold assumption suggests the distance over the manifold should be a better measure to characterize the distinct between real and fake sam- ples.

Large-scale Bisample Learning on ID Versus Spot Face Recognition

no code implementations8 Jun 2018 Xiangyu Zhu, Hao liu, Zhen Lei, Hailin Shi, Fan Yang, Dong Yi, Guo-Jun Qi, Stan Z. Li

In this paper, we propose a deep learning based large-scale bisample learning (LBL) method for IvS face recognition.

Face Recognition General Classification

Interleaved Structured Sparse Convolutional Neural Networks

no code implementations CVPR 2018 Guotian Xie, Jingdong Wang, Ting Zhang, Jian-Huang Lai, Richang Hong, Guo-Jun Qi

In this paper, we study the problem of designing efficient convolutional neural network architectures with the interest in eliminating the redundancy in convolution kernels.

Task-Agnostic Meta-Learning for Few-shot Learning

no code implementations20 May 2018 Muhammad Abdullah Jamal, Guo-Jun Qi, Mubarak Shah

Meta-learning approaches have been proposed to tackle the few-shot learning problem. Typically, a meta-learner is trained on a variety of tasks in the hopes of being generalizable to new tasks.

Classification Few-Shot Learning +1

CapProNet: Deep Feature Learning via Orthogonal Projections onto Capsule Subspaces

no code implementations NeurIPS 2018 Liheng Zhang, Marzieh Edraki, Guo-Jun Qi

In this paper, we formalize the idea behind capsule nets of using a capsule vector rather than a neuron activation to predict the label of samples.

Deep Ordinal Hashing with Spatial Attention

no code implementations7 May 2018 Lu Jin, Xiangbo Shu, Kai Li, Zechao Li, Guo-Jun Qi, Jinhui Tang

However, most existing deep hashing methods directly learn the hash functions by encoding the global semantic information, while ignoring the local spatial information of images.

Deep Hashing Image Retrieval

Sharp Attention Network via Adaptive Sampling for Person Re-identification

no code implementations7 May 2018 Chen Shen, Guo-Jun Qi, Rongxin Jiang, Zhongming Jin, Hongwei Yong, Yaowu Chen, Xian-Sheng Hua

In this paper, we present novel sharp attention networks by adaptively sampling feature maps from convolutional neural networks (CNNs) for person re-identification (re-ID) problem.

Person Re-Identification

IGCV$2$: Interleaved Structured Sparse Convolutional Neural Networks

2 code implementations17 Apr 2018 Guotian Xie, Jingdong Wang, Ting Zhang, Jian-Huang Lai, Richang Hong, Guo-Jun Qi

In this paper, we study the problem of designing efficient convolutional neural network architectures with the interest in eliminating the redundancy in convolution kernels.

Deep Differential Recurrent Neural Networks

no code implementations11 Apr 2018 Naifan Zhuang, The Duc Kieu, Guo-Jun Qi, Kien A. Hua

The proposed model progressively builds up the ability of the LSTM gates to detect salient dynamical patterns in deeper stacked layers modeling higher orders of DoS, and thus the proposed LSTM model is termed deep differential Recurrent Neural Network (d2RNN).

Temporal Sequences

Global versus Localized Generative Adversarial Nets

2 code implementations CVPR 2018 Guo-Jun Qi, Liheng Zhang, Hao Hu, Marzieh Edraki, Jingdong Wang, Xian-Sheng Hua

In this paper, we present a novel localized Generative Adversarial Net (GAN) to learn on the manifold of real data.

General Classification

Interleaved Group Convolutions

no code implementations ICCV 2017 Ting Zhang, Guo-Jun Qi, Bin Xiao, Jingdong Wang

The main point lies in a novel building block, a pair of two successive interleaved group convolutions: primary group convolution and secondary group convolution.

Stock Price Prediction via Discovering Multi-Frequency Trading Patterns

1 code implementation13 Aug 2017 Liheng Zhang, Charu Aggarwal, Guo-Jun Qi

Then the future stock prices are predicted as a nonlinear mapping of the combination of these components in an Inverse Fourier Transform (IFT) fashion.

Stock Price Prediction Time Series Analysis

State-Frequency Memory Recurrent Neural Networks

2 code implementations ICML 2017 Hao Hu, Guo-Jun Qi

Modeling temporal sequences plays a fundamental role in various modern applications and has drawn more and more attentions in the machine learning community.

Temporal Sequences

Interleaved Group Convolutions for Deep Neural Networks

2 code implementations10 Jul 2017 Ting Zhang, Guo-Jun Qi, Bin Xiao, Jingdong Wang

The main point lies in a novel building block, a pair of two successive interleaved group convolutions: primary group convolution and secondary group convolution.

Concurrence-Aware Long Short-Term Sub-Memories for Person-Person Action Recognition

no code implementations3 Jun 2017 Xiangbo Shu, Jinhui Tang, Guo-Jun Qi, Yan Song, Zechao Li, Liyan Zhang

To this end, we propose a novel Concurrence-Aware Long Short-Term Sub-Memories (Co-LSTSM) to model the long-term inter-related dynamics between two interacting people on the bounding boxes covering people.

Action Recognition Temporal Action Localization

Joint Intermodal and Intramodal Label Transfers for Extremely Rare or Unseen Classes

no code implementations22 Mar 2017 Guo-Jun Qi, Wei Liu, Charu Aggarwal, Thomas Huang

One of our goals in this paper is to develop a model for revealing the functional relationships between text and image features as to directly transfer intermodal and intramodal labels to annotate the images.

General Classification Image Classification +3

Loss-Sensitive Generative Adversarial Networks on Lipschitz Densities

1 code implementation23 Jan 2017 Guo-Jun Qi

In this paper, we present the Lipschitz regularization theory and algorithms for a novel Loss-Sensitive Generative Adversarial Network (LS-GAN).

Generative Adversarial Network Image Classification +1

Hierarchically Gated Deep Networks for Semantic Segmentation

no code implementations CVPR 2016 Guo-Jun Qi

While image structures usually have various scales, it is difficult to use a single scale to model the spatial contexts for all individual pixels.

Semantic Segmentation

First-Take-All: Temporal Order-Preserving Hashing for 3D Action Videos

no code implementations6 Jun 2015 Jun Ye, Hao Hu, Kai Li, Guo-Jun Qi, Kien A. Hua

With the prevalence of the commodity depth cameras, the new paradigm of user interfaces based on 3D motion capturing and recognition have dramatically changed the way of interactions between human and computers.

3D Action Recognition

Sparse Composite Quantization

no code implementations CVPR 2015 Ting Zhang, Guo-Jun Qi, Jinhui Tang, Jingdong Wang

The benefit is that the distance evaluation between the query and the dictionary element (a sparse vector) is accelerated using the efficient sparse vector operation, and thus the cost of distance table computation is reduced a lot.

Quantization Retrieval

Differential Recurrent Neural Networks for Action Recognition

no code implementations ICCV 2015 Vivek Veeriah, Naifan Zhuang, Guo-Jun Qi

This change in information gain is quantified by Derivative of States (DoS), and thus the proposed LSTM model is termed as differential Recurrent Neural Network (dRNN).

Action Recognition Temporal Action Localization +2

Rank Subspace Learning for Compact Hash Codes

no code implementations19 Mar 2015 Kai Li, Guo-Jun Qi, Jun Ye, Kien A. Hua

In this work, we propose a novel hash learning framework that encodes feature's rank orders instead of numeric values in a number of optimal low-dimensional ranking subspaces.

Cannot find the paper you are looking for? You can Submit a new open access paper.