Search Results for author: Houqiang Li

Found 86 papers, 36 papers with code

SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition

no code implementations ICCV 2021 Hezhen Hu, Weichao Zhao, Wengang Zhou, Yuechen Wang, Houqiang Li

To validate the effectiveness of our method on SLR, we perform extensive experiments on four public benchmark datasets, i.e., NMFs-CSL, SLR500, MSASL and WLASL.

Self-Supervised Learning Sign Language Recognition

Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent

no code implementations 11 Oct 2021 Weiming Liu, Huacong Jiang, Bin Li, Houqiang Li

With these equivalences, two new algorithms, which can be considered as the extensions of vanilla CFR and CFR+, are deduced from the perspective of FTRL and OMD.

One-shot Key Information Extraction from Document with Deep Partial Graph Matching

no code implementations 26 Sep 2021 Minghong Yao, Zhiguang Liu, Liangwei Wang, Houqiang Li, Liansheng Zhuang

However, collecting and labeling a large dataset is time-consuming and is not a user-friendly requirement for many cloud platforms.

Graph Matching Key Information Extraction

Learning Fine-Grained Motion Embedding for Landscape Animation

no code implementations 6 Sep 2021 Hongwei Xue, Bei Liu, Huan Yang, Jianlong Fu, Houqiang Li, Jiebo Luo

To tackle this problem, we propose a model named FGLA to generate high-quality and realistic videos by learning Fine-Grained motion embedding for Landscape Animation.

Discovering Representation Sprachbund For Multilingual Pre-Training

no code implementations 1 Sep 2021 Yimin Fan, Yaobo Liang, Alexandre Muzio, Hany Hassan, Houqiang Li, Ming Zhou, Nan Duan

Then we cluster all the target languages into multiple groups and name each group as a representation sprachbund.

Heredity-aware Child Face Image Generation with Latent Space Disentanglement

no code implementations 25 Aug 2021 Xiao Cui, Wengang Zhou, Yang Hu, Weilun Wang, Houqiang Li

The main idea is to disentangle the latent space of a pre-trained generation model and precisely control the face attributes of child images with clear semantics.

Image Generation

Conditional DETR for Fast Training Convergence

1 code implementation ICCV 2021 Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang

Our approach, named conditional DETR, learns a conditional spatial query from the decoder embedding for decoder multi-head cross-attention.

Object Classification Object Detection

Joint Inductive and Transductive Learning for Video Object Segmentation

1 code implementation ICCV 2021 Yunyao Mao, Ning Wang, Wengang Zhou, Houqiang Li

In this work, we propose to integrate transductive and inductive learning into a unified framework to exploit the complementarity between them for accurate and robust video object segmentation.

Semantic Segmentation Semi-Supervised Video Object Segmentation +1

From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection

1 code implementation 30 Jul 2021 Jiajun Deng, Wengang Zhou, Yanyong Zhang, Houqiang Li

To this end, in this work, we regard point clouds as hollow-3D data and propose a new architecture, namely Hallucinated Hollow-3D R-CNN ($\text{H}^2$3D R-CNN), to address the problem of 3D object detection.

3D Object Detection Scene Understanding

Supervised Off-Policy Ranking

1 code implementation 3 Jul 2021 Yue Jin, Yue Zhang, Tao Qin, Xudong Zhang, Jian Yuan, Houqiang Li, Tie-Yan Liu

Off-policy evaluation (OPE) leverages data generated by other policies to evaluate a target policy.

Revisiting Knowledge Distillation: An Inheritance and Exploration Framework

1 code implementation CVPR 2021 Zhen Huang, Xu Shen, Jun Xing, Tongliang Liu, Xinmei Tian, Houqiang Li, Bing Deng, Jianqiang Huang, Xian-Sheng Hua

The inheritance part is learned with a similarity loss to transfer the existing learned knowledge from the teacher model to the student model, while the exploration part is encouraged to learn representations different from the inherited ones with a dis-similarity loss.

Knowledge Distillation

Weakly Supervised Temporal Adjacent Network for Language Grounding

1 code implementation 30 Jun 2021 Yuechen Wang, Jiajun Deng, Wengang Zhou, Houqiang Li

To this end, we introduce a novel weakly supervised temporal adjacent network (WSTAN) for temporal language grounding.

Multiple Instance Learning

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training

no code implementations 25 Jun 2021 Hongwei Xue, Yupan Huang, Bei Liu, Houwen Peng, Jianlong Fu, Houqiang Li, Jiebo Luo

To tackle this, we propose a fully Transformer visual embedding for VLP to better learn visual relation and further promote inter-modal alignment.

Question Answering Visual Entailment +2

Representing Videos As Discriminative Sub-Graphs for Action Recognition

no code implementations CVPR 2021 Dong Li, Zhaofan Qiu, Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei

For each action category, we execute online clustering to decompose the graph into sub-graphs on each scale through learning Gaussian Mixture Layer and select the discriminative sub-graphs as action prototypes for recognition.

Action Recognition Graph Learning +1

ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation

no code implementations CVPR 2021 Xinyue Huo, Lingxi Xie, Jianzhong He, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian

Semi-supervised learning is a useful tool for image segmentation, mainly due to its ability to extract knowledge from unlabeled data to assist learning from labeled data.

Continual Learning Semantic Segmentation

Dual-view Molecule Pre-training

no code implementations 17 Jun 2021 Jinhua Zhu, Yingce Xia, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

After pre-training, we can use either the Transformer branch (this one is recommended according to empirical results), the GNN branch, or both for downstream tasks.

Molecular Property Prediction Single-step retrosynthesis

Exploring the Diversity and Invariance in Yourself for Visual Pre-Training Task

no code implementations 1 Jun 2021 Longhui Wei, Lingxi Xie, Wengang Zhou, Houqiang Li, Qi Tian

By simply pulling the different augmented views of each image together or other novel mechanisms, they can learn much unsupervised knowledge and significantly improve the transfer performance of pre-training models.

Self-Supervised Learning

Improving Sign Language Translation with Monolingual Data by Sign Back-Translation

no code implementations CVPR 2021 Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li

Finally, the synthetic parallel data serves as a strong supplement for the end-to-end training of the encoder-decoder SLT framework.

Sign Language Translation Translation

TransVG: End-to-End Visual Grounding with Transformers

2 code implementations ICCV 2021 Jiajun Deng, Zhengyuan Yang, Tianlang Chen, Wengang Zhou, Houqiang Li

In this paper, we present a neat yet effective transformer-based framework for visual grounding, namely TransVG, to address the task of grounding a language query to the corresponding region onto an image.

Referring Expression Comprehension Visual Grounding

Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning

no code implementations 5 Apr 2021 Chaoqun Wang, Xuejin Chen, Shaobo Min, Xiaoyan Sun, Houqiang Li

First, DCEN leverages task labels to cluster representations of the same semantic category by cross-modal contrastive learning and exploring semantic-visual complementarity.

Contrastive Learning Generalized Zero-Shot Learning

Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE

2 code implementations CVPR 2021 Jialun Peng, Dong Liu, Songcen Xu, Houqiang Li

We propose a two-stage model for diverse inpainting, where the first stage generates multiple coarse results each of which has a different structure, and the second stage refines each coarse result separately by augmenting texture.

Image Inpainting Quantization +1

IOT: Instance-wise Layer Reordering for Transformer Structures

1 code implementation ICLR 2021 Jinhua Zhu, Lijun Wu, Yingce Xia, Shufang Xie, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

Based on this observation, in this work, we break the assumption of the fixed layer order in the Transformer and introduce instance-wise layer reordering into the model structure.

Abstractive Text Summarization Code Generation +2

3D Local Convolutional Neural Networks for Gait Recognition

1 code implementation ICCV 2021 Zhen Huang, Dixiu Xue, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, Xian-Sheng Hua

Second, different body parts possess different scales, and even the same part in different frames can appear at different locations and scales.

Gait Recognition

Consistent Instance Classification for Unsupervised Representation Learning

no code implementations 1 Jan 2021 Depu Meng, Zigang Geng, Zhirong Wu, Bin Xiao, Houqiang Li, Jingdong Wang

The proposed consistent instance classification (ConIC) approach simultaneously optimizes the classification loss and an additional consistency loss explicitly penalizing the feature dissimilarity between the augmented views from the same instance.

Classification General Classification +1

Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval

1 code implementation ICCV 2021 Hui Wu, Min Wang, Wengang Zhou, Houqiang Li

To this end, we propose a novel deep local feature learning architecture to simultaneously focus on multiple discriminative local patterns in an image.

Image Retrieval Metric Learning

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

4 code implementations 31 Dec 2020 Jiajun Deng, Shaoshuai Shi, Peiwei Li, Wengang Zhou, Yanyong Zhang, Houqiang Li

In this paper, we take a slightly different viewpoint -- we find that precise positioning of raw points is not essential for high performance 3D object detection and that the coarse voxel granularity can also offer sufficient detection accuracy.

3D Object Detection Region Proposal

Contrastive Transformation for Self-supervised Correspondence Learning

1 code implementation 9 Dec 2020 Ning Wang, Wengang Zhou, Houqiang Li

It is worth mentioning that our method also surpasses the fully-supervised affinity representation (e.g., ResNet) and performs competitively against the recent fully-supervised algorithms designed for the specific tasks (e.g., VOT and VOS).

Self-Supervised Learning Semantic Segmentation +3

Unsupervised Pre-training for Person Re-identification

1 code implementation CVPR 2021 Dengpan Fu, Dongdong Chen, Jianmin Bao, Hao Yang, Lu Yuan, Lei Zhang, Houqiang Li, Dong Chen

In this paper, we present a large scale unlabeled person re-identification (Re-ID) dataset "LUPerson" and make the first attempt of performing unsupervised pre-training for improving the generalization ability of the learned person Re-ID feature representation.

Ranked #2 on Person Re-Identification on MSMT17 (using extra training data)

Data Augmentation Person Re-Identification +1

Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

no code implementations NeurIPS 2020 Qi Zhou, Yufei Kuang, Zherui Qiu, Houqiang Li, Jie Wang

However, in continuous action spaces, integrating entropy regularization with expressive policies is challenging and usually requires complex inference procedures.

Continuous Control

Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations

no code implementations 19 Nov 2020 Xinyue Huo, Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Hao Li, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian

Contrastive learning has achieved great success in self-supervised visual representation learning, but existing approaches mostly ignored spatial information which is often crucial for visual representation.

Contrastive Learning Data Augmentation +1

ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search Engine

no code implementations 21 Oct 2020 Weizhen Qi, Yeyun Gong, Yu Yan, Jian Jiao, Bo Shao, Ruofei Zhang, Houqiang Li, Nan Duan, Ming Zhou

We build a dataset from a real-world sponsored search engine and carry out experiments to analyze different generative retrieval models.

Masked Contrastive Representation Learning for Reinforcement Learning

1 code implementation 15 Oct 2020 Jinhua Zhu, Yingce Xia, Lijun Wu, Jiajun Deng, Wengang Zhou, Tao Qin, Houqiang Li

During inference, the CNN encoder and the policy network are used to take actions, and the Transformer module is discarded.

Atari Games Contrastive Learning +1

Boosting Continuous Sign Language Recognition via Cross Modality Augmentation

no code implementations 11 Oct 2020 Junfu Pu, Wengang Zhou, Hezhen Hu, Houqiang Li

Continuous sign language recognition (SLR) deals with unaligned video-text pairs and uses the word error rate (WER), i.e., edit distance, as the main evaluation metric.

Sign Language Recognition
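
As a rough illustration of the metric named in the entry above, WER is commonly computed as the word-level edit distance between a reference and a hypothesis, divided by the number of reference words. The sketch below is not taken from the paper; the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper: tokenizes on whitespace, which is an assumption,
    not something specified by the listed paper.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # delete a reference word
                curr[j - 1] + 1,                  # insert a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitute
            )
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions are counted, the value can exceed 1.0 when the hypothesis is much longer than the reference.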

Improving Person Re-identification with Iterative Impression Aggregation

no code implementations 21 Sep 2020 Dengpan Fu, Bo Xin, Jingdong Wang, Dong-Dong Chen, Jianmin Bao, Gang Hua, Houqiang Li

Not only does such a simple method improve the performance of the baseline models, it also achieves performance comparable to the latest advanced re-ranking methods.

Person Re-Identification Re-Ranking

Global-local Enhancement Network for NMFs-aware Sign Language Recognition

no code implementations 24 Aug 2020 Hezhen Hu, Wengang Zhou, Junfu Pu, Houqiang Li

Sign language recognition (SLR) is a challenging problem, involving complex manual features, i.e., hand gestures, and fine-grained non-manual features (NMFs), i.e., facial expression, mouth shapes, etc.

Sign Language Recognition

Single Shot Video Object Detector

1 code implementation 7 Jul 2020 Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, Tao Mei

Single shot detectors that are potentially faster and simpler than two-stage detectors tend to be more applicable to object detection in videos.

Object Detection

Efficient Integer-Arithmetic-Only Convolutional Neural Networks

1 code implementation 21 Jun 2020 Hengrui Zhao, Dong Liu, Houqiang Li

Considering the tradeoff between activation quantization error and network learning ability, we set an empirical rule to tune the bound of each Bounded ReLU.

Image Super-Resolution Quantization

Cascaded Regression Tracking: Towards Online Hard Distractor Discrimination

no code implementations 18 Jun 2020 Ning Wang, Wengang Zhou, Qi Tian, Houqiang Li

In the second stage, a discrete sampling based ridge regression is designed to double-check the remaining ambiguous hard samples, which serves as an alternative of fully-connected layers and benefits from the closed-form solver for efficient learning.

Visual Tracking

M-LVC: Multiple Frames Prediction for Learned Video Compression

1 code implementation CVPR 2020 Jianping Lin, Dong Liu, Houqiang Li, Feng Wu

To compensate for the compression error of the auto-encoders, we further design a MV refinement network and a residual refinement network, making use of the multiple reference frames as well.

MS-SSIM SSIM +1

Long Short-Term Relation Networks for Video Action Detection

no code implementations 31 Mar 2020 Dong Li, Ting Yao, Zhaofan Qiu, Houqiang Li, Tao Mei

It has been well recognized that modeling human-object or object-object relations would be helpful for the detection task.

Action Detection Region Proposal

Incorporating BERT into Neural Machine Translation

2 code implementations ICLR 2020 Jinhua Zhu, Yingce Xia, Lijun Wu, Di He, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

While BERT is more commonly used for fine-tuning than as a contextual embedding in downstream language understanding tasks, in NMT, our preliminary exploration finds that using BERT as a contextual embedding works better than using it for fine-tuning.

Document-level Natural Language Understanding +4

Soft Hindsight Experience Replay

2 code implementations 6 Feb 2020 Qiwei He, Liansheng Zhuang, Houqiang Li

However, due to the brittleness of deterministic methods, HER and its variants typically suffer from a major challenge for stability and convergence, which significantly affects the final performance.

Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization

1 code implementation 28 Nov 2019 Qi Zhou, Houqiang Li, Jie Wang

In this paper, we propose a Policy Optimization method with Model-Based Uncertainty (POMBU), a novel model-based approach that can effectively improve the asymptotic performance using the uncertainty in Q-values.

Model-based Reinforcement Learning

A Generalization Theory based on Independent and Task-Identically Distributed Assumption

no code implementations 28 Nov 2019 Guanhua Zheng, Jitao Sang, Houqiang Li, Jian Yu, Changsheng Xu

The derived generalization bound based on the ITID assumption identifies the significance of hypothesis invariance in guaranteeing generalization performance.

Image Classification

Quantization Networks

1 code implementation CVPR 2019 Jiwei Yang, Xu Shen, Jun Xing, Xinmei Tian, Houqiang Li, Bing Deng, Jianqiang Huang, Xian-Sheng Hua

The proposed quantization function can be learned in a lossless and end-to-end manner and works for any weights and activations of neural networks in a simple and uniform way.

Image Classification Object Detection +1

AETv2: AutoEncoding Transformations for Self-Supervised Representation Learning by Minimizing Geodesic Distances in Lie Groups

no code implementations 16 Nov 2019 Feng Lin, Haohang Xu, Houqiang Li, Hongkai Xiong, Guo-Jun Qi

For this reason, we should use the geodesic to characterize how an image transforms along the manifold of a transformation group, and adopt its length to measure the deviation between transformations.

Representation Learning Self-Supervised Learning

An End-to-End Foreground-Aware Network for Person Re-Identification

no code implementations 25 Oct 2019 Yiheng Liu, Wengang Zhou, Jianzhuang Liu, Guo-Jun Qi, Qi Tian, Houqiang Li

By presenting a target attention loss, the pedestrian features extracted from the foreground branch become more insensitive to the backgrounds, which greatly reduces the negative impacts of changing backgrounds on matching an identical across different camera views.

Person Re-Identification

Real-Time Correlation Tracking via Joint Model Compression and Transfer

1 code implementation 23 Jul 2019 Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Houqiang Li

In the distillation process, we propose a fidelity loss to enable the student network to maintain the representation capability of the teacher network.

Image Classification Knowledge Distillation +3

Online Filter Clustering and Pruning for Efficient Convnets

no code implementations 28 May 2019 Zhengguang Zhou, Wengang Zhou, Richang Hong, Houqiang Li

Pruning filters is an effective method for accelerating deep neural networks (DNNs), but most existing approaches prune filters on a pre-trained network directly, which limits the achievable acceleration.

Progressive Learning of Low-Precision Networks

no code implementations 28 May 2019 Zhengguang Zhou, Wengang Zhou, Xutao Lv, Xuan Huang, Xiaoyu Wang, Houqiang Li

Recent years have witnessed the great advance of deep learning in a variety of vision tasks.

Deep Learning-Based Video Coding: A Review and A Case Study

1 code implementation 29 Apr 2019 Dong Liu, Yue Li, Jianping Lin, Houqiang Li, Feng Wu

For deep schemes, pixel probability modeling and auto-encoder are the two approaches, which can be viewed as a predictive coding scheme and a transform coding scheme, respectively.

Multimedia Image and Video Processing

Spatial and Temporal Mutual Promotion for Video-based Person Re-identification

1 code implementation 26 Dec 2018 Yiheng Liu, Zhenxun Yuan, Wengang Zhou, Houqiang Li

How to explore the abundant spatial-temporal information in video sequences is the key to solving this problem.

Video-Based Person Re-Identification

Affinity Derivation and Graph Merge for Instance Segmentation

1 code implementation ECCV 2018 Yiding Liu, Siyu Yang, Bin Li, Wengang Zhou, Jizheng Xu, Houqiang Li, Yan Lu

We present an instance segmentation scheme based on pixel affinity information, which is the relationship of two pixels belonging to the same instance.

Instance Segmentation Semantic Segmentation

Multi-Cue Correlation Filters for Robust Visual Tracking

1 code implementation CVPR 2018 Ning Wang, Wengang Zhou, Qi Tian, Richang Hong, Meng Wang, Houqiang Li

By combining different types of features, our approach constructs multiple experts through Discriminative Correlation Filter (DCF) and each of them tracks the target independently.

Visual Tracking

Low-Latency Human Action Recognition with Weighted Multi-Region Convolutional Neural Network

no code implementations 8 May 2018 Yunfeng Wang, Wengang Zhou, Qilin Zhang, Xiaotian Zhu, Houqiang Li

Termed "Weighted Multi-Region Convolutional Neural Network" (WMR ConvNet), the proposed system is LSTM-free, and is based on 2D ConvNet that does not require the accumulation of video frames for 3D ConvNet filtering.

Action Recognition Chunking +1

Visual Attribute-augmented Three-dimensional Convolutional Neural Network for Enhanced Human Action Recognition

no code implementations 8 May 2018 Yunfeng Wang, Wengang Zhou, Qilin Zhang, Houqiang Li

Visual attributes in individual video frames, such as the presence of characteristic objects and scenes, offer substantial information for action recognition in videos.

Action Recognition Action Recognition In Videos +4

To Create What You Tell: Generating Videos from Captions

no code implementations 23 Apr 2018 Yingwei Pan, Zhaofan Qiu, Ting Yao, Houqiang Li, Tao Mei

In this paper, we present a novel Temporal GANs conditioning on Captions, namely TGANs-C, in which the input to the generator network is a concatenation of a latent noise vector and caption embedding, and then is transformed into a frame sequence with 3D spatio-temporal convolutions.

Towards Open-Set Identity Preserving Face Synthesis

no code implementations CVPR 2018 Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, Gang Hua

We then recombine the identity vector and the attribute vector to synthesize a new face of the subject with the extracted attribute.

Face Generation

Video-based Sign Language Recognition without Temporal Segmentation

no code implementations 30 Jan 2018 Jie Huang, Wengang Zhou, Qilin Zhang, Houqiang Li, Weiping Li

Worse still, isolated SLR methods typically require strenuous labeling of each word separately in a sentence, severely limiting the amount of attainable training data.

Sign Language Recognition

Feature Selective Networks for Object Detection

no code implementations CVPR 2018 Yao Zhai, Jingjing Fu, Yan Lu, Houqiang Li

The RoI-based sub-region attention map and aspect ratio attention map are selectively pooled from the banks, and then used to refine the original RoI features for RoI classification.

Object Detection Translation

Neural network-based arithmetic coding of intra prediction modes in HEVC

no code implementations 18 Sep 2017 Rui Song, Dong Liu, Houqiang Li, Feng Wu

In this paper, we propose an arithmetic coding strategy by training neural networks, and make preliminary studies on coding of the intra prediction modes in HEVC.

Multimedia

Recent Advance in Content-based Image Retrieval: A Literature Survey

no code implementations 19 Jun 2017 Wengang Zhou, Houqiang Li, Qi Tian

The explosive increase and ubiquitous accessibility of visual data on the Web have led to the prosperity of research activity in image search or retrieval.

Content-Based Image Retrieval

A Convolutional Neural Network Approach for Half-Pel Interpolation in Video Coding

no code implementations 10 Mar 2017 Ning Yan, Dong Liu, Houqiang Li, Feng Wu

To further improve the coding efficiency, sub-pel motion compensation has been utilized, which requires interpolation of fractional samples.

Multimedia

Convolutional Neural Network-Based Block Up-sampling for Intra Frame Coding

no code implementations 22 Feb 2017 Yue Li, Dong Liu, Houqiang Li, Li Li, Feng Wu, Hong Zhang, Haitao Yang

A block can be down-sampled before being compressed by normal intra coding, and then up-sampled to its original resolution.

Multimedia

Projection based advanced motion model for cubic mapping for 360-degree video

no code implementations 21 Feb 2017 Li Li, Zhu Li, Madhukar Budagavi, Houqiang Li

This paper proposes a novel advanced motion model to handle the irregular motion for the cubic map projection of 360-degree video.

Video Captioning with Transferred Semantic Attributes

no code implementations CVPR 2017 Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei

Automatically generating natural language descriptions of videos poses a fundamental challenge for the computer vision community.

Video Captioning

Comparative Deep Learning of Hybrid Representations for Image Recommendations

no code implementations CVPR 2016 Chenyi Lei, Dong Liu, Weiping Li, Zheng-Jun Zha, Houqiang Li

In many image-related tasks, learning expressive and discriminative representations of images is essential, and deep learning has been studied for automating the learning of such representations.

Semi-Supervised Domain Adaptation With Subspace Learning for Visual Recognition

no code implementations CVPR 2015 Ting Yao, Yingwei Pan, Chong-Wah Ngo, Houqiang Li, Tao Mei

In many real-world applications, we are often facing the problem of cross domain learning, i.e., to borrow the labeled data or transfer the already learnt knowledge from a source domain to a target domain.

Domain Adaptation Object Recognition

SOM: Semantic Obviousness Metric for Image Quality Assessment

no code implementations CVPR 2015 Peng Zhang, Wengang Zhou, Lei Wu, Houqiang Li

We propose to extract two types of features, one to measure the semantic obviousness of the image and the other to discover local characteristics.

Image Quality Estimation No-Reference Image Quality Assessment

Jointly Modeling Embedding and Translation to Bridge Video and Language

no code implementations CVPR 2016 Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui

Our proposed LSTM-E consists of three components: 2-D and/or 3-D deep convolutional neural networks for learning powerful video representation, a deep RNN for generating sentences, and a joint embedding model for exploring the relationships between visual content and sentence semantics.

Translation

Separable Kernel for Image Deblurring

no code implementations CVPR 2014 Lu Fang, Haifeng Liu, Feng Wu, Xiaoyan Sun, Houqiang Li

In this paper, we deal with the image deblurring problem from a completely new perspective by proposing a separable kernel to represent the inherent properties of the camera and scene system.

Deblurring
