Search Results for author: Lingqiao Liu

Found 99 papers, 31 papers with code

MRScore: Evaluating Radiology Report Generation with LLM-based Reward System

no code implementations • 27 Apr 2024 • Yunyi Liu, Zhanyu Wang, Yingshu Li, Xinyu Liang, Lingqiao Liu, Lei Wang, Luping Zhou

This paper introduces MRScore, an automatic evaluation metric tailored for radiology report generation by leveraging Large Language Models (LLMs).

Model Selection Text Generation

CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning

1 code implementation • 15 Mar 2024 • Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang

Large pre-trained VLMs like CLIP have demonstrated superior zero-shot recognition ability, and a number of recent studies leverage this ability to mitigate catastrophic forgetting in CL, but they focus on closed-set CL in a single domain dataset.

Class Incremental Learning Incremental Learning +1

A Causal Inspired Early-Branching Structure for Domain Generalization

1 code implementation • 13 Mar 2024 • Liang Chen, Yong Zhang, Yibing Song, Zhen Zhang, Lingqiao Liu

By d-separation, we observe that the causal feature can be further characterized by being independent of the domain conditioned on the object, and we propose the following two strategies as complements for the basic framework.

Domain Generalization

Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework

1 code implementation • 12 Mar 2024 • Vu Minh Hieu Phan, Yutong Xie, Yuankai Qi, Lingqiao Liu, Liyang Liu, BoWen Zhang, Zhibin Liao, Qi Wu, Minh-Son To, Johan W. Verjans

Medical vision language pre-training (VLP) has emerged as a frontier of research, enabling zero-shot pathological recognition by comparing the query image with the textual descriptions for each disease.

Language Modelling Large Language Model

A Simple-but-effective Baseline for Training-free Class-Agnostic Counting

no code implementations • 3 Mar 2024 • YuHao Lin, HaiMing Xu, Lingqiao Liu, Javen Qinfeng Shi

Class-Agnostic Counting (CAC) seeks to accurately count objects in a given image with only a few reference examples.

I Learn Better If You Speak My Language: Enhancing Large Language Model Fine-Tuning with Style-Aligned Response Adjustments

no code implementations • 17 Feb 2024 • Xuan Ren, Biao Wu, Lingqiao Liu

The potential for overfitting on a limited number of examples can negatively impact the model's ability to generalize and retain its original skills.

Language Modelling Large Language Model

Source-Free Unsupervised Domain Adaptation with Hypothesis Consolidation of Prediction Rationale

1 code implementation • 2 Feb 2024 • Yangyang Shu, Xiaofeng Cao, Qi Chen, BoWen Zhang, Ziqin Zhou, Anton Van Den Hengel, Lingqiao Liu

Source-Free Unsupervised Domain Adaptation (SFUDA) is a challenging task where a model needs to be adapted to a new domain without access to target domain labels or source domain data.

Unsupervised Domain Adaptation

A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical Image Analysis

no code implementations • 31 Oct 2023 • Yingshu Li, Yunyi Liu, Zhanyu Wang, Xinyu Liang, Lei Wang, Lingqiao Liu, Leyang Cui, Zhaopeng Tu, Longyue Wang, Luping Zhou

This work conducts an evaluation of GPT-4V's multimodal capability for medical image analysis, with a focus on three representative tasks of radiology report generation, medical visual question answering, and medical visual grounding.

Descriptive Medical Visual Question Answering +3

Improving Online Source-free Domain Adaptation for Object Detection by Unsupervised Data Acquisition

no code implementations • 30 Oct 2023 • Xiangyu Shi, Yanyuan Qiao, Qi Wu, Lingqiao Liu, Feras Dayoub

Effective object detection in mobile robots is challenged by deployment in diverse and unfamiliar environments.

Object object-detection +2

R2GenGPT: Radiology Report Generation with Frozen LLMs

1 code implementation • 18 Sep 2023 • Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou

First, it attains state-of-the-art (SOTA) performance by training only the lightweight visual alignment module while freezing all the parameters of LLM.

Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models

no code implementations • 9 Sep 2023 • Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix

As an effective way to alleviate the burden of data annotation, semi-supervised learning (SSL) provides an attractive solution due to its ability to leverage both labeled and unlabeled data to build a predictive model.

Domain Generalization via Rationale Invariance

1 code implementation • ICCV 2023 • Liang Chen, Yong Zhang, Yibing Song, Anton Van Den Hengel, Lingqiao Liu

Specifically, we propose treating the element-wise contributions to the final results as the rationale for making a decision and representing the rationale for each sample as a matrix.

Decision Making Domain Generalization

You Can Generate It Again: Data-to-text Generation with Verification and Correction Prompting

no code implementations • 28 Jun 2023 • Xuan Ren, Lingqiao Liu

In this paper, we propose a novel approach that goes beyond traditional one-shot generation methods by introducing a multi-step process consisting of generation, verification, and correction stages.

Data-to-Text Generation

Semantic Role Labeling Guided Out-of-distribution Detection

1 code implementation • 29 May 2023 • Jinan Zou, Maihao Guo, Yu Tian, YuHao Lin, Haiyao Cao, Lingqiao Liu, Ehsan Abbasnejad, Javen Qinfeng Shi

Identifying unexpected domain-shifted instances in natural language processing is crucial in real-world applications.

Out-of-Distribution Detection Semantic Role Labeling +1

Learning Conditional Attributes for Compositional Zero-Shot Learning

1 code implementation • CVPR 2023 • Qingsheng Wang, Lingqiao Liu, Chenchen Jing, Hao Chen, Guoqiang Liang, Peng Wang, Chunhua Shen

Compositional Zero-Shot Learning (CZSL) aims to train models to recognize novel compositional concepts based on learned concepts such as attribute-object combinations.

Ranked #1 on Compositional Zero-Shot Learning on MIT-States

Attribute Compositional Zero-Shot Learning

Out-of-Distribution Generalization in Text Classification: Past, Present, and Future

no code implementations • 23 May 2023 • Linyi Yang, Yaoxiao Song, Xuan Ren, Chenyang Lyu, Yidong Wang, Lingqiao Liu, Jindong Wang, Jennifer Foster, Yue Zhang

Machine learning (ML) systems in natural language processing (NLP) face significant challenges in generalizing to out-of-distribution (OOD) data, where the test distribution differs from the training data distribution.

Out-of-Distribution Generalization text-classification +1

The CLIP Model is Secretly an Image-to-Prompt Converter

no code implementations • NeurIPS 2023 • Yuxuan Ding, Chunna Tian, Haoxuan Ding, Lingqiao Liu

The Stable Diffusion model is a prominent text-to-image generation model that relies on a text prompt as its input, which is encoded using the Contrastive Language-Image Pre-Training (CLIP).

Image-Variation Text-to-Image Generation

Improved Test-Time Adaptation for Domain Generalization

1 code implementation • CVPR 2023 • Liang Chen, Yong Zhang, Yibing Song, Ying Shan, Lingqiao Liu

Generally, a TTT strategy hinges its performance on two main factors: selecting an appropriate auxiliary TTT task for updating and identifying reliable parameters to update during the test phase.

Domain Generalization Test-time Adaptation

METransformer: Radiology Report Generation by Transformer with Multiple Learnable Expert Tokens

no code implementations • CVPR 2023 • Zhanyu Wang, Lingqiao Liu, Lei Wang, Luping Zhou

In the encoder, each expert token interacts with both vision tokens and other expert tokens to learn to attend to different image regions for image representation.

Decoder

Revisiting Image Reconstruction for Semi-supervised Semantic Segmentation

no code implementations • 17 Mar 2023 • YuHao Lin, HaiMing Xu, Lingqiao Liu, Jinan Zou, Javen Qinfeng Shi

In this paper, we revisit the idea of using image reconstruction as the auxiliary task and incorporate it with a modern semi-supervised semantic segmentation framework.

Image Reconstruction Representation Learning +2

Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems

1 code implementation • CVPR 2023 • Yangyang Shu, Anton Van Den Hengel, Lingqiao Liu

Specifically, we fit the GradCAM with a branch with limited fitting capacity, which allows the branch to capture the common rationales and discard the less common discriminative patterns.

Fine-Grained Visual Recognition Self-Supervised Learning

Stock Market Prediction via Deep Learning Techniques: A Survey

no code implementations • 24 Dec 2022 • Jinan Zou, Qingying Zhao, Yang Jiao, Haiyao Cao, Yanxi Liu, Qingsen Yan, Ehsan Abbasnejad, Lingqiao Liu, Javen Qinfeng Shi

Existing surveys on stock market prediction often focus on traditional machine learning methods instead of deep learning methods.

Stock Market Prediction

ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation

1 code implementation • CVPR 2023 • Ziqin Zhou, BoWen Zhang, Yinjie Lei, Lingqiao Liu, Yifan Liu

Recently, CLIP has been applied to pixel-level zero-shot learning tasks via a two-stage scheme.

Semantic Segmentation Zero-Shot Learning +1

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion

no code implementations • 5 Dec 2022 • Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Ruiqi Wu, Shizhou Zhang, Peng Wang, Yanning Zhang

The key insight of this design is that the cross-attention mechanism in the transformer could be an ideal solution to align the discriminative texture clues from the original image with the canonical view image, which could compensate for the low-quality texture information of the canonical view image.

Domain Generalization Generalizable Person Re-identification +1

Semi-supervised Semantic Segmentation with Prototype-based Consistency Regularization

1 code implementation • 10 Oct 2022 • Hai-Ming Xu, Lingqiao Liu, Qiuchen Bian, Zhen Yang

Semi-supervised semantic segmentation requires the model to effectively propagate the label information from limited annotated images to unlabeled ones.

Ranked #2 on Semi-Supervised Semantic Segmentation on PASCAL VOC 2012 50%

Semi-Supervised Semantic Segmentation

Regularizing Neural Network Training via Identity-wise Discriminative Feature Suppression

1 code implementation • 29 Sep 2022 • Avraham Chapman, Lingqiao Liu

It suppresses features that can be utilized to identify individual instances among samples within each class.

Pretrained Language Encoders are Natural Tagging Frameworks for Aspect Sentiment Triplet Extraction

no code implementations • 20 Aug 2022 • Yanjie Gou, Yinjie Lei, Lingqiao Liu, Yong Dai, Chunxu Shen, Yongqi Tong

Existing works usually formulate the span detection as a 1D token tagging problem, and model the sentiment recognition with a 2D tagging matrix of token pairs.

Aspect Sentiment Triplet Extraction Inductive Bias

Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization

1 code implementation • 18 Aug 2022 • Xizhe Xue, Dongdong Yu, Lingqiao Liu, Yu Liu, Satoshi Tsutsui, Ying Li, Zehuan Yuan, Ping Song, Mike Zheng Shou

Based on the single-stage instance segmentation framework, we propose a regularization model to predict foreground pixels and use its relation to instance segmentation to construct a cross-task consistency loss.

Autonomous Driving Object +3

Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-Boosting Attention Mechanism

1 code implementation • 1 Aug 2022 • Yangyang Shu, Baosheng Yu, HaiMing Xu, Lingqiao Liu

In low data regimes, a network often struggles to choose the correct regions for recognition and tends to overfit to spuriously correlated patterns in the training data.

Fine-Grained Visual Recognition

Don't Stop Learning: Towards Continual Learning for the CLIP Model

no code implementations • 19 Jul 2022 • Yuxuan Ding, Lingqiao Liu, Chunna Tian, Jingyuan Yang, Haoxuan Ding

The Contrastive Language-Image Pre-training (CLIP) model is a recently proposed large-scale pre-trained model that has attracted increasing attention in the computer vision community.

Continual Learning Image-text matching +2

Learning Resolution-Adaptive Representations for Cross-Resolution Person Re-Identification

no code implementations • 9 Jul 2022 • Lin Wu, Lingqiao Liu, Yang Wang, Zheng Zhang, Farid Boussaid, Mohammed Bennamoun

It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras.

Person Re-Identification Super-Resolution

Dual Decision Improves Open-Set Panoptic Segmentation

no code implementations • 6 Jul 2022 • Hai-Ming Xu, Hao Chen, Lingqiao Liu, Yufei Yin

Then we distinguish the "unknown things" from the background by using the additional object prediction head.

Panoptic Segmentation

Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model

1 code implementation • 14 Jun 2022 • Jinan Zou, Haiyao Cao, Lingqiao Liu, YuHao Lin, Ehsan Abbasnejad, Javen Qinfeng Shi

In addition, we propose a self-supervised learning strategy based on SRLP to enhance the out-of-distribution generalization performance of our system.

Ranked #1 on Stock Price Prediction on Astock

Decision Making News Classification +5

Progressive Class Semantic Matching for Semi-supervised Text Classification

1 code implementation • NAACL 2022 • Hai-Ming Xu, Lingqiao Liu, Ehsan Abbasnejad

Semi-supervised learning is a promising way to reduce the annotation cost for text-classification.

General Classification Language Modelling +1

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

1 code implementation • CVPR 2022 • Liang Chen, Yong Zhang, Yibing Song, Lingqiao Liu, Jue Wang

Following this principle, we propose to enrich the "diversity" of forgeries by synthesizing augmented forgeries with a pool of forgery configurations and strengthen the "sensitivity" to the forgeries by enforcing the model to predict the forgery configurations.

DeepFake Detection Face Swapping +1

Multi-Domain Joint Training for Person Re-Identification

no code implementations • 6 Jan 2022 • Lu Yang, Lingqiao Liu, Yunlong Wang, Peng Wang, Yanning Zhang

Our discovery is that training with such an adaptive model can better benefit from more training samples.

Person Re-Identification

Global and Local Texture Randomization for Synthetic-to-Real Semantic Segmentation

no code implementations • 5 Aug 2021 • Duo Peng, Yinjie Lei, Lingqiao Liu, Pingping Zhang, Jun Liu

In this work, we propose two simple yet effective texture randomization mechanisms, Global Texture Randomization (GTR) and Local Texture Randomization (LTR), for Domain Generalization based SRSS.

Ranked #13 on Domain Generalization on GTA-to-Avg(Cityscapes,BDD,Mapillary)

Domain Generalization Segmentation +1

Feature Encoding with AutoEncoders for Weakly-supervised Anomaly Detection

2 code implementations • 22 May 2021 • Yingjie Zhou, Xucheng Song, Yanru Zhang, Fanxing Liu, Ce Zhu, Lingqiao Liu

Weakly-supervised anomaly detection aims at learning an anomaly detector from a limited amount of labeled data and abundant unlabeled data.

Supervised Anomaly Detection Weakly-supervised Anomaly Detection

CAT: Cross-Attention Transformer for One-Shot Object Detection

no code implementations • 30 Apr 2021 • Weidong Lin, Yuyan Deng, Yang Gao, Ning Wang, Jinghao Zhou, Lingqiao Liu, Lei Zhang, Peng Wang

Given a query patch from a novel class, one-shot object detection aims to detect all instances of that class in a target image through the semantic similarity comparison.

Object object-detection +3

Center Prediction Loss for Re-identification

no code implementations • 30 Apr 2021 • Lu Yang, Yunlong Wang, Lingqiao Liu, Peng Wang, Lu Chi, Zehuan Yuan, Changhu Wang, Yanning Zhang

In this paper, we propose a new loss based on center predictivity, that is, a sample must be positioned in a location of the feature space such that from it we can roughly predict the location of the center of same-class samples.

Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle Re-Identification

no code implementations • 9 Mar 2021 • Lu Yang, Hongbang Liu, Jinghao Zhou, Lingqiao Liu, Lei Zhang, Peng Wang, Yanning Zhang

Learning cross-view consistent feature representation is the key for accurate vehicle Re-identification (ReID), since the visual appearance of vehicles changes significantly under different viewpoints.

Vehicle Re-Identification

Where to Look and How to Describe: Fashion Image Retrieval with an Attentional Heterogeneous Bilinear Network

no code implementations • 26 Oct 2020 • Haibo Su, Peng Wang, Lingqiao Liu, Hui Li, Zhen Li, Yanning Zhang

Fashion products typically feature in compositions of a variety of styles at different clothing parts.

Image Retrieval Retrieval

Contextualize Knowledge Bases with Transformer for End-to-end Task-Oriented Dialogue Systems

no code implementations • EMNLP 2021 • Yanjie Gou, Yinjie Lei, Lingqiao Liu, Yong Dai, Chunxu Shen

Incorporating knowledge bases (KBs) into end-to-end task-oriented dialogue systems is challenging, since it requires properly representing each KB entity, which is associated with both its KB context and the dialogue context.

Ranked #2 on Task-Oriented Dialogue Systems on KVRET

Response Generation Task-Oriented Dialogue Systems

Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks

no code implementations • ECCV 2020 • Yan Liu, Lingqiao Liu, Peng Wang, Pingping Zhang, Yinjie Lei

Most existing crowd counting systems rely on the availability of the object location annotation which can be expensive to obtain.

Crowd Counting

Towards Using Count-level Weak Supervision for Crowd Counting

no code implementations • 29 Feb 2020 • Yinjie Lei, Yan Liu, Pingping Zhang, Lingqiao Liu

Most existing crowd counting methods require object location-level annotation, i.e., placing a dot at the center of an object.

Crowd Counting

Hyperspectral Classification Based on 3D Asymmetric Inception Network with Data Fusion Transfer Learning

1 code implementation • 11 Feb 2020 • Haokui Zhang, Yu Liu, Bei Fang, Ying Li, Lingqiao Liu, Ian Reid

Hyperspectral image (HSI) classification has been improved with convolutional neural networks (CNNs) in recent years.

General Classification Transfer Learning

Semi-supervised Learning via Conditional Rotation Angle Estimation

no code implementations • 9 Jan 2020 • Hai-Ming Xu, Lingqiao Liu, Dong Gong

Our insight is that the prediction target in SemSL can be modeled as the latent factor in the predictor for the SlfSL target.

Self-Supervised Learning

To Balance or Not to Balance: A Simple-yet-Effective Approach for Learning with Long-Tailed Distributions

no code implementations • 10 Dec 2019 • Jun-Jie Zhang, Lingqiao Liu, Peng Wang, Chunhua Shen

Such imbalanced distribution causes a great challenge for learning a deep neural network, which can be boiled down into a dilemma: on the one hand, we prefer to increase the exposure of tail class samples to avoid the excessive dominance of head classes in the classifier training.

Auxiliary Learning Self-Supervised Learning

Improving Distant Supervised Relation Extraction by Dynamic Neural Network

no code implementations • 15 Nov 2019 • Yanjie Gou, Yinjie Lei, Lingqiao Liu, Pingping Zhang, Xi Peng

To account for this style shift, the model should adjust its parameters in accordance with entity types.

Relation Relation Extraction

Creating Auxiliary Representations from Charge Definitions for Criminal Charge Prediction

no code implementations • 12 Nov 2019 • Liangyi Kang, Jie Liu, Lingqiao Liu, Qinfeng Shi, Dan Ye

Thus, we propose to create auxiliary fact representations from charge definitions to augment fact descriptions representation.

Sentence

Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation

no code implementations • 28 Sep 2019 • Yu Liu, Lingqiao Liu, Haokui Zhang, Hamid Rezatofighi, Ian Reid

This paper tackles the problem of video object segmentation.

Meta-Learning Object +4

Structured Binary Neural Networks for Image Recognition

no code implementations • 22 Sep 2019 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Peng Chen, Lingqiao Liu, Ian Reid

Experiments on classification, semantic segmentation, and object detection tasks demonstrate the superior performance of the proposed methods over various quantized networks in the literature.

object-detection Object Detection +2

In defense of OSVOS

no code implementations • 19 Aug 2019 • Yu Liu, Yutong Dai, Anh-Dzung Doan, Lingqiao Liu, Ian Reid

By adding a common module, video loss, which we formulate with various forms of constraints (including weighted BCE loss, high-dimensional triplet loss, as well as a novel mixed instance-aware video loss), to train the parent network in step (2), the network is better prepared for step (3), i.e., online fine-tuning on the target instance.

Depth Estimation Object +6

Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations

no code implementations • 10 Aug 2019 • Bohan Zhuang, Jing Liu, Mingkui Tan, Lingqiao Liu, Ian Reid, Chunhua Shen

Furthermore, we propose a second progressive quantization scheme which gradually decreases the bit-width from high-precision to low-precision during training.

Knowledge Distillation Quantization
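
The idea of gradually lowering the bit-width during training can be illustrated with a minimal sketch. This is not the paper's scheme: the uniform quantizer, the function names `quantize_unit` and `bits_for_epoch`, and the epoch schedule are all assumptions chosen only to show the mechanism.

```python
def quantize_unit(x, bits):
    """Uniformly quantize x (clipped to [0, 1]) onto 2**bits evenly spaced levels."""
    levels = (1 << bits) - 1          # number of intervals between levels
    x = min(1.0, max(0.0, x))         # clip to the representable range
    return round(x * levels) / levels


def bits_for_epoch(epoch, schedule=((0, 8), (10, 4), (20, 2))):
    """Return the bit-width active at `epoch`.

    `schedule` is a hypothetical list of (start_epoch, bits) pairs,
    sorted by start_epoch, moving from high to low precision.
    """
    bits = schedule[0][1]
    for start, b in schedule:
        if epoch >= start:
            bits = b
    return bits


if __name__ == "__main__":
    for epoch in (0, 15, 25):
        b = bits_for_epoch(epoch)
        print(epoch, b, quantize_unit(0.4, b))
```

In a real training loop the quantizer would be applied to weights and activations in the forward pass, with a gradient approximation for the rounding step; that part is omitted here.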

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

no code implementations • 29 Jul 2019 • Damien Teney, Peng Wang, Jiewei Cao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

One of the primary challenges faced by deep learning is the degree to which current methods exploit superficial statistics and dataset bias, rather than learning to generalise over the specific representations they have experienced.

Visual Reasoning

Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection

5 code implementations • ICCV 2019 • Dong Gong, Lingqiao Liu, Vuong Le, Budhaditya Saha, Moussa Reda Mansour, Svetha Venkatesh, Anton Van Den Hengel

At the test stage, the learned memory will be fixed, and the reconstruction is obtained from a few selected memory records of the normal data.

Unsupervised Anomaly Detection

Training Quantized Neural Networks with a Full-precision Auxiliary Module

no code implementations • CVPR 2020 • Bohan Zhuang, Lingqiao Liu, Mingkui Tan, Chunhua Shen, Ian Reid

In this paper, we seek to tackle a challenge in training low-precision networks: the notorious difficulty in propagating gradient through a low-precision network due to the non-differentiable quantization function.

Image Classification object-detection +2

A Sketch Based 3D Shape Retrieval Approach Based on Efficient Deep Point-to-Subspace Metric Learning

no code implementations • 1 Mar 2019 • Yinjie Lei, Ziqin Zhou, Pingping Zhang, Yulan Guo, Zijun Ma, Lingqiao Liu

A sketch based 3D shape retrieval

3D Shape Classification 3D Shape Retrieval +2

RPC: A Large-Scale Retail Product Checkout Dataset

no code implementations • 22 Jan 2019 • Xiu-Shen Wei, Quan Cui, Lei Yang, Peng Wang, Lingqiao Liu

The main challenge of this problem comes from the large scale and fine-grained nature of the product categories, as well as the difficulty of collecting training images that reflect realistic checkout scenarios, given that products are continuously updated.

Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes

no code implementations • 12 Jan 2019 • Yu Liu, Lingqiao Liu, Hamid Rezatofighi, Thanh-Toan Do, Qinfeng Shi, Ian Reid

As the post-processing step for object detection, non-maximum suppression (GreedyNMS) has been widely used in most detectors for many years.

object-detection Object Detection
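
For readers unfamiliar with the baseline mentioned in this excerpt, standard greedy non-maximum suppression can be sketched as follows. This shows the conventional post-processing step, not the pairwise-relationship method the paper proposes; the box format `(x1, y1, x2, y2)` and the default threshold of 0.5 are assumptions.

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Return indices of kept boxes, in order of decreasing score.

    A box is kept only if its IoU with every already-kept box
    does not exceed `iou_threshold`.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    scores = [0.9, 0.8, 0.7]
    print(greedy_nms(boxes, scores))
```

Because suppression depends only on overlap, two genuinely distinct objects that overlap heavily are merged into one detection, which is the kind of failure that matters in crowded scenes.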

Mask-aware networks for crowd counting

no code implementations • 18 Dec 2018 • Shengqin Jiang, Xiaobo Lu, Yinjie Lei, Lingqiao Liu

Our rationale is that the mask prediction could be better modeled as a binary segmentation problem and the difficulty of estimating the density could be reduced if the mask is known.

Crowd Counting Object

Coarse-to-fine: A RNN-based hierarchical attention model for vehicle re-identification

no code implementations • 11 Dec 2018 • Xiu-Shen Wei, Chen-Lin Zhang, Lingqiao Liu, Chunhua Shen, Jianxin Wu

Inspired by the coarse-to-fine hierarchical process, we propose an end-to-end RNN-based Hierarchical Attention (RNN-HA) classification model for vehicle re-identification.

Vehicle Re-Identification

Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation

no code implementations • CVPR 2019 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

In this paper, we propose to train convolutional neural networks (CNNs) with both binarized weights and activations, leading to quantized models specifically for mobile devices with limited power capacity and computation resources.

General Classification Image Classification +2

Seeing Deeply and Bidirectionally: A Deep Learning Approach for Single Image Reflection Removal

1 code implementation • ECCV 2018 • Jie Yang, Dong Gong, Lingqiao Liu, Qinfeng Shi

Reflections often obstruct the desired scene when taking photos through glass panels.

Reflection Removal

Towards Effective Deep Embedding for Zero-Shot Learning

no code implementations • 30 Aug 2018 • Lei Zhang, Peng Wang, Lingqiao Liu, Chunhua Shen, Wei Wei, Yanning Zhang, Anton Van Den Hengel

Towards this goal, we present a simple but effective two-branch network to simultaneously map semantic descriptions and visual samples into a joint space, on which visual embeddings are forced to regress to their class-level semantic embeddings and the embeddings crossing classes are required to be distinguishable by a trainable classifier.

Zero-Shot Learning

Adaptive Importance Learning for Improving Lightweight Image Super-resolution Network

no code implementations • 5 Jun 2018 • Lei Zhang, Peng Wang, Chunhua Shen, Lingqiao Liu, Wei Wei, Yanning Zhang, Anton Van Den Hengel

In this study, we revisit this problem from an orthogonal view, and propose a novel learning strategy to maximize the pixel-wise fitting capacity of a given lightweight network architecture.

Image Super-Resolution

Piecewise classifier mappings: Learning fine-grained learners for novel categories with few examples

1 code implementation • 11 May 2018 • Xiu-Shen Wei, Peng Wang, Lingqiao Liu, Chunhua Shen, Jianxin Wu

To solve this problem, we propose an end-to-end trainable deep network which is inspired by the state-of-the-art fine-grained recognition model and is tailored for the FSFG task.

Few-Shot Learning Fine-Grained Image Recognition

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

no code implementations • 1 Nov 2017 • Yu Chen, Chunhua Shen, Hao Chen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang

In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity.

Pose Estimation

Towards Effective Low-bitwidth Convolutional Neural Networks

2 code implementations • CVPR 2018 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

This paper tackles the problem of training a deep convolutional neural network with both low-precision weights and low-bitwidth activations.

Quantization

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

1 code implementation • ICCV 2017 • Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

The proposed method still builds one classifier for one interaction (as per type (ii) above), but the classifier built is adaptive to context via weights which are context dependent.

Relationship Detection Visual Relationship Detection

Visually Aligned Word Embeddings for Improving Zero-shot Learning

no code implementations • 18 Jul 2017 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

To overcome this visual-semantic discrepancy, this work proposes an objective function to re-align the distributed word embeddings with visual information by learning a neural network to map it into a new representation called visually aligned word embedding (VAWE).

Multi-Attention Network for One Shot Learning

no code implementations • CVPR 2017 • Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton Van Den Hengel, Heng Tao Shen

One-shot learning is a challenging problem where the aim is to recognize a class identified by a single training image.

One-Shot Learning TAG +1

Weakly Supervised Semantic Segmentation Based on Web Image Co-segmentation

no code implementations • 25 May 2017 • Tong Shen, Guosheng Lin, Lingqiao Liu, Chunhua Shen, Ian Reid

Training a Fully Convolutional Network (FCN) for semantic segmentation requires a large number of masks with pixel level labelling, which involves a large amount of human labour and time for annotation.

Segmentation Weakly supervised Semantic Segmentation +1

Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation

2 code implementations • ICCV 2017 • Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang

In contrast, human vision is able to predict poses by exploiting geometric constraints of joint inter-connectivity.

Ranked #15 on Pose Estimation on MPII Human Pose

Pose Estimation

Towards Context-aware Interaction Recognition

no code implementations • 18 Mar 2017 • Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

Recognizing how objects interact with each other is a crucial task in visual recognition.

Deep Learning Features at Scale for Visual Place Recognition

no code implementations • 18 Jan 2017 • Zetao Chen, Adam Jacobson, Niko Sunderhauf, Ben Upcroft, Lingqiao Liu, Chunhua Shen, Ian Reid, Michael Milford

The success of deep learning techniques in the computer vision domain has triggered a range of initial investigations into their utility for visual place recognition, all using generic features from networks that were trained for other types of recognition tasks.

Visual Place Recognition

From Motion Blur to Motion Flow: a Deep Learning Solution for Removing Heterogeneous Motion Blur

no code implementations • CVPR 2017 • Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton Van Den Hengel, Qinfeng Shi

The critical observation underpinning our approach is thus that learning the motion flow instead allows the model to focus on the cause of the blur, irrespective of the image content.

Paper
Add Code

Attend in groups: a weakly-supervised deep learning framework for learning from web data

no code implementations • CVPR 2017 • Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid

Large-scale datasets have driven the rapid development of deep neural networks for visual recognition.

Paper
Add Code

Sequential Person Recognition in Photo Albums with a Recurrent Network

no code implementations • CVPR 2017 • Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

In this work, we propose to model the relational information between people as a sequence prediction task.

Person Recognition

Paper
Add Code

Graph-Structured Representations for Visual Question Answering

no code implementations • CVPR 2017 • Damien Teney, Lingqiao Liu, Anton Van Den Hengel

This paper proposes to improve visual question answering (VQA) with structured representations of both scene contents and questions.

Ranked #1 on Visual Question Answering (VQA) on COCO Visual Question Answering (VQA) abstract images 1.0 open ended

Multiple-choice Question Answering +1

Paper
Add Code

Exploiting Temporal Information for DCNN-based Fine-Grained Object Classification

no code implementations • 1 Aug 2016 • ZongYuan Ge, Chris McCool, Conrad Sanderson, Peng Wang, Lingqiao Liu, Ian Reid, Peter Corke

Fine-grained classification is a relatively new field that has concentrated on using information from a single image, while ignoring the enormous potential of using video data to improve classification.

Classification General Classification

Paper
Add Code

Where to Focus: Query Adaptive Matching for Instance Retrieval Using Convolutional Feature Maps

no code implementations • 22 Jun 2016 • Jiewei Cao, Lingqiao Liu, Peng Wang, Zi Huang, Chunhua Shen, Heng Tao Shen

Instance retrieval requires one to search for images that contain a particular object within a large corpus.

Retrieval

Paper
Add Code

What's Wrong With That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution

no code implementations • CVPR 2016 • Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton Van Den Hengel, Heng Tao Shen

The key observation motivating our approach is that "regular object" images, "unusual object" images and "other objects" images exhibit different region-level scores in terms of both the score values and the spatial distributions.

Gaussian Processes Object +2

Paper
Add Code

Less is more: zero-shot learning from online textual documents with noise suppression

no code implementations • CVPR 2016 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

Classifying a visual concept merely from its associated online textual source, such as a Wikipedia article, is an attractive research topic in zero-shot learning because it alleviates the burden of manually collecting semantic attributes.

Zero-Shot Learning

Paper
Add Code

Hi Detector, What's Wrong with that Object? Identifying Irregular Object From Images by Modelling the Detection Score Distribution

no code implementations • 14 Feb 2016 • Peng Wang, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel, Heng Tao Shen

To address this problem, we propose a novel approach by inspecting the distribution of the detection scores at multiple image regions based on the detector trained from the "regular object" and "other objects".

Gaussian Processes Object

Paper
Add Code

Order-aware Convolutional Pooling for Video Based Action Recognition

no code implementations • 31 Jan 2016 • Peng Wang, Lingqiao Liu, Chunhua Shen, Heng Tao Shen

Most video based action recognition approaches create the video-level representation by temporally pooling the features extracted at each frame.

Action Recognition Temporal Action Localization

Paper
Add Code

Compositional Model based Fisher Vector Coding for Image Classification

1 code implementation • 16 Jan 2016 • Lingqiao Liu, Peng Wang, Chunhua Shen, Lei Wang, Anton Van Den Hengel, Chao Wang, Heng Tao Shen

To handle this limitation, in this paper we break the convention which assumes that a local feature is drawn from one of a few Gaussian distributions.

Classification General Classification +1

Paper
Code

Cross-convolutional-layer Pooling for Image Recognition

no code implementations • 4 Oct 2015 • Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

Most of these studies adopt activations from a single DCNN layer, usually the fully-connected layer, as the image representation.

General Classification Image Classification

Paper
Add Code

Learning Discriminative Bayesian Networks from High-dimensional Continuous Neuroimaging Data

no code implementations • 23 Jun 2015 • Luping Zhou, Lei Wang, Lingqiao Liu, Philip Ogunbona, Dinggang Shen

This brings two general discriminative learning frameworks for Gaussian Bayesian networks (GBN).

Vocal Bursts Intensity Prediction

Paper
Add Code

Mining Mid-level Visual Patterns with Deep CNN Activations

1 code implementation • 21 Jun 2015 • Yao Li, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

The purpose of mid-level visual element discovery is to find clusters of image patches that are both representative and discriminative.

Paper
Code

What value do explicit high level concepts have in vision to language problems?

1 code implementation • CVPR 2016 • Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony Dick, Anton Van Den Hengel

Much of the recent progress in Vision-to-Language (V2L) problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).

Image Captioning Question Answering +1

Paper
Code

Learning discriminative trajectorylet detector sets for accurate skeleton-based action recognition

no code implementations • 20 Apr 2015 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

The introduction of low-cost RGB-D sensors has promoted the research in skeleton-based human action recognition.

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

Temporal Pyramid Pooling Based Convolutional Neural Networks for Action Recognition

no code implementations • 4 Mar 2015 • Peng Wang, Yuanzhouhan Cao, Chunhua Shen, Lingqiao Liu, Heng Tao Shen

One challenge is that video contains a varying number of frames, which is incompatible with the standard input format of CNNs.

Action Recognition Image Classification +1

Paper
Add Code

The Treasure beneath Convolutional Layers: Cross-convolutional-layer Pooling for Image Classification

1 code implementation • CVPR 2015 • Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

This paper, however, advocates that, if used appropriately, convolutional layer activations can be turned into a powerful image representation which enjoys many advantages over fully-connected layer activations.

General Classification Image Classification

Paper
Code

Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors

no code implementations • NeurIPS 2014 • Lingqiao Liu, Chunhua Shen, Lei Wang, Anton Van Den Hengel, Chao Wang

By calculating the gradient vector of the proposed model, we derive a new Fisher vector encoding strategy, termed Sparse Coding based Fisher Vector Coding (SCFVC).

Fine-Grained Image Classification General Classification +2

Paper
Add Code

Mid-level Deep Pattern Mining

no code implementations • CVPR 2015 • Yao Li, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

We apply our approach to scene and object classification tasks, and demonstrate that our approach outperforms all previous works on mid-level visual element discovery by a sizeable margin with far fewer elements being used.

Paper
Add Code

A Generalized Probabilistic Framework for Compact Codebook Creation

no code implementations • 30 Jan 2014 • Lingqiao Liu, Lei Wang, Chunhua Shen

In the third criterion, which shows the best merging performance, we propose a max-margin-based parameter estimation method and apply it with multinomial distribution.

Paper
Add Code

Discriminative Brain Effective Connectivity Analysis for Alzheimer's Disease: A Kernel Learning Approach upon Sparse Gaussian Bayesian Network

no code implementations • CVPR 2013 • Luping Zhou, Lei Wang, Lingqiao Liu, Philip Ogunbona, Dinggang Shen

Analyzing brain networks from neuroimages is becoming a promising approach in identifying novel connectivity-based biomarkers for Alzheimer's disease (AD).

Paper
Add Code
