Search Results for author: Jianping Fan

Found 35 papers, 15 papers with code

Correlative Multi-Label Multi-Instance Image Annotation

no code implementations • IEEE International Conference on Computer Vision 2011 • Xiangyang Xue, Wei zhang, Jie Zhang, Bin Wu, Jianping Fan, Yao Lu

The cross-level label coherence en-codes the consistency between the labels at the image leveland the labels at the region level.

Paper
Add Code

Efficiently Detecting Overlapping Communities through Seeding and Semi-Supervised Learning

no code implementations • 23 Jan 2014 • Changxing Shang, Shengzhong Feng, Zhongying Zhao, Jianping Fan

This paper proposes a new method that transforms a network into a corpus where each edge is treated as a document, and all nodes of the network are treated as terms of the corpus.

Clustering Community Detection

Paper
Add Code

Re-ranking Object Proposals for Object Detection in Automatic Driving

no code implementations • 19 May 2016 • Zhun Zhong, Mingyi Lei, Shaozi Li, Jianping Fan

In this paper, we propose a semantic, class-specific approach to re-rank object proposals, which can consistently improve the recall performance even with less proposals.

Object object-detection +3

Paper
Add Code

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition

no code implementations • 24 Jun 2017 • Tianyi Zhao, Jun Yu, Zhenzhong Kuang, Wei zhang, Jianping Fan

In this paper, a deep mixture of diverse experts algorithm is developed for seamlessly combining a set of base deep CNNs (convolutional neural networks) with diverse outputs (task spaces), e. g., such base deep CNNs are trained to recognize different subsets of tens of thousands of atomic object classes.

Multi-Task Learning Object +1

Paper
Add Code

Embedding Visual Hierarchy with Deep Networks for Large-Scale Visual Recognition

no code implementations • 8 Jul 2017 • Tianyi Zhao, Baopeng Zhang, Wei zhang, Ning Zhou, Jun Yu, Jianping Fan

Our LMM model can provide an end-to-end approach for jointly learning: (a) the deep networks to extract more discriminative deep features for image and object class representation; (b) the tree classifier for recognizing large numbers of object classes hierarchically; and (c) the visual hierarchy adaptation for achieving more accurate indexing of large numbers of object classes hierarchically.

Object Object Recognition

Paper
Add Code

Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering

6 code implementations • ICCV 2017 • Zhou Yu, Jun Yu, Jianping Fan, DaCheng Tao

For multi-modal feature fusion, here we develop a Multi-modal Factorized Bilinear (MFB) pooling approach to efficiently and effectively combine multi-modal features, which results in superior performance for VQA compared with other bilinear pooling approaches.

Question Answering Visual Question Answering

183

Paper
Code

Beyond Bilinear: Generalized Multimodal Factorized High-order Pooling for Visual Question Answering

2 code implementations • 10 Aug 2017 • Zhou Yu, Jun Yu, Chenchao Xiang, Jianping Fan, DaCheng Tao

For fine-grained image and question representations, a `co-attention' mechanism is developed by using a deep neural network architecture to jointly learn the attentions for both the image and the question, which can allow us to reduce the irrelevant features effectively and obtain more discriminative features for image and question representations.

Question Answering Visual Question Answering +1

183

Paper
Code

Deep Boosting of Diverse Experts

no code implementations • ICLR 2018 • Wei Zhang, Qiuyu Chen, Jun Yu, Jianping Fan

In this paper, a deep boosting algorithm is developed to learn more discriminative ensemble classifier by seamlessly combining a set of base deep CNNs (base experts) with diverse capabilities, e. g., these base deep CNNs are sequentially trained to recognize a set of object classes in an easy-to-hard way according to their learning complexities.

Object Recognition

Paper
Add Code

NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification

1 code implementation • 12 Nov 2018 • Rongcheng Lin, Jing Xiao, Jianping Fan

This paper introduces a fast and efficient network architecture, NeXtVLAD, to aggregate frame-level features into a compact feature vector for large-scale video classification.

Efficient Neural Network General Classification +2

204

Paper
Code

Learning Competitive and Discriminative Reconstructions for Anomaly Detection

no code implementations • 17 Mar 2019 • Kai Tian, Shuigeng Zhou, Jianping Fan, Jihong Guan

Most of the existing methods for anomaly detection use only positive data to learn the data distribution, thus they usually need a pre-defined threshold at the detection stage to determine whether a test instance is an outlier.

Anomaly Detection

Paper
Add Code

Imitating Targets from all sides: An Unsupervised Transfer Learning method for Person Re-identification

no code implementations • 10 Apr 2019 • Jiajie Tian, Zhu Teng, Rui Li, Yan Li, Baopeng Zhang, Jianping Fan

Person re-identification (Re-ID) models usually show a limited performance when they are trained on one dataset and tested on another dataset due to the inter-dataset bias (e. g. completely different identities and backgrounds) and the intra-dataset difference (e. g. camera invariance).

Person Re-Identification Transfer Learning

Paper
Add Code

MOD: A Deep Mixture Model with Online Knowledge Distillation for Large Scale Video Temporal Concept Localization

1 code implementation • 27 Oct 2019 • Rongcheng Lin, Jing Xiao, Jianping Fan

In this paper, we present and discuss a deep mixture model with online knowledge distillation (MOD) for large-scale video temporal concept localization, which is ranked 3rd in the 3rd YouTube-8M Video Understanding Challenge.

Knowledge Distillation Video Understanding

Paper
Code

Adaptive Fractional Dilated Convolution Network for Image Aesthetics Assessment

no code implementations • CVPR 2020 • Qiuyu Chen, Wei zhang, Ning Zhou, Peng Lei, Yi Xu, Yu Zheng, Jianping Fan

Specifically, the fractional dilated kernel is adaptively constructed according to the image aspect ratios, where the interpolation of nearest two integers dilated kernels is used to cope with the misalignment of fractional sampling.

Paper
Add Code

Boundary-aware Context Neural Network for Medical Image Segmentation

1 code implementation • 3 May 2020 • Ruxin Wang, Shuyuan Chen, Chaojie Ji, Jianping Fan, Ye Li

In this paper, we formulate a boundary-aware context neural network (BA-Net) for 2D medical image segmentation to capture richer context and preserve fine spatial information.

Image Segmentation Medical Image Segmentation +3

Paper
Code

Automatic Image Labelling at Pixel Level

no code implementations • 15 Jul 2020 • Xiang Zhang, Wei zhang, Jinye Peng, Jianping Fan

A Guided Filter Network (GFN) is first developed to learn the segmentation knowledge from a source domain, and such GFN then transfers such segmentation knowledge to generate coarse object masks in the target domain.

Image Segmentation Object +2

Paper
Add Code

Cluster-level Feature Alignment for Person Re-identification

1 code implementation • 15 Aug 2020 • Qiuyu Chen, Wei zhang, Jianping Fan

Instance-level alignment is widely exploited for person re-identification, e. g. spatial alignment, latent semantic alignment and triplet alignment.

Ranked #30 on Person Re-Identification on DukeMTMC-reID

Person Re-Identification

Paper
Code

Trust It or Not: Confidence-Guided Automatic Radiology Report Generation

no code implementations • 21 Jun 2021 • Yixin Wang, Zihao Lin, Zhe Xu, Haoyu Dong, Jiang Tian, Jie Luo, Zhongchao shi, Yang Zhang, Jianping Fan, Zhiqiang He

Experimental results have demonstrated that the proposed method for model uncertainty characterization and estimation can produce more reliable confidence scores for radiology report generation, and the modified loss function, which takes into account the uncertainties, leads to better model performance on two public radiology report datasets.

Decision Making Image Captioning +2

Paper
Add Code

ACN: Adversarial Co-training Network for Brain Tumor Segmentation with Missing Modalities

2 code implementations • 28 Jun 2021 • Yixin Wang, Yang Zhang, Yang Liu, Zihao Lin, Jiang Tian, Cheng Zhong, Zhongchao shi, Jianping Fan, Zhiqiang He

Specifically, ACN adopts a novel co-training network, which enables a coupled learning process for both full modality and missing modality to supplement each other's domain and feature representations, and more importantly, to recover the `missing' information of absent modalities.

Brain Tumor Segmentation Transfer Learning +1

Paper
Code

A Survey of Visual Transformers

1 code implementation • 11 Nov 2021 • Yang Liu, Yao Zhang, Yixin Wang, Feng Hou, Jin Yuan, Jiang Tian, Yang Zhang, Zhongchao shi, Jianping Fan, Zhiqiang He

Transformer, an attention-based encoder-decoder model, has already revolutionized the field of natural language processing (NLP).

225

Paper
Code

Graph Attention Transformer Network for Multi-Label Image Classification

1 code implementation • 8 Mar 2022 • Jin Yuan, Shikai Chen, Yao Zhang, Zhongchao shi, Xin Geng, Jianping Fan, Yong Rui

Subsequently, we design the graph attention transformer layer to transfer this adjacency matrix to adapt to the current domain.

Classification Graph Attention +2

Paper
Code

Bilaterally Slimmable Transformer for Elastic and Efficient Visual Question Answering

1 code implementation • 24 Mar 2022 • Zhou Yu, Zitian Jin, Jun Yu, Mingliang Xu, Hongbo Wang, Jianping Fan

Recent advances in Transformer architectures [1] have brought remarkable improvements to visual question answering (VQA).

Question Answering Visual Question Answering

Paper
Code

Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation

1 code implementation • 8 Apr 2022 • Jin Yuan, Feng Hou, Yangzhou Du, Zhongchao shi, Xin Geng, Jianping Fan, Yong Rui

Domain adaptation (DA) tries to tackle the scenarios when the test data does not fully follow the same distribution of the training data, and multi-source domain adaptation (MSDA) is very attractive for real world applications.

Domain Adaptation Self-Supervised Learning +1

Paper
Code

SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency

1 code implementation • CVPR 2023 • Yang Liu, Yao Zhang, Yixin Wang, Yang Zhang, Jiang Tian, Zhongchao shi, Jianping Fan, Zhiqiang He

To bridge the gap between the reference points of salient queries and Transformer detectors, we propose SAlient Point-based DETR (SAP-DETR) by treating object detection as a transformation from salient points to instance objects.

Object object-detection +1

Paper
Code

Learning to Learn Domain-invariant Parameters for Domain Generalization

no code implementations • 4 Nov 2022 • Feng Hou, Yao Zhang, Yang Liu, Jin Yuan, Cheng Zhong, Yang Zhang, Zhongchao shi, Jianping Fan, Zhiqiang He

Due to domain shift, deep neural networks (DNNs) usually fail to generalize well on unknown test data in practice.

Domain Generalization

Paper
Add Code

Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification

1 code implementation • ICCV 2023 • Jiangming Shi, Yachao Zhang, Xiangbo Yin, Yuan Xie, Zhizhong Zhang, Jianping Fan, Zhongchao shi, Yanyun Qu

Visible-infrared person re-identification (VI-ReID) aims to match a specific person from a gallery of images captured from non-overlapping visible and infrared cameras.

Person Re-Identification Pseudo Label

Paper
Code

Cross-domain recommendation via user interest alignment

no code implementations • 26 Jan 2023 • Chuang Zhao, Hongke Zhao, Ming He, Jian Zhang, Jianping Fan

Specifically, we first construct a unified cross-domain heterogeneous graph and redefine the message passing mechanism of graph convolutional networks to capture high-order similarity of users and items across domains.

Recommendation Systems

Paper
Add Code

GLOW: Global Layout Aware Attacks on Object Detection

no code implementations • 27 Feb 2023 • Buyu Liu, BaoJun, Jianping Fan, Xi Peng, Kui Ren, Jun Yu

More desired attacks, to this end, should be able to fool defenses with such consistency checks.

Object object-detection +1

Paper
Add Code

ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos

1 code implementation • CVPR 2023 • Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu

A recent benchmark AGQA poses a promising paradigm to generate QA pairs automatically from pre-annotated scene graphs, enabling it to measure diverse reasoning abilities with granular control.

Question Answering Spatio-temporal Scene Graphs +1

Paper
Code

Multi-task Paired Masking with Alignment Modeling for Medical Vision-Language Pre-training

no code implementations • 13 May 2023 • Ke Zhang, Yan Yang, Jun Yu, Hanliang Jiang, Jianping Fan, Qingming Huang, Weidong Han

To address this limitation, we propose a unified Med-VLP framework based on Multi-task Paired Masking with Alignment (MPMA) to integrate the cross-modal alignment task into the joint image-text reconstruction framework to achieve more comprehensive cross-modal interaction, while a Global and Local Alignment (GLA) module is designed to assist self-supervised paradigm in obtaining semantic representations with rich domain knowledge.

Paper
Add Code

Epistemic Graph: A Plug-And-Play Module For Hybrid Representation Learning

no code implementations • 30 May 2023 • Jin Yuan, Yang Zhang, Yangzhou Du, Zhongchao shi, Xin Geng, Jianping Fan, Yong Rui

In this paper, a novel Epistemic Graph Layer (EGLayer) is introduced to enable hybrid learning, enhancing the exchange of information between deep features and a structured knowledge graph.

Few-Shot Learning Knowledge Graphs +1

Paper
Add Code

Image Clustering with External Guidance

no code implementations • 18 Oct 2023 • Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Jianping Fan, Xi Peng

The core of clustering is incorporating prior knowledge to construct supervision signals.

Clustering Image Clustering

Paper
Add Code

ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields

no code implementations • 19 Dec 2023 • Xiang Feng, Yongbo He, YuBo Wang, Chengkai Wang, Zhenzhong Kuang, Jiajun Ding, Feiwei Qin, Jun Yu, Jianping Fan

This framework aims to guide the NeRF model to synthesize high-resolution novel views via single-scene internal learning rather than requiring any external high-resolution training data.

Inverse Rendering Super-Resolution

Paper
Add Code

MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property

1 code implementation • 26 Feb 2024 • Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, BoWen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li, Xiping Hu, Ye Li, Jianping Fan

In this paper, we contribute a new benchmark, the first Multilingual-oriented quiZ on Intellectual Property (MoZIP), for the evaluation of LLMs in the IP domain.

Language Modelling Large Language Model +2

Paper
Code

DomainVerse: A Benchmark Towards Real-World Distribution Shifts For Tuning-Free Adaptive Domain Generalization

no code implementations • 5 Mar 2024 • Feng Hou, Jin Yuan, Ying Yang, Yang Liu, Yang Zhang, Cheng Zhong, Zhongchao shi, Jianping Fan, Yong Rui, Zhiqiang He

With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain task changes to directly adapt the pre-trained source model to arbitrary target domains equipped with prior domain knowledge, and we name this task Adaptive Domain Generalization (ADG).

Domain Generalization

Paper
Add Code

SRGS: Super-Resolution 3D Gaussian Splatting

no code implementations • 16 Apr 2024 • Xiang Feng, Yongbo He, YuBo Wang, Yan Yang, Zhenzhong Kuang, Yu Jun, Jianping Fan, Jiajun Ding

This approach relies on the representation power of Gaussian primitives to provide a high-quality rendering.

Novel View Synthesis Super-Resolution

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.