Search Results for author: Jianbo Jiao

Found 39 papers, 22 papers with code

Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation

1 code implementation • 21 Apr 2024 • Gensheng Pei, Yazhou Yao, Jianbo Jiao, Wenguan Wang, Liqiang Nie, Jinhui Tang

To achieve this objective, we present a unified self-supervised approach to learn visual representations of static-dynamic feature similarity.

Paper
Code

360+x: A Panoptic Multi-modal Scene Understanding Dataset

no code implementations • 1 Apr 2024 • Hao Chen, Yuqi Hou, Chenyuan Qu, Irene Testini, Xiaohan Hong, Jianbo Jiao

While many existing datasets focus on scene understanding from a certain perspective (e. g. egocentric or third-person views), our dataset offers a panoptic perspective (i. e. multiple viewpoints with multiple data modalities).

Scene Understanding

Paper
Add Code

PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling

1 code implementation • 24 Mar 2024 • Xiaoyun Zheng, Liwei Liao, Xufeng Li, Jianbo Jiao, Rongjie Wang, Feng Gao, Shiqi Wang, Ronggang Wang

To facilitate the development of these fields, in this paper, we present PKU-DyMVHumans, a versatile human-centric dataset for high-fidelity reconstruction and rendering of dynamic human scenarios from dense multi-view videos.

Novel View Synthesis

Paper
Code

Show from Tell: Audio-Visual Modelling in Clinical Settings

no code implementations • 25 Oct 2023 • Jianbo Jiao, Mohammad Alsharid, Lior Drukker, Aris T. Papageorghiou, Andrew Zisserman, J. Alison Noble

Auditory and visual signals usually present together and correlate with each other, not only in natural environments but also in clinical settings.

Self-Supervised Learning

Paper
Add Code

CoinSeg: Contrast Inter- and Intra- Class Representations for Incremental Segmentation

1 code implementation • ICCV 2023 • Zekang Zhang, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei

However, most state-of-the-art methods use the freeze strategy for stability, which compromises the model's plasticity. In contrast, releasing parameter training for plasticity could lead to the best performance for all categories, but this requires discriminative feature representation. Therefore, we prioritize the model's plasticity and propose the Contrast inter- and intra-class representations for Incremental Segmentation (CoinSeg), which pursues discriminative representations for flexible parameter tuning.

Class-Incremental Semantic Segmentation

Paper
Code

Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training

no code implementations • 25 Sep 2023 • Jiangliu Wang, Jianbo Jiao, Yibing Song, Stephen James, Zhan Tong, Chongjian Ge, Pieter Abbeel, Yun-hui Liu

This work aims to improve unsupervised audio-visual pre-training.

Contrastive Learning Data Augmentation

Paper
Add Code

Multi-view Self-supervised Disentanglement for General Image Denoising

1 code implementation • ICCV 2023 • Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao

It is understandable as the model is designed to learn paired mapping (e. g. from a noisy image to its clean version).

Ranked #1 on Denoising on CBSD68 sigm75

Disentanglement Image Denoising +1

Paper
Code

Med-Tuning: Parameter-Efficient Transfer Learning with Fine-Grained Feature Enhancement for Medical Volumetric Segmentation

no code implementations • 21 Apr 2023 • Wenxuan Wang, Jiachen Shen, Chen Chen, Jianbo Jiao, Jing Liu, Yan Zhang, Shanshan Song, Jiangyun Li

In this paper, we present the study on parameter-efficient transfer learning for medical volumetric segmentation and propose a new framework named Med-Tuning based on intra-stage feature enhancement and inter-stage feature interaction.

Segmentation Transfer Learning

Paper
Add Code

FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation

1 code implementation • 21 Apr 2023 • Wenxuan Wang, Jing Wang, Chen Chen, Jianbo Jiao, Yuanxiu Cai, Shanshan Song, Jiangyun Li

The research community has witnessed the powerful potential of self-supervised Masked Image Modeling (MIM), which enables the models capable of learning visual representation from unlabeled data.

Image Segmentation Medical Image Segmentation +2

Paper
Code

Revisiting Implicit Neural Representations in Low-Level Vision

1 code implementation • 20 Apr 2023 • Wentian Xu, Jianbo Jiao

Implicit Neural Representation (INR) has been emerging in computer vision in recent years.

Deblurring Image Denoising +2

Paper
Code

Disentangled Pre-training for Image Matting

1 code implementation • 3 Apr 2023 • Yanda Li, Zilong Huang, Gang Yu, Ling Chen, Yunchao Wei, Jianbo Jiao

The pre-training task is designed in a similar manner as image matting, where random trimap and alpha matte are generated to achieve an image disentanglement objective.

Disentanglement Image Matting

Paper
Code

Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion

1 code implementation • ICCV 2023 • Yutao Jiang, Yang Zhou, Yuan Liang, Wenxi Liu, Jianbo Jiao, Yuhui Quan, Shengfeng He

To address the above issues, we propose Diffuse3D which employs a pre-trained diffusion model for global synthesis, while amending the model to activate depth-aware inference.

Denoising Novel View Synthesis

Paper
Code

CL-MVSNet: Unsupervised Multi-View Stereo with Dual-Level Contrastive Learning

no code implementations • ICCV 2023 • Kaiqiang Xiong, Rui Peng, Zhe Zhang, Tianxing Feng, Jianbo Jiao, Feng Gao, Ronggang Wang

On the one hand, we present an image-level contrastive branch to guide the model to acquire more context awareness, thus leading to more complete depth estimation in indistinguishable regions.

Contrastive Learning Depth Estimation

Paper
Add Code

Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation

1 code implementation • 13 Nov 2022 • Zekang Zhang, Guangyu Gao, Zhiyuan Fang, Jianbo Jiao, Yunchao Wei

Our MicroSeg is based on the assumption that background regions with strong objectness possibly belong to those concepts in the historical or future stages.

Class-Incremental Semantic Segmentation Continual Learning +1

Paper
Code

Anatomy-Aware Contrastive Representation Learning for Fetal Ultrasound

1 code implementation • 22 Aug 2022 • Zeyu Fu, Jianbo Jiao, Robail Yasrab, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble

The proposed approach is demonstrated for automated fetal ultrasound imaging tasks, enabling the positive pairs from the same or different ultrasound scans that are anatomically similar to be pulled together and thus improving the representation learning.

Anatomy Contrastive Learning +2

Paper
Code

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning

1 code implementation • NeurIPS 2021 • Chongjian Ge, Youwei Liang, Yibing Song, Jianbo Jiao, Jue Wang, Ping Luo

Motivated by the transformers that explore visual attention effectively in recognition scenarios, we propose a CNN Attention REvitalization (CARE) framework to train attentive CNN encoders guided by transformers in SSL.

Image Classification object-detection +3

116

Paper
Code

Quantised Transforming Auto-Encoders: Achieving Equivariance to Arbitrary Transformations in Deep Networks

no code implementations • 25 Nov 2021 • Jianbo Jiao, João F. Henriques

In this work we investigate how to achieve equivariance to input transformations in deep networks, purely from data, without being given a model of those transformations.

Pose Estimation Translation

Paper
Add Code

Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning

1 code implementation • 11 Oct 2021 • Chongjian Ge, Youwei Liang, Yibing Song, Jianbo Jiao, Jue Wang, Ping Luo

Image Classification object-detection +3

116

Paper
Code

Facial Anatomical Landmark Detection using Regularized Transfer Learning with Application to Fetal Alcohol Syndrome Recognition

no code implementations • 12 Sep 2021 • Zeyu Fu, Jianbo Jiao, Michael Suttie, J. Alison Noble

This imaging application is characterized by large variations in data appearance and limited availability of labeled data.

Facial Landmark Detection Transfer Learning

Paper
Add Code

Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation

1 code implementation • 8 Jun 2021 • Bingfeng Zhang, Jimin Xiao, Jianbo Jiao, Yunchao Wei, Yao Zhao

More importantly, our approach can be readily applied to bounding box supervised instance segmentation task or other weakly supervised semantic segmentation tasks, with state-of-the-art or comparable performance among almot all weakly supervised tasks on PASCAL VOC or COCO dataset.

Box-supervised Instance Segmentation Model Optimization +3

Paper
Code

Scene Context-Aware Salient Object Detection

1 code implementation • ICCV 2021 • Avishek Siris, Jianbo Jiao, Gary K.L. Tam, Xianghua Xie, Rynson W.H. Lau

To our knowledge, such high-level semantic contextual information of image scenes is under-explored for saliency detection in the literature.

Object object-detection +3

Paper
Code

Cross-Task Representation Learning for Anatomical Landmark Detection

no code implementations • 28 Sep 2020 • Zeyu Fu, Jianbo Jiao, Michael Suttie, J. Alison Noble

The main idea of the proposed method is to retain the feature representations of the source model on the target task data, and to leverage them as an additional source of supervisory signals for regularizing the target model learning, thereby improving its performance under limited training samples.

Face Recognition Representation Learning +1

Paper
Add Code

Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics

2 code implementations • 31 Aug 2020 • Jiangliu Wang, Jianbo Jiao, Linchao Bao, Shengfeng He, Wei Liu, Yun-hui Liu

Specifically, given an unlabeled video clip, we compute a series of spatio-temporal statistical summaries, such as the spatial location and dominant direction of the largest motion, the spatial location and dominant color of the largest color diversity along the temporal axis, etc.

Action Recognition Representation Learning +3

Paper
Code

Self-Supervised Ultrasound to MRI Fetal Brain Image Synthesis

1 code implementation • 19 Aug 2020 • Jianbo Jiao, Ana I. L. Namburete, Aris T. Papageorghiou, J. Alison Noble

To regularise the anatomical structures between US and MRI during synthesis, we further propose an adversarial structural constraint.

Image Generation

Paper
Code

Self-supervised Contrastive Video-Speech Representation Learning for Ultrasound

no code implementations • 14 Aug 2020 • Jianbo Jiao, Yifan Cai, Mohammad Alsharid, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble

For this case, we assume that there is a high correlation between the ultrasound video and the corresponding narrative speech audio of the sonographer.

Contrastive Learning Gaze Prediction +1

Paper
Add Code

Self-supervised Video Representation Learning by Pace Prediction

1 code implementation • ECCV 2020 • Jiangliu Wang, Jianbo Jiao, Yun-hui Liu

This paper addresses the problem of self-supervised video representation learning from a new perspective -- by video pace prediction.

Action Recognition Contrastive Learning +3

Paper
Code

Referring Image Segmentation by Generative Adversarial Learning

no code implementations • IEEE 2020 • Shuang Qiu, Yao Zhao, Jianbo Jiao, Yunchao Wei, Shikui Wei

To this end, we propose to train the referring image segmentation model in a generative adversarial fashion, which well addresses the distribution similarity problem.

Image Segmentation Referring Expression +4

Paper
Add Code

Laplacian Denoising Autoencoder

no code implementations • 30 Mar 2020 • Jianbo Jiao, Linchao Bao, Yunchao Wei, Shengfeng He, Honghui Shi, Rynson Lau, Thomas S. Huang

This can be naturally generalized to span multiple scales with a Laplacian pyramid representation of the input data.

Denoising Self-Supervised Learning

Paper
Add Code

Unified Image and Video Saliency Modeling

2 code implementations • ECCV 2020 • Richard Droste, Jianbo Jiao, J. Alison Noble

We evaluate our method on the video saliency datasets DHF1K, Hollywood-2 and UCF-Sports, and the image saliency datasets SALICON and MIT300.

Ranked #4 on Video Saliency Detection on MSU Video Saliency Prediction

Domain Adaptation Saliency Prediction +1

116

Paper
Code

Self-supervised Representation Learning for Ultrasound Video

no code implementations • 28 Feb 2020 • Jianbo Jiao, Richard Droste, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble

Therefore, there is significant interest in learning representations from unlabelled raw data.

Anatomy Representation Learning +2

Paper
Add Code

Anatomy-Aware Self-supervised Fetal MRI Synthesis from Unpaired Ultrasound Images

no code implementations • 8 Sep 2019 • Jianbo Jiao, Ana I. L. Namburete, Aris T. Papageorghiou, J. Alison Noble

The feasibility of the approach to produce realistic looking MR images is demonstrated quantitatively and with a qualitative evaluation compared to real fetal MR images.

Anatomy

Paper
Add Code

Geometry-Aware Distillation for Indoor Semantic Segmentation

1 code implementation • CVPR 2019 • Jianbo Jiao, Yunchao Wei, Zequn Jie, Honghui Shi, Rynson W.H. Lau, Thomas S. Huang

It has been shown that jointly reasoning the 2D appearance and 3D information from RGB-D domains is beneficial to indoor scene semantic segmentation.

Segmentation Semantic Segmentation

Paper
Code

Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics

1 code implementation • CVPR 2019 • Jiangliu Wang, Jianbo Jiao, Linchao Bao, Shengfeng He, Yun-hui Liu, Wei Liu

We conduct extensive experiments with C3D to validate the effectiveness of our proposed approach.

Ranked #47 on Self-Supervised Action Recognition on HMDB51

General Classification Representation Learning +2

Paper
Code

When AWGN-based Denoiser Meets Real Noises

2 code implementations • 6 Apr 2019 • Yuqian Zhou, Jianbo Jiao, Haibin Huang, Yang Wang, Jue Wang, Honghui Shi, Thomas Huang

In this paper, we propose a novel approach to boost the performance of a real image denoiser which is trained only with synthetic pixel-independent noise data dominated by AWGN.

Ranked #2 on Denoising on Darmstadt Noise Dataset

Denoising

1,361

Paper
Code

Connecting Image Denoising and High-Level Vision Tasks via Deep Learning

1 code implementation • 6 Sep 2018 • Ding Liu, Bihan Wen, Jianbo Jiao, Xian-Ming Liu, Zhangyang Wang, Thomas S. Huang

Second we propose a deep neural network solution that cascades two modules for image denoising and various high-level tasks, respectively, and use the joint loss for updating only the denoising network via back-propagation.

Image Denoising Vocal Bursts Intensity Prediction

Paper
Code

Task-driven Webpage Saliency

no code implementations • ECCV 2018 • Quanlong Zheng, Jianbo Jiao, Ying Cao, Rynson W. H. Lau

Inspired by the observation that given a specific task, human attention is strongly correlated with certain semantic components on a webpage (e. g., images, buttons and input boxes), our network explicitly disentangles saliency prediction into two independent sub-tasks: task-specific attention shift prediction and task-free saliency prediction.

Saliency Prediction

Paper
Add Code

Look Deeper into Depth: Monocular Depth Estimation with Semantic Booster and Attention-Driven Loss

no code implementations • ECCV 2018 • Jianbo Jiao, Ying Cao, Yibing Song, Rynson Lau

Monocular depth estimation benefits greatly from learning based techniques.

Monocular Depth Estimation

Paper
Add Code

Delving Into Salient Object Subitizing and Detection

no code implementations • ICCV 2017 • Shengfeng He, Jianbo Jiao, Xiaodan Zhang, Guoqiang Han, Rynson W. H. Lau

Experiments show that the proposed multi-task network outperforms existing multi-task architectures, and the auxiliary subitizing network provides strong guidance to salient object detection by reducing false positives and producing coherent saliency maps.

Object object-detection +2

Paper
Add Code

3D Hand Pose Tracking and Estimation Using Stereo Matching

no code implementations • 23 Oct 2016 • Jiawei Zhang, Jianbo Jiao, Mingliang Chen, Liangqiong Qu, Xiaobin Xu, Qingxiong Yang

This paper demonstrates that the performance of the state-of-the art tracking/estimation algorithms can be maintained with most stereo matching algorithms on the proposed benchmark, as long as the hand segmentation is correct.

Hand Segmentation Pose Tracking +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.