Search Results for author: XianTong Zhen

Found 52 papers, 23 papers with code

Self-Supervised Scene Flow Estimation with Point-Voxel Fusion and Surface Representation

no code implementations17 Oct 2024 Xuezhi Xiang, Xi Wang, Lei Zhang, Denis Ombati, Himaloy Himu, XianTong Zhen

Scene flow estimation aims to generate the 3D motion field of points between two consecutive frames of point clouds, which has wide applications in various fields.

Self-supervised Scene Flow Estimation

LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections

no code implementations14 Oct 2024 Xuezhi Xiang, Yibo Ning, Lei Zhang, Denis Ombati, Himaloy Himu, XianTong Zhen

In this paper, we propose a remote-sensing image semantic segmentation network named LKASeg, which combines Large Kernel Attention(LSKA) and Full-Scale Skip Connections(FSC).

Decoder Semantic Segmentation

Self-supervised Monocular Depth Estimation with Large Kernel Attention

no code implementations26 Sep 2024 Xuezhi Xiang, Yao Wang, Lei Zhang, Denis Ombati, Himaloy Himu, XianTong Zhen

Self-supervised monocular depth estimation has emerged as a promising approach since it does not rely on labeled training data.

Decoder Monocular Depth Estimation

LKA-ReID:Vehicle Re-Identification with Large Kernel Attention

no code implementations26 Sep 2024 Xuezhi Xiang, Zhushan Ma, Lei Zhang, Denis Ombati, Himaloy Himu, XianTong Zhen

Using attention mechanism to capture global and local features is crucial to solve the challenge of high similarity between classes in vehicle Re-ID tasks.

Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning

no code implementations19 Aug 2024 Zhi Qiao, Xuhui Liu, Xiaopeng Wang, Runkun Liu, XianTong Zhen, Pei Dong, Zhen Qian

Intraoperative CT imaging serves as a crucial resource for surgical guidance; however, it may not always be readily accessible or practical to implement.

CT Reconstruction Image Generation +1

Coarse-Fine View Attention Alignment-Based GAN for CT Reconstruction from Biplanar X-Rays

no code implementations19 Aug 2024 Zhi Qiao, Hanqiang Ouyang, Dongheng Chu, Huishu Yuan, XianTong Zhen, Pei Dong, Zhen Qian

For surgical planning and intra-operation imaging, CT reconstruction using X-ray images can potentially be an important alternative when CT imaging is not available or not feasible.

CT Reconstruction

HYDEN: Hyperbolic Density Representations for Medical Images and Reports

no code implementations19 Aug 2024 Zhi Qiao, Linbin Han, XianTong Zhen, Jia-Hong Gao, Zhen Qian

In light of the inherent entailment relations between images and text, hyperbolic point vector embeddings, leveraging the hierarchical modeling advantages of hyperbolic space, have been utilized for visual semantic representation learning.

Representation Learning

Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation

1 code implementation24 Jun 2024 Yuchen Yang, Yingdong Shi, Cheems Wang, XianTong Zhen, Yuxuan Shi, Jun Xu

Fine-tuning pretrained large models to downstream tasks is an important problem, which however suffers from huge memory overhead due to large-scale parameters.

Order-preserving Consistency Regularization for Domain Adaptation and Generalization

1 code implementation ICCV 2023 Mengmeng Jing, XianTong Zhen, Jingjing Li, Cees Snoek

To alleviate this problem, data augmentation coupled with consistency regularization are commonly adopted to make the model less sensitive to domain-specific attributes.

Data Augmentation Domain Adaptation +1

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models

no code implementations ICCV 2023 Baoshuo Kan, Teng Wang, Wenpeng Lu, XianTong Zhen, Weili Guan, Feng Zheng

Pre-trained vision-language models, e. g., CLIP, working with manually designed prompts have demonstrated great capacity of transfer learning.

Few-Shot Image Classification Transfer Learning

Probabilistic Test-Time Generalization by Variational Neighbor-Labeling

no code implementations8 Jul 2023 Sameer Ambekar, Zehao Xiao, Jiayi Shen, XianTong Zhen, Cees G. M. Snoek

We formulate the generalization at test time as a variational inference problem, by modeling pseudo labels as distributions, to consider the uncertainty during generalization and alleviate the misleading signal of inaccurate pseudo labels.

Domain Generalization Variational Inference

EMO: Episodic Memory Optimization for Few-Shot Meta-Learning

no code implementations8 Jun 2023 Yingjun Du, Jiayi Shen, XianTong Zhen, Cees G. M. Snoek

By learning to retain and recall the learning process of past training tasks, EMO nudges parameter updates in the right direction, even when the gradients provided by a limited number of examples are uninformative.

Few-Shot Learning

CageViT: Convolutional Activation Guided Efficient Vision Transformer

no code implementations17 May 2023 Hao Zheng, Jinbao Wang, XianTong Zhen, Hong Chen, Jingkuan Song, Feng Zheng

Recently, Transformers have emerged as the go-to architecture for both vision and language modeling tasks, but their computational efficiency is limited by the length of the input sequence.

Computational Efficiency Image Classification +1

MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks

1 code implementation17 May 2023 Wenfang Sun, Yingjun Du, XianTong Zhen, Fan Wang, Ling Wang, Cees G. M. Snoek

To account for the uncertainty caused by the limited training tasks, we propose a variational MetaModulation where the modulation parameters are treated as latent variables.

Diversity Few-Shot Learning

Implicit Diffusion Models for Continuous Super-Resolution

1 code implementation CVPR 2023 Sicheng Gao, Xuhui Liu, Bohan Zeng, Sheng Xu, Yanjing Li, Xiaoyan Luo, Jianzhuang Liu, XianTong Zhen, Baochang Zhang

IDM integrates an implicit neural representation and a denoising diffusion model in a unified end-to-end framework, where the implicit neural representation is adopted in the decoding process to learn continuous-resolution representation.

Denoising Image Super-Resolution

Parameter-Free Channel Attention for Image Classification and Super-Resolution

no code implementations20 Mar 2023 Yuxuan Shi, Lingxiao Yang, Wangpeng An, XianTong Zhen, Liuqing Wang

The channel attention mechanism is a useful technique widely employed in deep convolutional neural networks to boost the performance for image processing tasks, eg, image classification and image super-resolution.

Classification Image Classification +1

Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning

1 code implementation28 Feb 2023 Ivona Najdenkoska, XianTong Zhen, Marcel Worring

Existing methods are trying to communicate visual concepts as prompts to frozen language models, but rely on hand-engineered task induction to reduce the hypothesis space.

Few-Shot Learning

CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression Comprehension

1 code implementation17 Feb 2023 Zhi Zhang, Helen Yannakoudakis, XianTong Zhen, Ekaterina Shutova

The task of multimodal referring expression comprehension (REC), aiming at localizing an image region described by a natural language expression, has recently received increasing attention within the research comminity.

Referring Expression Referring Expression Comprehension

Variational Model Perturbation for Source-Free Domain Adaptation

1 code implementation19 Oct 2022 Mengmeng Jing, XianTong Zhen, Jingjing Li, Cees G. M. Snoek

Our model perturbation provides a new probabilistic way for domain adaptation which enables efficient adaptation to target domains while maximally preserving knowledge in source models.

Bayesian Inference Source-Free Domain Adaptation

Probabilistic Integration of Object Level Annotations in Chest X-ray Classification

no code implementations13 Oct 2022 Tom van Sonsbeek, XianTong Zhen, Dwarikanath Mahapatra, Marcel Worring

This shows how two-stage learning of labels from coarse to fine-grained, in particular with object level annotations, is an effective method for more optimal annotation usage.

Knowledge Distillation Variational Inference

LifeLonger: A Benchmark for Continual Disease Classification

1 code implementation12 Apr 2022 Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Tom van Sonsbeek, XianTong Zhen, Dwarikanath Mahapatra, Marcel Worring, Cees G. M. Snoek

Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch, while cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.

Classification class-incremental learning +2

FCNet: A Convolutional Neural Network for Arbitrary-Length Exposure Estimation

1 code implementation5 Mar 2022 Jin Liang, Yuchen Yang, Anran Zhang, Jun Xu, Hui Li, XianTong Zhen

For image exposure enhancement, the tasks of Single-Exposure Correction (SEC) and Multi-Exposure Fusion (MEF) are widely studied in the image processing community.

Exposure Correction

Learning to Generalize across Domains on Single Test Samples

1 code implementation ICLR 2022 Zehao Xiao, XianTong Zhen, Ling Shao, Cees G. M. Snoek

We leverage a meta-learning paradigm to learn our model to acquire the ability of adaptation with single samples at training time so as to further adapt itself to each single test sample at test time.

Bayesian Inference Domain Generalization +1

Generative Kernel Continual learning

no code implementations26 Dec 2021 Mohammad Mahdi Derakhshani, XianTong Zhen, Ling Shao, Cees G. M. Snoek

Kernel continual learning by \citet{derakhshani2021kernel} has recently emerged as a strong continual learner due to its non-parametric ability to tackle task interference and catastrophic forgetting.

Continual Learning

Hierarchical Variational Memory for Few-shot Learning Across Domains

1 code implementation ICLR 2022 Yingjun Du, XianTong Zhen, Ling Shao, Cees G. M. Snoek

To explore and exploit the importance of different semantic levels, we further propose to learn the weights associated with the prototype at each level in a data-driven way, which enables the model to adaptively choose the most generalizable features.

Few-Shot Learning Variational Inference

Learning to Learn Dense Gaussian Processes for Few-Shot Learning

no code implementations NeurIPS 2021 Ze Wang, Zichen Miao, XianTong Zhen, Qiang Qiu

In contrast to sparse Gaussian processes, we define a set of dense inducing variables to be of a much larger size than the support set in each task, which collects prior knowledge from experienced tasks.

Few-Shot Learning Gaussian Processes +2

Multi-Task Neural Processes

no code implementations10 Nov 2021 Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao

Our multi-task neural processes methodologically expand the scope of vanilla neural processes and provide a new way of exploring task relatedness in function spaces for multi-task learning.

Bayesian Inference Brain Image Segmentation +4

Variational Multi-Task Learning with Gumbel-Softmax Priors

1 code implementation NeurIPS 2021 Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao

Multi-task learning aims to explore task relatedness to improve individual tasks, which is of particular significance in the challenging scenario that only limited data is available for each task.

Bayesian Inference Multi-Task Learning

Variational Topic Inference for Chest X-Ray Report Generation

no code implementations15 Jul 2021 Ivona Najdenkoska, XianTong Zhen, Marcel Worring, Ling Shao

The topics are inferred in a conditional variational inference framework, with each topic governing the generation of a sentence in the report.

Diversity Sentence +2

Kernel Continual Learning

1 code implementation12 Jul 2021 Mohammad Mahdi Derakhshani, XianTong Zhen, Ling Shao, Cees G. M. Snoek

We further introduce variational random features to learn a data-driven kernel for each task.

Continual Learning Variational Inference

Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation

no code implementations ACL 2021 Yingjun Du, Nithin Holla, XianTong Zhen, Cees G. M. Snoek, Ekaterina Shutova

A critical challenge faced by supervised word sense disambiguation (WSD) is the lack of large annotated datasets with sufficient coverage of words in their diversity of senses.

Diversity Meta-Learning +2

Attentional Prototype Inference for Few-Shot Segmentation

1 code implementation14 May 2021 Haoliang Sun, Xiankai Lu, Haochen Wang, Yilong Yin, XianTong Zhen, Cees G. M. Snoek, Ling Shao

We define a global latent variable to represent the prototype of each object category, which we model as a probabilistic distribution.

Bayesian Inference Few-Shot Semantic Segmentation +2

A Bit More Bayesian: Domain-Invariant Learning with Uncertainty

1 code implementation9 May 2021 Zehao Xiao, Jiayi Shen, XianTong Zhen, Ling Shao, Cees G. M. Snoek

Domain generalization is challenging due to the domain shift and the uncertainty caused by the inaccessibility of target domain data.

Bayesian Inference Domain Generalization

MetaKernel: Learning Variational Random Features with Limited Labels

1 code implementation8 May 2021 Yingjun Du, Haoliang Sun, XianTong Zhen, Jun Xu, Yilong Yin, Ling Shao, Cees G. M. Snoek

Specifically, we propose learning variational random features in a data-driven manner to obtain task-specific kernels by leveraging the shared knowledge provided by related tasks in a meta-learning setting.

Few-Shot Image Classification Few-Shot Learning +1

Variational Knowledge Distillation for Disease Classification in Chest X-Rays

no code implementations19 Mar 2021 Tom van Sonsbeek, XianTong Zhen, Marcel Worring, Ling Shao

It is challenging to incorporate this information into disease classification due to the high reliance on clinician input in EHRs, limiting the possibility for automated diagnosis.

General Classification Image Classification +3

Variational Invariant Learning for Bayesian Domain Generalization

no code implementations1 Jan 2021 Zehao Xiao, Jiayi Shen, XianTong Zhen, Ling Shao, Cees G. M. Snoek

In the probabilistic modeling framework, we introduce a domain-invariant principle to explore invariance across domains in a unified way.

Domain Generalization

Variational Multi-Task Learning

no code implementations1 Jan 2021 Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao

Multi-task learning aims to improve the overall performance of a set of tasks by leveraging their relatedness.

Bayesian Inference Inductive Bias +1

Direct Estimation of Spinal Cobb Angles by Structured Multi-Output Regression

no code implementations23 Dec 2020 Haoliang Sun, XianTong Zhen, Chris Bailey, Parham Rasoulinejad, Yilong Yin, Shuo Li

The Cobb angle that quantitatively evaluates the spinal curvature plays an important role in the scoliosis diagnosis and treatment.

regression

Learning to Learn Variational Semantic Memory

1 code implementation NeurIPS 2020 XianTong Zhen, Yingjun Du, Huan Xiong, Qiang Qiu, Cees G. M. Snoek, Ling Shao

The variational semantic memory accrues and stores semantic information for the probabilistic inference of class prototypes in a hierarchical Bayesian framework.

Few-Shot Learning General Knowledge +1

Learning to Learn Kernels with Variational Random Features

2 code implementations ICML 2020 Xiantong Zhen, Haoliang Sun, Ying-Jun Du, Jun Xu, Yilong Yin, Ling Shao, Cees Snoek

We propose meta variational random features (MetaVRF) to learn adaptive kernels for the base-learner, which is developed in a latent variable model by treating the random feature basis as the latent variable.

Few-Shot Learning Variational Inference

Towards Direct Medical Image Analysis without Segmentation

no code implementations21 Oct 2015 Xiantong Zhen, Shuo Li

Direct methods have recently emerged as an effective and efficient tool in automated medical image analysis and become a trend to solve diverse challenging tasks in clinical practise.

Medical Image Analysis Segmentation

Supervised Descriptor Learning for Multi-Output Regression

no code implementations CVPR 2015 Xiantong Zhen, Zhijie Wang, Mengyang Yu, Shuo Li

In this paper, we propose a novel supervised descriptor learning (SDL) algorithm to establish a discriminative and compact feature representation for multi-output regression.

Head Pose Estimation regression

Cannot find the paper you are looking for? You can Submit a new open access paper.