The effectiveness of the lens functions is demonstrated in two use cases, and their computational cost is analysed in a synthetic benchmark.
In addition, efficient and effective interactions between multi-modal representations remain underexplored; in particular, existing work lacks insight into the prognostic correlations among multi-modal features.
To this end, we first develop OpenGait, a flexible and efficient gait recognition platform.
At the core of DCMHA is a $\it{Compose}$ function that transforms the attention score and weight matrices in an input-dependent way.
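The cross-head, input-dependent composition described above can be illustrated with a toy sketch. This is not the paper's implementation: the shapes, the projection `W`, and the per-position mixing matrices are all hypothetical, chosen only to show the idea of mixing per-head attention scores with weights derived from the input.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: h heads, sequence length T, model dimension d.
h, T, d = 4, 5, 8

# Per-head attention scores (e.g. Q K^T / sqrt(d_k)), shape (h, T, T).
scores = rng.standard_normal((h, T, T))

# Input-dependent mixing weights: a toy linear projection of the
# query-side hidden states x (T, d) to one (h, h) mixing matrix per
# query position. In DCMHA this role is played by the Compose function.
x = rng.standard_normal((T, d))
W = rng.standard_normal((d, h * h))          # hypothetical projection
mix = (x @ W).reshape(T, h, h)               # (T, h, h)

# Compose: the new score for head i at query position t is an
# input-dependent mixture of all heads' scores at that position.
composed = np.einsum('thg,gtj->htj', mix, scores)   # (h, T, T)

assert composed.shape == (h, T, T)
```

The same kind of mixing could be applied to the attention weight matrices after the softmax; the sketch only covers the pre-softmax scores.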
However, these works have the following limitations in modeling the high-order relationships over unlabeled data: (i) They primarily focus on maximizing the agreements among individual node embeddings while neglecting the capture of group-wise collective behaviors within hypergraphs; (ii) Most of them disregard the importance of the temperature index in discriminating contrastive pairs during contrast optimization.
Self-supervised pretraining (SSP) has been recognized as a method to enhance prediction accuracy in various downstream tasks.
We propose Endoscopic Depth Any Camera (EndoDAC), an efficient self-supervised depth estimation framework that adapts foundation models to endoscopic scenes.
We present an active automata learning algorithm which learns a decomposition of a finite state machine, based on projecting onto individual outputs.
To this end, we first construct two multimodal datasets for dense and occluded vehicle detection at large-scale events, utilizing RGB and height-map modalities.
We propose a simple strategy for masking image patches during visual-language contrastive learning that improves the quality of the learned representations and the training speed.