Lens functions for exploring UMAP Projections with Domain Knowledge

vda-lab/lensed_umap 15 May 2024

The effectiveness of the lens functions is demonstrated in two use cases and their computational cost is analysed in a synthetic benchmark.

1
15 May 2024

MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer

wuchengyu123/mmfusion 15 May 2024

In addition, efficient and effective interactions between multi-modal representations need to be further explored, lacking insightful exploration of prognostic correlation in multi-modality features.

1
15 May 2024

OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better Practicality

shiqiyu/opengait 15 May 2024

To this end, we first develop OpenGait, a flexible and efficient gait recognition platform.

642
15 May 2024

Improving Transformers with Dynamically Composable Multi-Head Attention

caiyun-ai/dcformer 14 May 2024

At the core of DCMHA is a $\it{Compose}$ function that transforms the attention score and weight matrices in an input-dependent way.

Language Modelling

32
14 May 2024

Dual-level Hypergraph Contrastive Learning with Adaptive Temperature Enhancement

graphprojects/HyGCL-AdT International World Wide Web Conference 2024

However, these works have the following limitations in modeling the high-order relationships over unlabeled data: (i) They primarily focus on maximizing the agreements among individual node embeddings while neglecting the capture of group-wise collective behaviors within hypergraphs; (ii) Most of them disregard the importance of the temperature index in discriminating contrastive pairs during contrast optimization.

Contrastive Learning Hypergraph Contrastive Learning

2
14 May 2024

Self-Distillation Improves DNA Sequence Inference

wiedersehne/findna 14 May 2024

Self-supervised pretraining (SSP) has been recognized as a method to enhance prediction accuracy in various downstream tasks.

Contrastive Learning Language Modelling +1

0
14 May 2024

EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera

beileicui/endodac 14 May 2024

We propose Endoscopic Depth Any Camera (EndoDAC) which is an efficient self-supervised depth estimation framework that adapts foundation models to endoscopic scenes.

Depth Estimation Surface Reconstruction

4
14 May 2024

Output-decomposed Learning of Mealy Machines

SCRK16/OLStar 14 May 2024

We present an active automata learning algorithm which learns a decomposition of a finite state machine, based on projecting onto individual outputs.

1
14 May 2024

Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale Events

shank2358/mudet 14 May 2024

To this end, we first construct two multimodal dense and occlusion vehicle detection datasets for large-scale events, utilizing RGB and height map modalities.

4k object-detection +1

0
14 May 2024

Efficient Vision-Language Pre-training by Cluster Masking

zi-hao-wei/efficient-vision-language-pre-training-by-cluster-masking 14 May 2024

We propose a simple strategy for masking image patches during visual-language contrastive learning that improves the quality of the learned representations and the training speed.

Contrastive Learning

8
14 May 2024