no code implementations • 20 Sep 2023 • Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, SerNam Lim
While most modern video understanding models operate on short-range clips, real-world videos are often several minutes long with semantically consistent segments of variable length.
1 code implementation • 17 Nov 2022 • Amaya Dharmasiri, Dinithi Dissanayake, Mohamed Afham, Isuru Dissanayake, Ranga Rodrigo, Kanchana Thilakarathna
However, most models do not offer controllability to manipulate the shape semantics of component object parts without extensive semantic attribute labels or other reference point clouds.
no code implementations • 20 Oct 2022 • Mohamed Afham, Ranga Rodrigo
The pre-trained semantic feature extractor (learned from large-scale text corpora) used in our approach provides strong contextual prior knowledge to assist FSL.
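The idea of a text-derived semantic prior assisting few-shot classification can be sketched as follows. This is a minimal illustration, not the paper's implementation: the embedding dimension, the mean-prototype classifier, and the fusion weight `alpha` are all assumptions, and the "pre-trained" text embeddings are stand-in random vectors.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 16          # shared embedding dimension (assumption)
N_CLASSES = 5     # 5-way few-shot episode
K_SHOT = 1        # 1-shot

# Stand-in for semantic class-name features from a text encoder
# pre-trained on large-scale corpora (hypothetical values).
text_embed = rng.normal(size=(N_CLASSES, DIM))

# Visual prototypes: per-class mean of support-set features.
support_feats = rng.normal(size=(N_CLASSES, K_SHOT, DIM))
proto = support_feats.mean(axis=1)

# Fuse the semantic prior with the visual prototype; the weight
# alpha is illustrative, not taken from the paper.
alpha = 0.5
fused_proto = alpha * proto + (1 - alpha) * text_embed

def classify(query_feat: np.ndarray) -> int:
    """Assign the query to the nearest fused prototype (Euclidean)."""
    dists = np.linalg.norm(fused_proto - query_feat, axis=1)
    return int(np.argmin(dists))

# A query drawn near the class-2 support feature.
query = support_feats[2, 0] + 0.01 * rng.normal(size=DIM)
pred = classify(query)
```

The fusion step is the essential point: class semantics from text act as an extra prototype term, so a query can be matched even when only one visual example per class is available.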
1 code implementation • CVPR 2022 • Mohamed Afham, Isuru Dissanayake, Dinithi Dissanayake, Amaya Dharmasiri, Kanchana Thilakarathna, Ranga Rodrigo
Manual annotation of large-scale point cloud datasets for varying tasks such as 3D object classification, segmentation, and detection is often laborious owing to the irregular structure of point clouds.
Tasks: 3D Object Classification, 3D Point Cloud Linear Classification, +3
1 code implementation • 7 Oct 2021 • Mohamed Afham, Udith Haputhanthri, Jathurshan Pradeepkumar, Mithunjha Anandakumar, Ashwin De Silva, Chamira Edussooriya
The majority of contactless human pose estimation algorithms are based on the RGB modality, rendering them ineffective for in-bed pose estimation due to occlusion by blankets and varying illumination conditions.
no code implementations • 26 Apr 2021 • Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, Fahad Shahbaz Khan
Human learning benefits from multi-modal inputs that often appear as rich semantics (e.g., a description of an object's attributes while learning about it).
Ranked #1 on Few-Shot Image Classification on Oxford 102 Flower (using extra training data)