1 code implementation • 4 Apr 2024 • Amrin Kareem, Jean Lahoud, Hisham Cholakkal
We introduce a novel segmentation task known as reasoning part segmentation for 3D objects, aiming to output a segmentation mask based on complex and implicit textual queries about specific parts of a 3D object.
1 code implementation • 3 Oct 2023 • Yahia Dalbah, Jean Lahoud, Hisham Cholakkal
Scene understanding plays an essential role in enabling autonomous driving and maintaining high standards of performance and safety.
1 code implementation • NeurIPS 2023 • Mohamed El Amine Boudjoghra, Salwa K. Al Khatib, Jean Lahoud, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Khan
We argue that such a closed-world assumption is restrictive and explore for the first time 3D indoor instance segmentation in an open-world setting, where the model is allowed to distinguish a set of known classes as well as identify an unknown object as unknown and then later incrementally learning the semantic category of the unknown when the corresponding category labels are available.
1 code implementation • 17 Apr 2023 • Yahia Dalbah, Jean Lahoud, Hisham Cholakkal
This improvement was associated with the increasing use of LiDAR sensors and point cloud data to facilitate the task of object detection and recognition in autonomous driving.
no code implementations • 13 Feb 2023 • Sultan Abu Ghazal, Jean Lahoud, Rao Anwer
Self-attention is proven to be effective in encoding correlation information in 3D point clouds by (xie2020mlcvnet).
no code implementations • ICCV 2023 • Salwa Al Khatib, Mohamed El Amine Boudjoghra, Jean Lahoud, Fahad Shahbaz Khan
Specifically, we provide the transformer block with spatial features to facilitate differentiation between similar object queries and incorporate semantic supervision to enhance prediction accuracy based on object class.
no code implementations • 13 Sep 2022 • Dhanalaxmi Gaddam, Jean Lahoud, Fahad Shahbaz Khan, Rao Muhammad Anwer, Hisham Cholakkal
In this work, we propose Contextualized Multi-Stage Refinement for 3D Object Detection (CMR3D) framework, which takes a 3D scene as input and strives to explicitly integrate useful contextual information of the scene at multiple levels to predict a set of object bounding-boxes along with their corresponding semantic labels.
1 code implementation • 8 Aug 2022 • Jean Lahoud, Jiale Cao, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Ming-Hsuan Yang
The success of the transformer architecture in natural language processing has recently triggered attention in the computer vision field.
no code implementations • 20 Jul 2022 • Fatima Albreiki, Sultan Abughazal, Jean Lahoud, Rao Anwer, Hisham Cholakkal, Fahad Khan
To the best of our knowledge, we are the first to investigate the robustness of point-based 3D object detectors.
no code implementations • 6 Feb 2020 • Jean Lahoud, Bernard Ghanem
These labels, denoted by HN-labels, represent different height and normal patches, which allow mining of local semantic information that is useful in the task of semantic RGB segmentation.
Ranked #101 on Semantic Segmentation on NYU Depth v2
no code implementations • ICCV 2019 • Jean Lahoud, Bernard Ghanem, Marc Pollefeys, Martin R. Oswald
The second goal is to learn instance information by densely estimating directional information of the instance's center of mass for each voxel.
Ranked #2 on 3D Semantic Instance Segmentation on ScanNetV2
no code implementations • ICCV 2017 • Jean Lahoud, Bernard Ghanem
We then use the 3D information to orient, place, and score bounding boxes around objects.
Ranked #4 on Object Detection In Indoor Scenes on SUN RGB-D
no code implementations • 19 Aug 2017 • Matthias Müller, Vincent Casser, Jean Lahoud, Neil Smith, Bernard Ghanem
We present a photo-realistic training and evaluation simulator (Sim4CV) with extensive applications across various fields of computer vision.