1 code implementation • 28 Apr 2024 • Navve Wasserman, Noam Rotstein, Roy Ganz, Ron Kimmel
We address this by leveraging the insight that removing objects (Inpaint) is significantly simpler than the inverse process of adding them (Paint), since segmentation mask datasets can be paired with inpainting models that fill in the masked regions.
1 code implementation • 28 May 2023 • Noam Rotstein, David Bensaid, Shaked Brody, Roy Ganz, Ron Kimmel
Our proposed method, FuseCap, fuses the outputs of such vision experts with the original captions using a large language model (LLM), yielding comprehensive image descriptions.
Ranked #1 on Image Captioning on COCO Captions (CLIPScore metric)
no code implementations • 15 Dec 2021 • Amit Bracha, Noam Rotstein, David Bensaïd, Ron Slossberg, Ron Kimmel
To mitigate this quadratic relation, we propose a simple but effective method that uses a refinement network for depth estimation.
1 code implementation • CVPR 2022 • Noam Rotstein, Amit Bracha, Ron Kimmel
To overcome this difficulty, we consider a differential optimization method that aligns a colored point cloud with a given color image through iterative geometric and color matching.