1 code implementation • ECCV 2020 • Panos Achlioptas, Ahmed Abdelreheem, Fei Xia, Mohamed Elhoseiny, Leonidas Guibas
Due to the scarcity and unsuitability of existent 3D-oriented linguistic resources for this task, we first develop two large-scale and complementary visio-linguistic datasets: i) extbf{ extit{Sr3D}}, which contains 83. 5K template-based utterances leveraging extit{spatial relations} with other fine-grained object classes to localize a referred object in a given scene, and ii) extbf{ extit{Nr3D}} which contains 41. 5K extit{natural, free-form}, utterances collected by deploying a 2-player object reference game in 3D scenes.
no code implementations • 11 Dec 2023 • Panos Achlioptas, Alexandros Benetatos, Iordanis Fostiropoulos, Dimitris Skourtis
In this work, we systematically study the problem of personalized text-to-image generation, where the output image is expected to portray information about specific human subjects.
no code implementations • 23 Mar 2023 • Willi Menapace, Aliaksandr Siarohin, Stéphane Lathuilière, Panos Achlioptas, Vladislav Golyanik, Sergey Tulyakov, Elisa Ricci
Most captivatingly, our PGM unlocks the director's mode, where the game is played by specifying goals for the agents in the form of a prompt.
1 code implementation • CVPR 2023 • Panos Achlioptas, IAn Huang, Minhyuk Sung, Sergey Tulyakov, Leonidas Guibas
In this work, we aim to facilitate the task of editing the geometry of 3D models through the use of natural language.
no code implementations • 12 Dec 2022 • Ahmed Abdelreheem, Kyle Olszewski, Hsin-Ying Lee, Peter Wonka, Panos Achlioptas
The two popular datasets ScanRefer [16] and ReferIt3D [3] connect natural language to real-world 3D data.
1 code implementation • 9 Dec 2022 • IAn Huang, Panos Achlioptas, Tianyi Zhang, Sergey Tulyakov, Minhyuk Sung, Leonidas Guibas
Additionally, to measure edit locality, we define a new metric that we call part-wise edit precision.
no code implementations • CVPR 2023 • Panos Achlioptas, Maks Ovsjanikov, Leonidas Guibas, Sergey Tulyakov
To embark on this journey, we introduce and share with the research community a large-scale dataset that contains emotional reactions and free-form textual explanations for 85, 007 publicly available images, analyzed by 6, 283 annotators who were asked to indicate and explain how and why they felt in a particular way when observing a specific image, producing a total of 526, 749 responses.
1 code implementation • 1 Apr 2022 • Ye Zhu, Kyle Olszewski, Yu Wu, Panos Achlioptas, Menglei Chai, Yan Yan, Sergey Tulyakov
We present Dance2Music-GAN (D2M-GAN), a novel adversarial multi-modal framework that generates complex musical samples conditioned on dance videos.
1 code implementation • 7 Jan 2022 • Zhengfei Kuang, Kyle Olszewski, Menglei Chai, Zeng Huang, Panos Achlioptas, Sergey Tulyakov
We present a novel method to acquire object representations from online image collections, capturing high-quality geometry and material properties of arbitrary objects from photographs with varying cameras, illumination, and backgrounds.
2 code implementations • CVPR 2022 • Juil Koo, IAn Huang, Panos Achlioptas, Leonidas Guibas, Minhyuk Sung
We introduce PartGlot, a neural framework and associated architectures for learning semantic part segmentation of 3D shape geometry, based solely on part referential language.
5 code implementations • CVPR 2021 • Panos Achlioptas, Maks Ovsjanikov, Kilichbek Haydarov, Mohamed Elhoseiny, Leonidas Guibas
We present a novel large-scale dataset and accompanying machine learning models aimed at providing a detailed understanding of the interplay between visual content, its emotional effect, and explanations for the latter in language.
1 code implementation • 3 Sep 2020 • Minhyuk Sung, Zhenyu Jiang, Panos Achlioptas, Niloy J. Mitra, Leonidas J. Guibas
Shape deformation is an important component in any geometry processing toolbox.
Graphics
3 code implementations • ICCV 2021 • Sherif Abdelkarim, Aniket Agarwal, Panos Achlioptas, Jun Chen, Jiaji Huang, Boyang Li, Kenneth Church, Mohamed Elhoseiny
We use these benchmarks to study the performance of several state-of-the-art long-tail models on the LTVRR setup.
1 code implementation • ICCV 2019 • Panos Achlioptas, Judy Fan, Robert X. D. Hawkins, Noah D. Goodman, Leonidas J. Guibas
We also find that these models are amenable to zero-shot transfer learning to novel object classes (e. g. transfer from training on chairs to testing on lamps), as well as to real-world images drawn from furniture catalogs.
no code implementations • ICLR 2019 • Panos Achlioptas, Judy E. Fan, Robert X. D. Hawkins, Noah D. Goodman, Leo Guibas
We further show that a neural speaker that is `listener-aware' --- that plans its utterances according to how an imagined listener would interpret its words in context --- produces more discriminative referring expressions than an `listener-unaware' speaker, as measured by human performance in identifying the correct object.
1 code implementation • ICCV 2019 • Ruqi Huang, Marie-Julie Rakotosaona, Panos Achlioptas, Leonidas Guibas, Maks Ovsjanikov
This paper proposes a learning-based framework for reconstructing 3D shapes from functional operators, compactly encoded as small-sized matrices.
no code implementations • ICCV 2019 • Anastasia Dubrovina, Fei Xia, Panos Achlioptas, Mira Shalah, Raphael Groscot, Leonidas Guibas
We present a novel neural network architecture, termed Decomposer-Composer, for semantic structure-aware 3D shape modeling.
3 code implementations • ICML 2018 • Panos Achlioptas, Olga Diamanti, Ioannis Mitliagkas, Leonidas Guibas
Three-dimensional geometric data offer an excellent domain for studying representation learning and generative modeling.