no code implementations • 20 Jan 2023 • Xudong Hong, Asad Sayeed, Khushboo Mehra, Vera Demberg, Bernt Schiele
The image sequences are aligned with a total of 12K stories which were collected via crowdsourcing given the image sequences and a set of grounded characters from the corresponding image sequence.
no code implementations • 28 Sep 2022 • Khushboo Mehra, Hassan Soliman, Soumya Ranjan Sahoo
Deep Learning models, in particular, show promising results on image segmentation and classification problems, but they require very large datasets for training.
no code implementations • CONLL 2020 • Xudong Hong, Rakshith Shetty, Asad Sayeed, Khushboo Mehra, Vera Demberg, Bernt Schiele
A problem in automatically generated stories for image sequences is that they use overly generic vocabulary and phrase structure and fail to match the distributional characteristics of human-generated text.
Ranked #5 on Visual Storytelling on VIST