Search Results for author: Rohit Saxena

Found 10 papers, 5 papers with code

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization

1 code implementation • 24 Feb 2025 • Rohit Saxena, Pasquale Minervini, Frank Keller

We benchmark state-of-the-art Multimodal Large Language Models (MLLMs) on PosterSum and demonstrate that they struggle to accurately interpret and summarize scientific posters.

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

no code implementations • 7 Feb 2025 • Rohit Saxena, Aryo Pradipta Gema, Pasquale Minervini

Understanding time from visual representations is a fundamental cognitive skill, yet it remains a challenge for multimodal large language models (MLLMs).

End-to-End Long Document Summarization using Gradient Caching

no code implementations • 3 Jan 2025 • Rohit Saxena, Hao Tang, Frank Keller

Training transformer-based encoder-decoder models for long document summarization poses a significant challenge due to the quadratic memory consumption during training.

Decoder, Document Summarization

MovieSum: An Abstractive Summarization Dataset for Movie Screenplays

1 code implementation • 12 Aug 2024 • Rohit Saxena, Frank Keller

Movie screenplay summarization is challenging, as it requires an understanding of long input contexts and various elements unique to movies.

Abstractive Text Summarization, Document Summarization

Select and Summarize: Scene Saliency for Movie Script Summarization

1 code implementation • 4 Apr 2024 • Rohit Saxena, Frank Keller

Abstractive summarization for long-form narrative texts such as movie scripts is challenging due to the computational and memory constraints of current language models.

Abstractive Text Summarization

Data-Driven Compression of Convolutional Neural Networks

no code implementations • 28 Nov 2019 • Ramit Pahwa, Manoj Ghuhan Arivazhagan, Ankur Garg, Siddarth Krishnamoorthy, Rohit Saxena, Sunav Choudhary

Designing and training a CNN architecture that does well on all three metrics is highly non-trivial and can be very time-consuming if done by hand.

Knowledge Distillation, Model Compression +1
