no code implementations • 8 Mar 2025 • Taewon Kang, Divya Kothandaraman, Ming C. Lin
We introduce a novel storytelling approach to enable seamless video generation with natural action transitions and structured narratives.
1 code implementation • 24 Sep 2024 • Divya Kothandaraman, Kuldeep Kulkarni, Sumit Shekhar, Balaji Vasan Srinivasan, Dinesh Manocha
We propose a novel diffusion guidance formulation, image frequency guidance, to steer the generation towards the manifold of the source subject and the driving action at every step of the inference denoising.
no code implementations • 12 Aug 2024 • Taewon Kang, Divya Kothandaraman, Dinesh Manocha, Ming C. Lin
Recent 3D novel view synthesis (NVS) methods often require extensive 3D data for training, and also typically lack generalization beyond the training distribution.
no code implementations • 22 May 2024 • Divya Kothandaraman, Kihyuk Sohn, Ruben Villegas, Paul Voigtlaender, Dinesh Manocha, Mohammad Babaeizadeh
We present a method for multi-concept customization of pretrained text-to-video (T2V) models.
1 code implementation • 22 May 2024 • Divya Kothandaraman, Ming Lin, Dinesh Manocha
We introduce a novel approach for prompt mixing, aiming to generate images at the intersection of multiple text prompts using pre-trained text-to-image diffusion models.
2 code implementations • 27 Nov 2023 • Divya Kothandaraman, Tianyi Zhou, Ming Lin, Dinesh Manocha
It seamlessly blends the visual features from the input image within a pretrained text-to-2Dimage stable diffusion model with a test-time optimization process for a careful bias-variance trade-off, which uses an Inverse Perspective Mapping (IPM) homography transformation to provide subtle cues for aerialview synthesis.
1 code implementation • 14 Apr 2023 • Ruiqi Xian, Xijun Wang, Divya Kothandaraman, Dinesh Manocha
Our algorithm utilizes the motion bias within aerial videos, which enables the selection of motion-salient frames.
Ranked #1 on
Action Recognition
on UAV-Human
3 code implementations • 15 Mar 2023 • Divya Kothandaraman, Tianyi Zhou, Ming Lin, Dinesh Manocha
Aerial Diffusion leverages a pretrained text-image diffusion model for prior knowledge.
no code implementations • 15 Sep 2022 • Divya Kothandaraman, Ming Lin, Dinesh Manocha
We build a differentiable static-dynamic frequency mask prior to model the salient static and dynamic pixels in the video, crucial for the underlying task of action recognition.
no code implementations • 13 Sep 2022 • James F. Mullen Jr, Divya Kothandaraman, Aniket Bera, Dinesh Manocha
We compare our method, which we call PAAK, with prior approaches, including POSA, PROX ground truth, and a motion synthesis method, and highlight the benefits of our method with a perceptual study.
1 code implementation • 24 May 2022 • Divya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha
SALAD has three key benefits: (i) it is task-agnostic, and can be applied across various visual tasks such as classification, segmentation and detection; (ii) it can handle shifts in output label space from the pre-trained source network to the target domain; (iii) it does not require access to source data for adaptation.
1 code implementation • 21 Mar 2022 • Divya Kothandaraman, Tianrui Guan, Xijun Wang, Sean Hu, Ming Lin, Dinesh Manocha
Our formulation uses a novel Fourier object disentanglement method to innately separate out the human agent (which is typically small) from the background.
Ranked #1 on
Action Recognition
on UAV Human
1 code implementation • 7 Mar 2021 • Tianrui Guan, Divya Kothandaraman, Rohan Chandra, Adarsh Jagan Sathyamoorthy, Kasun Weerakoon, Dinesh Manocha
We interface GANav with a deep reinforcement learning-based navigation algorithm and highlight its benefits in terms of navigation in real-world unstructured terrains.
Ranked #1 on
Semantic Segmentation
on RUGD
2 code implementations • 27 Nov 2020 • Divya Kothandaraman, Rohan Chandra, Dinesh Manocha
We present a novel approach for unsupervised road segmentation in adverse weather conditions such as rain or fog.
1 code implementation • 3 Nov 2020 • Divya Kothandaraman, Athira Nambiar, Anurag Mittal
Practical autonomous driving systems face two crucial challenges: memory constraints and domain gap issues.
1 code implementation • 22 Sep 2020 • Divya Kothandaraman, Rohan Chandra, Dinesh Manocha
We present an unsupervised adaptation approach for visual scene understanding in unstructured traffic environments.
no code implementations • 18 Aug 2020 • Yuqian Zhou, Michael Kwan, Kyle Tolentino, Neil Emerton, Sehoon Lim, Tim Large, Lijiang Fu, Zhihong Pan, Baopu Li, Qirui Yang, Yihao Liu, Jigang Tang, Tao Ku, Shibin Ma, Bingnan Hu, Jiarong Wang, Densen Puthussery, Hrishikesh P. S, Melvin Kuriakose, Jiji C. V, Varun Sundar, Sumanth Hegde, Divya Kothandaraman, Kaushik Mitra, Akashdeep Jassal, Nisarg A. Shah, Sabari Nathan, Nagat Abdalla Esiad Rahel, Dafan Chen, Shichao Nie, Shuting Yin, Chengconghui Ma, Haoran Wang, Tongtong Zhao, Shanshan Zhao, Joshua Rego, Huaijin Chen, Shuai Li, Zhenhua Hu, Kin Wai Lau, Lai-Man Po, Dahai Yu, Yasar Abbas Ur Rehman, Yiqun Li, Lianping Xing
The results in the paper are state-of-the-art restoration performance of Under-Display Camera Restoration.
1 code implementation • 14 Aug 2020 • Varun Sundar, Sumanth Hegde, Divya Kothandaraman, Kaushik Mitra
In this work, we present Deep Atrous Guided Filter (DAGF), a two-stage, end-to-end approach for image restoration in UDC systems.