Search Results for author: Xiaohang Sun

Found 5 papers, 0 papers with code

Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment

no code implementations ICCV 2023 Sarah Ibrahimi, Xiaohang Sun, Pichao Wang, Amanmeet Garg, Ashutosh Sanan, Mohamed Omar

Nonetheless, the objective of the text-to-video retrieval task is to capture the complementary audio and video information that is pertinent to the text query rather than simply achieving better audio and video alignment.

Retrieval, Text to Video Retrieval

AVT: Audio-Video Transformer for Multimodal Action Recognition

no code implementations Submitted to ICLR 2022 Wentao Zhu, Jingru Yi, Kevin Hsu, Xiaohang Sun, Xiang Hao, Linda Liu, Mohamed Omar

AVT uses a combination of video and audio signals to improve action recognition accuracy, leveraging the effective spatio-temporal representation by the video Transformer.

Action Recognition, Audio Classification

When deep denoising meets iterative phase retrieval

no code implementations ICML 2020 Yaotian Wang, Xiaohang Sun, Jason W. Fleischer

Recovering a signal from its Fourier intensity underlies many important applications, including lensless imaging and imaging through scattering media.

Denoising, Retrieval

DuDoNet: Dual Domain Network for CT Metal Artifact Reduction

no code implementations CVPR 2019 Wei-An Lin, Haofu Liao, Cheng Peng, Xiaohang Sun, Jingdan Zhang, Jiebo Luo, Rama Chellappa, Shaohua Kevin Zhou

The linkage between the sinogram and image domains is a novel Radon inversion layer that allows the gradients to back-propagate from the image domain to the sinogram domain during training.

Computed Tomography (CT), Medical Diagnosis
