Search Results for author: Xiaoyan Wang

Found 12 papers, 4 papers with code

3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding

1 code implementation6 Jan 2024 Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liu

The remarkable potential of multi-modal large language models (MLLMs) in comprehending both vision and language information has been widely acknowledged.

Scene Understanding Visual Question Answering (VQA)

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis

no code implementations20 Dec 2023 Bichen Wu, Ching-Yao Chuang, Xiaoyan Wang, Yichen Jia, Kapil Krishnakumar, Tong Xiao, Feng Liang, Licheng Yu, Peter Vajda

In this paper, we introduce Fairy, a minimalist yet robust adaptation of image-editing diffusion models, enhancing them for video editing applications.

Data Augmentation Video Editing +1

AVID: Any-Length Video Inpainting with Diffusion Model

1 code implementation6 Dec 2023 Zhixing Zhang, Bichen Wu, Xiaoyan Wang, Yaqiao Luo, Luxin Zhang, Yinan Zhao, Peter Vajda, Dimitris Metaxas, Licheng Yu

Given a video, a masked region at its initial frame, and an editing prompt, it requires a model to do infilling at each frame following the editing guidance while keeping the out-of-mask region intact.

Image Inpainting Video Inpainting

Distinguish Confusing Law Articles for Legal Judgment Prediction

1 code implementation ACL 2020 Nuo Xu, Pinghui Wang, Long Chen, Li Pan, Xiaoyan Wang, Junzhou Zhao

Legal Judgment Prediction (LJP) is the task of automatically predicting a law case's judgment results given a text describing its facts, which has excellent prospects in judicial assistance systems and convenient services for the public.

Consistent Classification with Generalized Metrics

no code implementations24 Aug 2019 Xiaoyan Wang, Ran Li, Bowei Yan, Oluwasanmi Koyejo

We propose a framework for constructing and analyzing multiclass and multioutput classification metrics, i. e., involving multiple, possibly correlated multiclass labels.

Classification General Classification

Answering Science Exam Questions Using Query Rewriting with Background Knowledge

no code implementations15 Sep 2018 Ryan Musa, Xiaoyan Wang, Achille Fokoue, Nicholas Mattei, Maria Chang, Pavan Kapanipathi, Bassem Makni, Kartik Talamadupula, Michael Witbrock

Open-domain question answering (QA) is an important problem in AI and NLP that is emerging as a bellwether for progress on the generalizability of AI methods and techniques.

Information Retrieval Multiple-choice +3

Cannot find the paper you are looking for? You can Submit a new open access paper.