Search Results for author: Xinyi Bai

Found 9 papers, 3 papers with code

ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations

1 code implementation16 Feb 2025 Bowen Jiang, Yuan Yuan, Xinyi Bai, Zhuoqun Hao, Alyson Yin, Yaojie Hu, Wenyu Liao, Lyle Ungar, Camillo J. Taylor

This work demonstrates that diffusion models can achieve font-controllable multilingual text rendering using just raw images without font label annotations.

Text Segmentation

MMMT-IF: A Challenging Multimodal Multi-Turn Instruction Following Benchmark

no code implementations26 Sep 2024 Elliot L. Epstein, Kaisheng Yao, Jing Li, Xinyi Bai, Hamid Palangi

When all the instructions are also appended to the end of the model input context, the $\operatorname{PIF}$ metric improves by 22. 3 points on average, showing that the challenge with the task lies not only in following the instructions, but also in retrieving the instructions spread out in the model context.

Instruction Following

Entity-Aware Self-Attention and Contextualized GCN for Enhanced Relation Extraction in Long Sentences

no code implementations15 Sep 2024 Xin Wang, Xinyi Bai

To be specific, relative position self-attention obtains the overall semantic pairwise correlation related to word position, and contextualized graph convolutional networks capture rich intra-sentence dependencies between words by adequately pruning operations.

Position Relation +3

Towards Rationality in Language and Multimodal Agents: A Survey

1 code implementation1 Jun 2024 Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Yuan Yuan, Zhuoqun Hao, Xinyi Bai, Weijie J. Su, Camillo J. Taylor, Tanwi Mallick

This work discusses how to build more rational language and multimodal agents and what criteria define rationality in intelligent systems.

Decision Making Survey

Faithful Persona-based Conversational Dataset Generation with Large Language Models

1 code implementation15 Dec 2023 Pegah Jandaghi, XiangHai Sheng, Xinyi Bai, Jay Pujara, Hakim Sidahmed

Training Natural Language Processing (NLP) models on a diverse and comprehensive persona-based dataset can lead to conversational models that create a deeper connection with the user, and maintain their engagement.

Chatbot Dataset Generation

GPU Accelerated Color Correction and Frame Warping for Real-time Video Stitching

no code implementations17 Aug 2023 Lu Yang, Zhenglun Kong, Ting Li, Xinyi Bai, Zhiye Lin, Hong Cheng

Traditional image stitching focuses on a single panorama frame without considering the spatial-temporal consistency in videos.

Camera Calibration Image Stitching +1

Close-up View synthesis by Interpolating Optical Flow

no code implementations12 Jul 2023 Xinyi Bai, Ze Wang, Lu Yang, Hong Cheng

The virtual viewpoint is perceived as a new technique in virtual navigation, as yet not supported due to the lack of depth information and obscure camera parameters.

Optical Flow Estimation

Bio-Inspired Night Image Enhancement Based on Contrast Enhancement and Denoising

no code implementations11 Jul 2023 Xinyi Bai, Steffi Agino Priyanka, Hsiao-Jung Tung, Yuankai Wang

In this paper, a bio-inspired image enhancement algorithm is proposed to convert a low illuminance image to a brighter and clear one.

Denoising Image Enhancement +2

Cannot find the paper you are looking for? You can Submit a new open access paper.