Search Results for author: Xiaopeng Lu

Found 9 papers, 5 papers with code

Improvement of Frequency Source Phase Noise Reduction Design under Vibration Condition

no code implementations 6 Feb 2024 Liwei Yin, Yongjiang Shu, Heng Zhang, Yuefei Dai, Xiaopeng Lu, Yunlong Lian, Zhonghua Wang, Yong Ding

Reasonable vibration reduction design is an important way to achieve low phase noise index of airborne frequency source output signal.

Core Challenges in Embodied Vision-Language Planning

no code implementations 5 Apr 2023 Jonathan Francis, Nariaki Kitamura, Felix Labelle, Xiaopeng Lu, Ingrid Navarro, Jean Oh

Recent advances in the areas of Multimodal Machine Learning and Artificial Intelligence (AI) have led to the development of challenging tasks at the intersection of Computer Vision, Natural Language Processing, and Robotics.

VL-CheckList: Evaluating Pre-trained Vision-Language Models with Objects, Attributes and Relations

1 code implementation 1 Jul 2022 Tiancheng Zhao, Tianqi Zhang, Mingwei Zhu, Haozhan Shen, Kyusong Lee, Xiaopeng Lu, Jianwei Yin

Inspired by the CheckList for testing natural language processing, we exploit VL-CheckList, a novel framework to understand the capabilities of VLP models.

Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling

no code implementations 20 Aug 2021 Xiaopeng Lu, Zhen Fan, Yansen Wang, Jean Oh, Carolyn P. Rose

LOGOS leverages two grounding tasks to better localize the key information of the image, utilizes scene text clustering to group individual OCR tokens, and learns to select the best answer from different sources of OCR (Optical Character Recognition) texts.

Data Ablation, Optical Character Recognition, +4

CIGLI: Conditional Image Generation from Language & Image

1 code implementation 20 Aug 2021 Xiaopeng Lu, Lynnette Ng, Jared Fernandez, Hao Zhu

Instead of generating an image based on text as in text-image generation, this task requires the generation of an image from a textual description and an image prompt.

Conditional Image Generation

Core Challenges in Embodied Vision-Language Planning

no code implementations 26 Jun 2021 Jonathan Francis, Nariaki Kitamura, Felix Labelle, Xiaopeng Lu, Ingrid Navarro, Jean Oh

Recent advances in the areas of multimodal machine learning and artificial intelligence (AI) have led to the development of challenging tasks at the intersection of Computer Vision, Natural Language Processing, and Embodied AI.

SF-QA: Simple and Fair Evaluation Library for Open-domain Question Answering

1 code implementation EACL 2021 Xiaopeng Lu, Kyusong Lee, Tiancheng Zhao

Although open-domain question answering (QA) draws great attention in recent years, it requires large amounts of resources for building the full system and is often difficult to reproduce previous results due to complex configurations.

Open-Domain Question Answering

VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words

1 code implementation ACL 2021 Xiaopeng Lu, Tiancheng Zhao, Kyusong Lee

To the best of our knowledge, VisualSparta is the first transformer-based text-to-image retrieval model that can achieve real-time searching for large-scale datasets, with significant accuracy improvement compared to previous state-of-the-art methods.

Cross-Modal Retrieval, Image Retrieval, +2
