Search Results for author: Xiyang Wu

Found 8 papers, 7 papers with code

A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges

2 code implementations4 Jan 2025 Zongxia Li, Xiyang Wu, Hongyang Du, Fuxiao Liu, Huy Nghiem, Guangyao Shi

Multimodal Vision Language Models (VLMs) have emerged as a transformative topic at the intersection of computer vision and natural language processing, enabling machines to perceive and reason about the world through both visual and textual modalities.

Fairness Hallucination +3

SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining

no code implementations26 Sep 2024 Ruiqi Xian, Xiyang Wu, Tianrui Guan, Xijun Wang, Boqing Gong, Dinesh Manocha

We introduce SOAR, a novel Self-supervised pretraining algorithm for aerial footage captured by Unmanned Aerial Vehicles (UAVs).

Action Recognition Object +1

AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

1 code implementation4 Apr 2024 Tianrui Guan, Ruiqi Xian, Xijun Wang, Xiyang Wu, Mohamed Elnoor, Daeun Song, Dinesh Manocha

We present AGL-NET, a novel learning-based method for global localization using LiDAR point clouds and satellite maps.

On the Vulnerability of LLM/VLM-Controlled Robotics

1 code implementation15 Feb 2024 Xiyang Wu, Souradip Chakraborty, Ruiqi Xian, Jing Liang, Tianrui Guan, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi

In this work, we highlight vulnerabilities in robotic systems integrating large language models (LLMs) and vision-language models (VLMs) due to input modality sensitivities.

Language Modelling Robot Manipulation

FireCommander: An Interactive, Probabilistic Multi-agent Environment for Heterogeneous Robot Teams

1 code implementation31 Oct 2020 Esmaeil Seraj, Xiyang Wu, Matthew Gombolay

The FireCommander environment can be useful for research topics spanning a wide range of applications from Reinforcement Learning (RL) and Learning from Demonstration (LfD), to Coordination, Psychology, Human-Robot Interaction (HRI) and Teaming.

Combinatorial Optimization reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.