Search Results for author: Mengdi Li

Found 10 papers, 6 papers with code

Large Language Models for Orchestrating Bimanual Robots

no code implementations • 2 Apr 2024 • Kun Chu, Xufeng Zhao, Cornelius Weber, Mengdi Li, Wenhao Lu, Stefan Wermter

Although there has been rapid progress in endowing robots with the ability to solve complex manipulation tasks, generating control policies for bimanual robots to solve tasks involving two hands is still challenging because of the difficulties in effective temporal and spatial coordination.

In-Context Learning Language Modelling

Paper
Add Code

Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs

no code implementations • 30 Mar 2024 • Shu Yang, Jiayuan Su, Han Jiang, Mengdi Li, Keyuan Cheng, Muhammad Asif Ali, Lijie Hu, Di Wang

With the rise of large language models (LLMs), ensuring they embody the principles of being helpful, honest, and harmless (3H), known as Human Alignment, becomes crucial.

knowledge editing Navigate +1

Paper
Add Code

Causal State Distillation for Explainable Reinforcement Learning

1 code implementation • 30 Dec 2023 • Wenhao Lu, Xufeng Zhao, Thilo Fryen, Jae Hee Lee, Mengdi Li, Sven Magg, Stefan Wermter

This lack of transparency in RL models has been a long-standing problem, making it difficult for users to grasp the reasons behind an agent's behaviour.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models

no code implementations • 4 Nov 2023 • Kun Chu, Xufeng Zhao, Cornelius Weber, Mengdi Li, Stefan Wermter

Large Language Models (LLMs) demonstrate remarkable abilities to provide human-like feedback on user inputs in natural language.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic

1 code implementation • 23 Sep 2023 • Xufeng Zhao, Mengdi Li, Wenhao Lu, Cornelius Weber, Jae Hee Lee, Kun Chu, Stefan Wermter

Recent advancements in large language models have showcased their remarkable generalizability across various domains.

Causal Inference

Paper
Code

A Closer Look at Reward Decomposition for High-Level Robotic Explanations

no code implementations • 25 Apr 2023 • Wenhao Lu, Xufeng Zhao, Sven Magg, Martin Gromniak, Mengdi Li, Stefan Wermter

Explaining the behaviour of intelligent agents learned by reinforcement learning (RL) to humans is challenging yet crucial due to their incomprehensible proprioceptive states, variational intermediate goals, and resultant unpredictability.

Reinforcement Learning (RL) Vocal Bursts Intensity Prediction

Paper
Add Code

Chat with the Environment: Interactive Multimodal Perception Using Large Language Models

1 code implementation • 14 Mar 2023 • Xufeng Zhao, Mengdi Li, Cornelius Weber, Muhammad Burhan Hafez, Stefan Wermter

However, it remains challenging to ground LLMs in multimodal sensory input and continuous action output, while enabling a robot to interact with its environment and acquire novel information as its policies unfold.

Paper
Code

Internally Rewarded Reinforcement Learning

1 code implementation • 1 Feb 2023 • Mengdi Li, Xufeng Zhao, Jae Hee Lee, Cornelius Weber, Stefan Wermter

We study a class of reinforcement learning problems where the reward signals for policy learning are generated by an internal reward model that is dependent on and jointly optimized with the policy.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Visually Grounded Commonsense Knowledge Acquisition

1 code implementation • 22 Nov 2022 • Yuan YAO, Tianyu Yu, Ao Zhang, Mengdi Li, Ruobing Xie, Cornelius Weber, Zhiyuan Liu, Hai-Tao Zheng, Stefan Wermter, Tat-Seng Chua, Maosong Sun

In this work, we present CLEVER, which formulates CKE as a distantly supervised multi-instance learning problem, where models learn to summarize commonsense relations from a bag of images about an entity pair without any human annotation on image instances.

Language Modelling

Paper
Code

Visual Distant Supervision for Scene Graph Generation

1 code implementation • ICCV 2021 • Yuan YAO, Ao Zhang, Xu Han, Mengdi Li, Cornelius Weber, Zhiyuan Liu, Stefan Wermter, Maosong Sun

In this work, we propose visual distant supervision, a novel paradigm of visual relation learning, which can train scene graph models without any human-labeled data.

Graph Generation Predicate Classification +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.