Search Results for author: Renato Mancuso

Found 7 papers, 4 papers with code

The SwaNNFlight System: On-the-Fly Sim-to-Real Adaptation via Anchored Learning

1 code implementation17 Jan 2023 Bassel El Mabsout, Shahin Roozkhosh, Siddharth Mysore, Kate Saenko, Renato Mancuso

And (iii) anchor critics to help stabilize the fine-tuning of agents during sim-to-real transfer, online learning from real data while retaining behavior optimized in simulation.

Reinforcement Learning (RL)

Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL

no code implementations23 Feb 2021 Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures.

Reinforcement Learning (RL)

Regularizing Action Policies for Smooth Control with Reinforcement Learning

no code implementations11 Dec 2020 Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

A critical problem with the practical utility of controllers trained with deep Reinforcement Learning (RL) is the notable lack of smoothness in the actions learned by the RL policies.

reinforcement-learning Reinforcement Learning (RL)

Observing the Invisible: Live Cache Inspection for High-Performance Embedded Systems

1 code implementation23 Jul 2020 Dharmesh Tarapore, Shahin Roozkhosh, Steven Brzozowski, Renato Mancuso

The vast majority of high-performance embedded systems implement multi-level CPU cache hierarchies.

Other Computer Science

Reinforcement Learning for UA V Attitude Control

no code implementations ACM Transactions on Cyber-Physical Systems 2019 William Koch, Renato Mancuso, Richard West, Azer Bestavros

Abstract—Autopilot systems are typically composed of an “inner loop” providing stability and control, while an “outer loop” is responsible for mission-level objectives, e. g. way-point navigation.

reinforcement-learning Reinforcement Learning (RL)

Reinforcement Learning for UAV Attitude Control

2 code implementations11 Apr 2018 William Koch, Renato Mancuso, Richard West, Azer Bestavros

Autopilot systems are typically composed of an "inner loop" providing stability and control, while an "outer loop" is responsible for mission-level objectives, e. g. way-point navigation.

Robotics

Deterministic Memory Abstraction and Supporting Multicore System Architecture

2 code implementations17 Jul 2017 Farzad Farshchi, Prathap Kumar Valsan, Renato Mancuso, Heechul Yun

In this paper, we make a case that a fundamental problem that prevents efficient and predictable real-time computing on multicore is the lack of a proper memory abstraction to express memory criticality, which cuts across various layers of the system: the application, OS, and hardware.

Hardware Architecture Operating Systems Performance

Cannot find the paper you are looking for? You can Submit a new open access paper.