Reinforcement Learning for Autonomous Lunar Landing: A Comparative Analysis of Algorithm Performance

Manimegalai, R (2025) Reinforcement Learning for Autonomous Lunar Landing: A Comparative Analysis of Algorithm Performance. 2025 Second International Conference on Intelligent Technologies for Sustainable Electric and Communications Systems (iTech SECOM). pp. 1-7.

Full text not available from this repository.

Abstract

This paper addresses the challenge of developing autonomous control systems for various vehicles and devicesspecifically spacecraft landing procedures. Precise planetary landing is a classical control problem in aerospace engineering, which requires that the spacecraft decelerates/repels, orients and lands safely under uncertain dynamics with limited fuel. Conventional control techniques like PID and model predictive control need good models and substantial tuning. In this paper we explore the ability of recent reinforcement learning (RL) algorithms to learn such control policies directly from interaction with a physics simulator. We compare our approach against four popular algorithms - Deep Q-Network (DQN), Double DQN, Dueling DQN and Proximal Policy Optimization (PPO) on OpenAI Gymnasium and LunarLander-v2 benchmark. We present a unified PyTorch implementation, conduct multi-seed experiments and compare in terms of convergence speed, stability of rewards and computation efforts. We observe that PPO achieves competitive rewards in a quarter of training time compared to value-based methods, while the best peak reward is obtained by Dueling DQN. We discuss for real world autonomous landing and other high risk robotic tasks.

Item Type: Article
Subjects: Computer Science and Engineering > Drones and Automated Vehicles
Divisions: Computer Science and Engineering
Depositing User: Dr Krishnamurthy V
Date Deposited: 05 May 2026 10:21
Last Modified: 05 May 2026 10:22
URI: https://ir.psgitech.ac.in/id/eprint/1782

Actions (login required)

View Item
View Item