A Reinforcement Learning Method Based on an Improved Sampling Mechanism for Unmanned Aerial Vehicle Penetration

doi:10.3390/aerospace10070642

Yue Wang, Kexv Li, Xing Zhuang, Xinyu Liu, Hanyu Li

A Reinforcement Learning Method Based on an Improved Sampling Mechanism for Unmanned Aerial Vehicle Penetration

Aerospace Engineering

The penetration of unmanned aerial vehicles (UAVs) is an important aspect of UAV games. In recent years, UAV penetration has generally been solved using artificial intelligence methods such as reinforcement learning. However, the high sample demand of the reinforcement learning method poses a significant challenge specifically in the context of UAV games. To improve the sample utilization in UAV penetration, this paper innovatively proposes an improved sampling mechanism called task completion division (TCD) and combines this method with the soft actor critic (SAC) algorithm to form the TCD-SAC algorithm. To compare the performance of the TCD-SAC algorithm with other related baseline algorithms, this study builds a dynamic environment, a UAV game, and conducts training and testing experiments in this environment. The results show that among all the algorithms, the TCD-SAC algorithm has the highest sample utilization rate and the best actual penetration results, and the algorithm has a good adaptability and robustness in dynamic environments.

Need a simple solution for managing your BibTeX entries? Explore CiteDrive!

Web-based, modern reference management
Collaborate and share with fellow researchers
Integration with Overleaf
Comprehensive BibTeX/BibLaTeX support
Save articles and websites directly from your browser
Search for new articles from a database of tens of millions of references

Try out CiteDrive

A Reinforcement Learning Method Based on an Improved Sampling Mechanism for Unmanned Aerial Vehicle Penetration

Need a simple solution for managing your BibTeX entries? Explore CiteDrive!

More from our Archive

Improved Amplification Factor Transport Transition Model for Transonic Boundary Layers

An Efficient Framework for Autonomous UAV Missions in Partially-Unknown GNSS-Denied Environments

Flow Structure around a Multicopter Drone: A Computational Fluid Dynamics Analysis for Sensor Placement Considerations

Data-Enabled Recalibration of the Spalart–Allmaras Model

Modular quasi-zero-stiffness isolator based on compliant constant-force mechanisms for low-frequency vibration isolation

TGC-YOLOv5: An Enhanced YOLOv5 Drone Detection Model Based on Transformer, GAM & CA Attention Mechanism

Machine Learning Assisted Prediction of Airfoil Lift-to-Drag Characteristics for Mars Helicopter

Natural Language Processing (NLP) in Aviation Safety: Systematic Review of Research and Outlook into the Future

Digitalization and Spatial Documentation of Post-Earthquake Temporary Housing in Central Italy: An Integrated Geomatic Approach Involving UAV and a GIS-Based System

Airfoil Analysis and Optimization Using a Petrov–Galerkin Finite Element and Machine Learning

A Reinforcement Learning Method Based on an Improved Sampling Mechanism for Unmanned Aerial Vehicle Penetration

Need a simple solution for managing your BibTeX entries? Explore CiteDrive!

More from our Archive

Improved Amplification Factor Transport Transition Model for Transonic Boundary Layers

An Efficient Framework for Autonomous UAV Missions in Partially-Unknown GNSS-Denied Environments

Flow Structure around a Multicopter Drone: A Computational Fluid Dynamics Analysis for Sensor Placement Considerations

Data-Enabled Recalibration of the Spalart–Allmaras Model

Modular quasi-zero-stiffness isolator based on compliant constant-force mechanisms for low-frequency vibration isolation

TGC-YOLOv5: An Enhanced YOLOv5 Drone Detection Model Based on Transformer, GAM &amp; CA Attention Mechanism

Machine Learning Assisted Prediction of Airfoil Lift-to-Drag Characteristics for Mars Helicopter

Natural Language Processing (NLP) in Aviation Safety: Systematic Review of Research and Outlook into the Future

Digitalization and Spatial Documentation of Post-Earthquake Temporary Housing in Central Italy: An Integrated Geomatic Approach Involving UAV and a GIS-Based System

Airfoil Analysis and Optimization Using a Petrov–Galerkin Finite Element and Machine Learning

TGC-YOLOv5: An Enhanced YOLOv5 Drone Detection Model Based on Transformer, GAM & CA Attention Mechanism