An Improved Hierarchical Deep Reinforcement Learning For Complex Imperfect-Information Card Games

Luo, Qian (2025) An Improved Hierarchical Deep Reinforcement Learning For Complex Imperfect-Information Card Games. PhD thesis, Perpustakaan Hamzah Sendut.

PDF
Download (420kB)

Abstract

Deep reinforcement learning (drl) has achieved significant breakthroughs in a variety of games, both with perfect and imperfect information, such as go, texas hold’em, and starcraft ii. However, doudizhu and big2 are classic complex card games with imperfect information and are popular in asia. They present new challenges for ai in competition, cooperation, inferring imperfect information, handling large state-action spaces, and training with sparse rewards. The deep monte carlo (dmc) method for these card games achieves significant success but still faces three key research problems: slowlearning speed, high loss during learning, and performance optimization. The primary objective of this research is to enhance the performance for these complex imperfect-information card games with a hierarchical deep reinforcement learning (hdrl) framework. Specifically, this main goal is divided into three sub-research objectives: improving learning efficiency indmctraining through oracle guiding, enhancing learning stability with adaptive deep monte carlo (admc), and improving the performance of proximal policy optimization (ppo) using relative advantage reward shaping (rars).

Item Type:	Thesis (PhD)
Subjects:	Q Science > QA Mathematics > QA75.5-76.95 Electronic computers. Computer science
Divisions:	Pusat Pengajian Sains Komputer (School of Computer Sciences) > Thesis
Depositing User:	Mr Hasmizar Mansor
Date Deposited:	04 May 2026 02:58
Last Modified:	04 May 2026 02:58
URI:	http://eprints.usm.my/id/eprint/64044

Actions (login required)

View Item