Luo, Qian (2025) An Improved Hierarchical Deep Reinforcement Learning For Complex Imperfect-Information Card Games. PhD thesis, Perpustakaan Hamzah Sendut.
|
PDF
Download (420kB) |
Abstract
Deep reinforcement learning (drl) has achieved significant breakthroughs in a variety of games, both with perfect and imperfect information, such as go, texas hold’em, and starcraft ii. However, doudizhu and big2 are classic complex card games with imperfect information and are popular in asia. They present new challenges for ai in competition, cooperation, inferring imperfect information, handling large state-action spaces, and training with sparse rewards. The deep monte carlo (dmc) method for these card games achieves significant success but still faces three key research problems: slowlearning speed, high loss during learning, and performance optimization. The primary objective of this research is to enhance the performance for these complex imperfect-information card games with a hierarchical deep reinforcement learning (hdrl) framework. Specifically, this main goal is divided into three sub-research objectives: improving learning efficiency indmctraining through oracle guiding, enhancing learning stability with adaptive deep monte carlo (admc), and improving the performance of proximal policy optimization (ppo) using relative advantage reward shaping (rars).
| Item Type: | Thesis (PhD) |
|---|---|
| Subjects: | Q Science > QA Mathematics > QA75.5-76.95 Electronic computers. Computer science |
| Divisions: | Pusat Pengajian Sains Komputer (School of Computer Sciences) > Thesis |
| Depositing User: | Mr Hasmizar Mansor |
| Date Deposited: | 04 May 2026 02:58 |
| Last Modified: | 04 May 2026 02:58 |
| URI: | http://eprints.usm.my/id/eprint/64044 |
Actions (login required)
![]() |
View Item |



