A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model

Ouwabunmi, Babalola AbdulHafeez (2025) A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model. Masters thesis, Universiti Sains Malaysia.

[img] PDF - Submitted Version
Download (658kB)

Abstract

The accurate prediction and characterization of small open reading frames (smORFs) are critical for understanding their functional roles in gene regulation and cellular processes. This study presents the development and evaluation of a novel hybrid machine learning algorithm that integrates the strengths of Random Forest and Gradient Boosting models to enhance the prediction of smORFs. The performance of the hybrid algorithm was rigorously assessed and compared to the standalone models using comprehensive evaluation metrics, including accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC). Results demonstrated that the hybrid model achieved superior performance, with an accuracy of 0.998, a sensitivity of 0.998, and a specificity of 1.00, significantly outperforming both the Random Forest and Gradient Boosting models individually. Additionally, transcriptomic data from Mycobacterium tuberculosis were utilized to validate the predictions, highlighting the biological relevance and potential applications of the proposed approach in computational biology. This study underscores the importance of combining machine learning techniques to improve prediction accuracy and provides a robust framework for advancing smORF discovery. While the focus was on comparing standalone and hybrid models, the study identifies opportunities for future benchmarking against external tools to further validate its contributions. The findings contribute to both computational and biological research, offering insights into smORF prediction methodologies and their applications.

Item Type: Thesis (Masters)
Uncontrolled Keywords: hybrid algorithm
Subjects: R Medicine
R Medicine > RA Public aspects of medicine
Divisions: Kampus Kesihatan (Health Campus) > Pusat Pengajian Sains Kesihatan (School of Health Sciences) > Thesis
Depositing User: Mr Abdul Hadi Mohammad
Date Deposited: 05 Nov 2025 01:41
Last Modified: 19 Jan 2026 04:13
URI: http://eprints.usm.my/id/eprint/62965

Actions (login required)

View Item View Item
Share