A Study on Predicting Lung Cancer Using RNA-Sequencing Data with Ensemble Learning

In this paper, we explore the application of RNA-sequencing data and ensemble machine learning to predict lung cancer and to inform treatment strategies for the disease, a leading cause of cancer mortality worldwide. The study applies Random Forest, XGBoost, and LightGBM models to gene expression profiles from large datasets, aiming to improve predictive accuracy for lung cancer prognosis. The methodology focuses on preprocessing RNA-seq data to standardize expression levels across samples and on applying ensemble algorithms to improve prediction stability and reduce overfitting. Key findings indicate that ensemble models, especially XGBoost, substantially outperform traditional predictive models. Genetic markers such as ADGRF5 are identified as important for predicting lung cancer outcomes. In conclusion, ensemble learning on RNA-seq data proves highly effective in predicting lung cancer, suggesting a potential shift toward more precise and personalized treatment approaches. The results support further integration of molecular and clinical data to refine diagnostic models and improve clinical outcomes, underscoring the role of advanced molecular diagnostics in improving patient survival and quality of life. This study lays the groundwork for future research on applying RNA-sequencing data and ensemble machine learning techniques in clinical settings.
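To make the preprocessing step more concrete, the following is a minimal sketch of one common way to standardize expression levels across samples: converting raw read counts to log2 counts per million and then z-scoring each gene. The abstract does not state which normalization the authors used, so the choice of log-CPM and z-scoring, the function names `log_cpm` and `zscore_genes`, and the toy count matrix are all assumptions made for illustration only.

```python
import math
from statistics import mean, pstdev


def log_cpm(counts):
    """Convert one sample's raw read counts to log2(CPM + 1).

    Assumption: counts-per-million scaling is used to remove sequencing-depth
    differences; the paper may have used a different normalization.
    """
    total = sum(counts)
    if total == 0:
        raise ValueError("sample has no reads")
    return [math.log2(c / total * 1_000_000 + 1) for c in counts]


def zscore_genes(matrix):
    """Standardize each gene (column) to mean 0 and unit variance across samples (rows)."""
    n_samples, n_genes = len(matrix), len(matrix[0])
    scaled_cols = []
    for g in range(n_genes):
        col = [row[g] for row in matrix]
        mu, sigma = mean(col), pstdev(col)
        # A gene with constant expression carries no signal; map it to zeros.
        scaled_cols.append([0.0 if sigma == 0 else (v - mu) / sigma for v in col])
    return [[scaled_cols[g][s] for g in range(n_genes)] for s in range(n_samples)]


# Hypothetical toy data: 3 samples (rows) x 4 genes (columns).
raw_counts = [
    [10, 200, 0, 790],
    [20, 400, 0, 1580],  # same composition as the first sample, twice the depth
    [500, 100, 0, 400],
]
normalized = zscore_genes([log_cpm(sample) for sample in raw_counts])
```

In this sketch the first two samples end up with identical feature vectors, because they differ only in sequencing depth. A matrix prepared this way could then be passed to tree-based ensemble classifiers such as the Random Forest, XGBoost, and LightGBM models named in the abstract.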

Keywords: Lung Cancer, RNA-sequencing, Gene Expression, Ensemble Learning, Machine Learning

Submission Date: 2024-05-20

Revised Date: 2024-06-14

Accepted Date: 2024-06-30

Journal: 한국인공지능학회지

Korean title: 앙상블 기법을 활용한 RNA-Sequencing 데이터의 폐암 예측 연구
