英文摘要 |
Cardiovascular disease is the leading cause of death worldwide, so early detection is important. Therefore, this study mainly explores the importance of risk factors and their explanatory power for cardiovascular disease when using machine learning methods to establish a cardiovascular disease prediction model. It is hoped that the performance of prediction models for cardiovascular disease can be improved in the future through this study. We use the heart disease dataset obtained from Kaggle to explore risk factors adopted by the cardiovascular disease prediction model. According to the result obtained, the risk factors used are, ranked in order of their impact, chest pain type-asymptomatic (compared to the type of non-anginal pain), fasting blood sugar-high, exercise-induced angina-y, exercise-induced ST-segment depression, sex-female, and ST-segment slope-up (compared to the type of flat). Among them, the influence of the first four factors is positive, indicating that the possibility of suffering from cardiovascular disease increases; the impact of the last two factors is negative, and the possibility of suffering from cardiovascular disease decreases. The prediction model's explanatory power (R2) for new samples is 0.464. It is suggested that investigating other potentially important risk factors to help improve the predictive power of models is needed. |