A Comparative Study for Modelling the Survival of Breast Cancer Patients in the West of Iran


  •  Mozhgan Safe    
  •  Hossein Mahjub    
  •  Javad Faradmal    

Abstract

BACKGROUND: Breast cancer is the main cause of women cancer mortality. Therefore, precise prediction of patients’ risk level is the major concern in therapeutic strategies. Although statistical learning algorithms are high quality risk prediction methods, but usually their better prediction quality leads to more loss of interpretability. Therefore, the aim of this study is to compare ‘Model-Based Recursive Partitioning’ and ‘Random Survival Forest’; whether the partitioning, as the more interpretable learning technique, could be a suitable successor for forests.

PATIENTS & METHODS: The applied dataset for this retrospective cohort study includes the information of 539 Iranian females with breast cancer. To model the patients’ survival, various learning algorithms were fitted and their accuracy measures were statistically compared by means of several precision criteria.

RESULTS: This study verified the stability of ‘Model-based Recursive Partitioning’, further to ‘Random Survival Forest’ deficiency to present a unique pervasive model. Moreover, except ‘Log-Logistic-Based Recursive Partitioning’, none of the methods significantly outperformed ‘Exponential- Based Recursive Partitioning’.

CONCLUSIONS: Briefly, it was concluded that the loss of interpretability due to the use of over complex models, may not always counterbalanced by the amount of prediction improvements.



This work is licensed under a Creative Commons Attribution 4.0 License.