MS, University of Cincinnati, 2024, Medicine: Biostatistics (Environmental Health)
Abstract
Background: Breast cancer remains a leading cause of cancer-related morbidity and mortality in women worldwide. Traditional methods for predicting metastasis in breast cancer rely primarily on tumor pathology characteristics, such as tumor size, TNM grade, and receptor status. However, these methods do not fully account for patient-centered health factors, which could also play a role in metastasis risk. Factors such as a patient's overall physiological health, history of anti-neoplastic treatments, and personal and family history of cancer may also significantly impact the likelihood of developing distant metastasis in breast cancer. This study aims to develop a predictive model for distant metastasis in breast cancer that incorporates these broader, patient-centered factors for a more comprehensive risk assessment.
Methods: This study analyzed all 4296 female breast cancer cases from the 2021 NIS, assessing 130 variables. Among these cases, 1691 (39.36%) had distant metastasis, while 2605 (60.64%) did not. For metastasis prediction, 21 key variables were selected, including age, race, anti-neoplastic treatment, presence of other cancers, cancer history, smoking, depression/anxiety, elective admission, All Patient Refined DRG Severity of Illness Subclass (APRDRG Severity), and various comorbidities. A binary logistic regression model was developed to build the predictive model for distant metastasis in breast cancer, and refined through backward elimination with cross-validation used for validation. Additionally, eight additional variables, such as morbidity, length of stay, and total charges were analyzed for comparison but were not included in the predictive model. Statistical comparisons between metastatic and non-metastatic groups were conducted, with continuous variables assessed using the Mann-Whitney U test and categorical variables using the Chi-Square test or Fisher's Exact test. The significance level (a) was set at 0.05. All analy (open full item for complete abstract)
Committee: Roman Jandarov Ph.D. (Committee Member); Marepalli Rao Ph.D. (Committee Chair)
Subjects: Biostatistics