Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer

2021 Computational and Structural Biotechnology Journal 1,673 citations

Abstract

Extensive research is directed at uncovering new biomarkers capable of stratifying breast cancer patients into clinically relevant cohorts. However, a ranking of the overall performance of such marker candidates relative to all other genes is virtually absent. Here, we present a ranking of all survival-related genes in chemotherapy-treated basal and estrogen receptor-positive/HER2-negative breast cancer. We searched the GEO repository for transcriptomic datasets with available follow-up and clinical data. After quality control and normalization, samples were entered into an integrated database. Molecular subtypes were assigned using gene expression data. Relapse-free survival analysis was performed using Cox proportional hazards regression. The false discovery rate was computed to correct for multiple hypothesis testing. Kaplan-Meier plots were drawn to visualize the best-performing genes. The entire database includes 7,830 unique samples from 55 independent datasets. Of those with available relapse-free survival time, 3,382 samples were estrogen receptor-positive and 696 were basal. In chemotherapy-treated ER-positive/ERBB2-negative patients, the significant prognostic biomarker genes achieved hazard ratios between 1.76 and 3.33 with a p value below 5.8E-04. The significant prognostic genes in adjuvant chemotherapy-treated basal breast cancer samples reached hazard ratios between 1.88 and 3.61 with a p value below 7.2E-04. Our integrated platform was extended to enable the validation of future biomarker candidates. A reference ranking for all genes in two chemotherapy-treated breast cancer cohorts is presented. The results help to deprioritize genes with unlikely clinical significance and to focus future research on the most promising candidates.
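
The abstract names the statistical steps but not how they were implemented. The following is a minimal sketch of two of them, not the author's code: a Kaplan-Meier estimate for right-censored relapse-free survival times, and a false discovery rate adjustment. The Benjamini-Hochberg procedure is an assumption here, since the abstract does not say which FDR method was used, and the function names `kaplan_meier` and `benjamini_hochberg` are hypothetical.

```python
from typing import List, Sequence, Tuple


def kaplan_meier(times: Sequence[float],
                 events: Sequence[int]) -> List[Tuple[float, float]]:
    """Return (time, survival probability) at each distinct event time.

    events[i] is 1 if relapse was observed at times[i], 0 if censored.
    """
    order = sorted(range(len(times)), key=lambda i: times[i])
    at_risk = len(times)
    survival = 1.0
    curve: List[Tuple[float, float]] = []
    i = 0
    while i < len(order):
        t = times[order[i]]
        relapses = 0
        removed = 0
        # Group all samples that share the same follow-up time.
        while i < len(order) and times[order[i]] == t:
            relapses += events[order[i]]
            removed += 1
            i += 1
        if relapses > 0:
            # Product-limit step: multiply by the fraction surviving time t.
            survival *= 1.0 - relapses / at_risk
            curve.append((t, survival))
        at_risk -= removed
    return curve


def benjamini_hochberg(p_values: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the original order.

    Assumption: the abstract does not state which FDR procedure was used.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted
```

In a transcriptome-wide screen of this kind, one p value per gene would be passed to the adjustment, and only genes below a chosen FDR cutoff would go on to be plotted. The Cox regression step itself is omitted because it needs a numerical optimizer, which is normally taken from a statistics library.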

Keywords

Breast cancer, Oncology, Proportional hazards model, Biomarker, Hazard ratio, Internal medicine, Medicine, Transcriptome, Cancer, Survival analysis, Bioinformatics, Gene, Biology, Gene expression, Genetics

Related Publications

Breast Cancer Treatment

Breast cancer consists of 3 major tumor subtypes categorized according to estrogen or progesterone receptor expression and ERBB2 gene amplification. The 3 subtypes have distinct...

2019 JAMA 4488 citations

Publication Info

Year: 2021
Type: article
Volume: 19
Pages: 4101-4109
Citations: 1673
Access: Closed

Citation Metrics

OpenAlex: 1673
Influential: 37
CrossRef: 762

Cite This

Balázs Győrffy (2021). Survival analysis across the entire transcriptome identifies biomarkers with the highest prognostic power in breast cancer. Computational and Structural Biotechnology Journal, 19, 4101-4109. https://doi.org/10.1016/j.csbj.2021.07.014

Identifiers

DOI: 10.1016/j.csbj.2021.07.014
PMID: 34527184
PMCID: PMC8339292

Data Quality

Data completeness: 86%