Clinical potential of whole-genome data linked to mortality statistics in patients with breast cancer in the UK: a retrospective analysis


Publication: The Lancet Oncology

07 October 2025

Daniella Black, Helen Ruth Davies, Gene Ching Chiek Koh, Lucia Chmelova, Marko Cubric, Georgia Chalivelaki Chan, Andrea Degasperi, Jan Czarnecki, Ping Jing Toong, Yasin Memari, James Whitworth, Salome Jingchen Zhao, Yogesh Kumar, Shadi Basyuni, Giuseppe Rinaldi, Scott Shooter, Vladyslav Dembrovskyi, Rosie Davies, Maria Chatzou Dunford, Ellen Copson, Carlo Palmieri, Åke Borg,  John Ambrose,  Catey Bunce,  Alona Sosinsky,  Prabhu Arumugam,  Matthew Arthur Brown, Johan Staaf, Nicholas Turner,  Serena Nik-Zainal

 

Background


Breast cancer is the most frequently diagnosed cancer in women. Survival is generally considered favourable, yet some patients remain at risk of early death. We aimed to assess whether comprehensive whole-genome sequencing (WGS) linked to mortality data could add prognostic value to existing clinical measures and identify patients who might respond to targeted therapeutics.



Methods


In this integrative, retrospective analysis,  2445 breast cancer tumours were analysed  (any stage and molecular subtype) collected from 2403 patients recruited through 13 National Health Service Genomic Medicine Centres or hospitals in England affiliated to the 100 000 Genomes Project (100kGP) between 2012 and 2018. 2208 (90%) cases were linked with clinical data; mortality data were obtained for 1188 patients. Following high-depth WGS of tumour and matched normal DNA, comprehensive WGS profiling was performed, seeking driver mutations, mutational signatures, and compound algorithmic scores for homologous recombination repair deficiency (HRD), mismatch repair deficiency, and tumour mutational burden. Data from 1803 additional patients with breast cancer from three independent cohorts were used to validate various findings. To evaluate the prognostic value of WGS features,  univariable and multivariable Cox regression on data from patients was performed with stage I–III, ER-positive, HER2-negative breast cancer with a cancer-specific mortality endpoint (around 5-year follow-up).



Findings


Among 2445 tumours in the 100kGP breast cancer cohort, genomic characteristics with immediate personalised medicine potential in 656 (26·8%) was observed, including features reporting HRD (298 [12·2%] total cases and 76 [6·3%] ER-positive, HER2-negative cases), highly individualised driver events, mutations underpinning resistance to endocrine therapy, and mutational signatures indicating therapeutic vulnerabilities. 373 (15·2%) cases had WGS features with potential for translational research, including compromised base excision repair and non-homologous end-joining dependency. Structural variation burden (hazard ratio 3·9 [95 CI% 2·4–6·2]; p<0·0001), high levels of APOBEC signatures (2·5 [1·6–4·1]; p<0·0001), and TP53 drivers (3·9 [2·4–6·2]; p<0·0001) were independently prognostic of customary clinical measures (age at diagnosis, stage, and grade) in patients with ER-positive, HER2-negative breast cancer. A prognosticator was developed for ER-positive, HER2-negative breast cancer capable of identifying patients who require either increased intervention or therapy de-escalation, validating the framework in the independent Swedish Cancerome Analysis Network-Breast (SCAN-B) dataset.



Interpretation


Breast cancer genomes are rich in predictive and prognostic value. A two-step model is proposed for effective clinical application. First, the identification of candidates for targeted therapies or clinical trials using highly individualised genomic markers. Second, for patients without such features, the implementation of enhanced prognostication using genomic features alongside existing clinical decision-making factors.
© Copyright - NIHR Cambridge Biomedical Research Centre 2025