Karolinska Institutet
Browse

Study design requirements for RNA sequencing-based breast cancer diagnostics.

Download (606.42 kB)
journal contribution
posted on 2024-10-30, 13:40 authored by Arvind Singh Mer, Daniel Klevebring, Henrik GrönbergHenrik Grönberg, Mattias RantalainenMattias Rantalainen
Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic.

History

File version

  • Published

Publication status

Published online

Sub type

Article

Journal

Sci Rep

ISSN

2045-2322

eISSN

2045-2322

Volume

6

Issue

1

Article number

20200

Language

  • eng

Original self archiving date

2017-01-16

Usage metrics

    Articles

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC