Multiple Omics Data Integration to Identify Long Noncoding RNA Responsible for Breast Cancer-Related Mortality. Academic Article uri icon


  • Long non-coding RNAs (lncRNAs) are a large and diverse class of transcribed RNAs, which have been shown to play a significant role in developing cancer. In this study, we apply integrative modeling framework to integrate the DNA copy number variation (CNV), lncRNA expression, and downstream target protein expression to predict patient survival in breast cancer. We develop a 3-stage model combining a mechanical model (lncRNA regressed on CNV and target proteins regressed on lncRNA) and a clinical model (survival regressed on estimated effects from the mechanical models). Using lncRNAs (such as HOTAIR and MALAT1) along with their CNV, target protein expressions, and survival outcomes from The Cancer Genome Atlas (TCGA) database, we show that predicted mean square error and integrated Brier score (IBS) are both lower for the proposed 3-step integrated model than that of 2-step model. Therefore, the integrative model has better predictive ability than the 2-step model not considering target protein information.

published proceedings

  • Cancer Inform

altmetric score

  • 0.5

author list (cited authors)

  • Roy Sarkar, T., Maity, A. K., Niu, Y., & Mallick, B. K.

citation count

  • 3

complete list of authors

  • Roy Sarkar, Tapasree||Maity, Arnab Kumar||Niu, Yabo||Mallick, Bani K

publication date

  • January 2019