Abstract
Predictive, stable and interpretable gene signatures are generally seen as an important step towards better personalized medicine. Over the last decade, various methods have been proposed for this purpose. However, an important obstacle to making gene signatures a standard clinical tool is their typically low reproducibility, combined with the difficulty of achieving a clear biological interpretation. For this reason, there has been growing interest in recent years in approaches that integrate information from molecular interaction networks. Here we propose a technique that integrates network information as well as different kinds of experimental data (exemplified here by mRNA and miRNA expression) into one classifier. This is done by smoothing t-statistics of individual genes or miRNAs over the structure of a combined protein-protein interaction (PPI) and miRNA-target gene network. A permutation test is conducted to select features in a highly consistent manner, and subsequently a Support Vector Machine (SVM) classifier is trained. Compared to several competing methods, our algorithm shows overall better prediction performance for early versus late disease relapse and higher signature stability. Moreover, the obtained gene lists can be clearly associated with biological knowledge, such as known disease genes and KEGG pathways. We demonstrate that our data integration strategy can improve classification performance compared to using a single data source only. Our method, called stSVM, is available in the R package netClass on CRAN (http://cran.r-project.org).
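The feature selection steps described in the abstract (per-feature t-statistics, smoothing over a network, permutation test) can be illustrated with a minimal sketch. This is not the netClass implementation. The abstract does not state the smoothing kernel, the exact test statistic, or the significance threshold, so the sketch assumes a random-walk-with-restart diffusion, Welch's t-statistic, a symmetric adjacency matrix, and a 0.05 cutoff. The function names `welch_t`, `feature_t_stats`, `smooth` and `select_features` are hypothetical, and the final SVM training step is left out.

```python
import math
import random
import statistics


def welch_t(a, b):
    """Welch's t-statistic for two lists of values (assumed test statistic)."""
    se = math.sqrt(statistics.variance(a) / len(a) + statistics.variance(b) / len(b))
    return 0.0 if se == 0 else (statistics.mean(a) - statistics.mean(b)) / se


def feature_t_stats(X, y):
    """Absolute t-statistic per feature; X is samples x features, y holds 0/1 labels."""
    stats = []
    for j in range(len(X[0])):
        g0 = [row[j] for row, label in zip(X, y) if label == 0]
        g1 = [row[j] for row, label in zip(X, y) if label == 1]
        stats.append(abs(welch_t(g1, g0)))
    return stats


def smooth(scores, adjacency, alpha=0.5, steps=20):
    """Diffuse scores over a symmetric network (random walk with restart, an assumption)."""
    n = len(scores)
    degree = [sum(row) for row in adjacency]
    current = list(scores)
    for _ in range(steps):
        nxt = []
        for i in range(n):
            # Each neighbour j passes on its current score, split evenly over its edges.
            neigh = sum(
                adjacency[i][j] * current[j] / degree[j]
                for j in range(n)
                if adjacency[i][j] and degree[j] > 0
            )
            nxt.append(alpha * neigh + (1 - alpha) * scores[i])
        current = nxt
    return current


def select_features(X, y, adjacency, n_perm=200, level=0.05, seed=0):
    """Keep features whose smoothed score is rarely matched under label permutation."""
    rng = random.Random(seed)
    observed = smooth(feature_t_stats(X, y), adjacency)
    exceed = [0] * len(observed)
    labels = list(y)
    for _ in range(n_perm):
        rng.shuffle(labels)  # break the link between samples and classes
        null = smooth(feature_t_stats(X, labels), adjacency)
        for j, (obs, perm) in enumerate(zip(observed, null)):
            if perm >= obs:
                exceed[j] += 1
    # Add-one correction keeps every empirical p-value strictly positive.
    pvals = [(count + 1) / (n_perm + 1) for count in exceed]
    selected = [j for j, p in enumerate(pvals) if p <= level]
    return selected, pvals


if __name__ == "__main__":
    # Toy data: 4 features on a chain network, features 0 and 1 differ between classes.
    rng = random.Random(1)
    y = [0] * 10 + [1] * 10
    X = [[rng.gauss(3.0 * label if j < 2 else 0.0, 1.0) for j in range(4)] for label in y]
    chain = [[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 1], [0, 0, 1, 0]]
    selected, pvals = select_features(X, y, chain)
    print("selected features:", selected)
    print("p-values:", pvals)
```

In a full pipeline, the columns of `X` listed in `selected` would then be passed to an SVM classifier. Because smoothing lets a feature borrow evidence from its network neighbours, a weakly differential gene next to strongly differential ones may also pass the permutation test, which is the intended effect of integrating the network.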
Publication Info
- Year: 2013
- Type: article
- Volume: 8
- Issue: 9
- Pages: e73074
- Citations: 73
- Access: Closed
Identifiers
- DOI: 10.1371/journal.pone.0073074