Publication:
Uniformly shaped harmonization combines human transcriptomic data from different platforms while retaining their biological properties and differential gene expression patterns

dc.contributor.authorYILMAZ, BETÜL
dc.contributor.authorsBorisov N., Tkachev V., Simonov A., Sorokin M., Kim E., Kuzmin D., Karademir-Yilmaz B., Buzdin A.
dc.date.accessioned2023-10-03T10:29:14Z
dc.date.accessioned2026-01-11T17:23:34Z
dc.date.available2023-10-03T10:29:14Z
dc.date.issued2023-01-01
dc.description.abstractIntroduction: Co-normalization of RNA profiles obtained using different experimental platforms and protocols opens avenue for comprehensive comparison of relevant features like differentially expressed genes associated with disease. Currently, most of bioinformatic tools enable normalization in a flexible format that depends on the individual datasets under analysis. Thus, the output data of such normalizations will be poorly compatible with each other. Recently we proposed a new approach to gene expression data normalization termed Shambhala which returns harmonized data in a uniform shape, where every expression profile is transformed into a pre-defined universal format. We previously showed that following shambhalization of human RNA profiles, overall tissue-specific clustering features are strongly retained while platform-specific clustering is dramatically reduced. Methods: Here, we tested Shambhala performance in retention of fold-change gene expression features and other functional characteristics of gene clusters such as pathway activation levels and predicted cancer drug activity scores. Results: Using 6,793 cancer and 11,135 normal tissue gene expression profiles from the literature and experimental datasets, we applied twelve performance criteria for different versions of Shambhala and other methods of transcriptomic harmonization with flexible output data format. Such criteria dealt with the biological type classifiers, hierarchical clustering, correlation/regression properties, stability of drug efficiency scores, and data quality for using machine learning classifiers. Discussion: Shambhala-2 harmonizer demonstrated the best results with the close to 1 correlation and linear regression coefficients for the comparison of training vs validation datasets and more than two times lesser instability for calculation of drug efficiency scores compared to other methods.
dc.identifier.citationBorisov N., Tkachev V., Simonov A., Sorokin M., Kim E., Kuzmin D., Karademir-Yilmaz B., Buzdin A., "Uniformly shaped harmonization combines human transcriptomic data from different platforms while retaining their biological properties and differential gene expression patterns", Frontiers in Molecular Biosciences, cilt.10, 2023
dc.identifier.doi10.3389/fmolb.2023.1237129
dc.identifier.issn2296-889X
dc.identifier.urihttps://avesis.marmara.edu.tr/api/publication/12608574-792a-420e-9c29-46c9baad8f72/file
dc.identifier.urihttps://hdl.handle.net/11424/294226
dc.identifier.volume10
dc.language.isoeng
dc.relation.ispartofFrontiers in Molecular Biosciences
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectYaşam Bilimleri
dc.subjectMoleküler Biyoloji ve Genetik
dc.subjectSitogenetik
dc.subjectTemel Bilimler
dc.subjectLife Sciences
dc.subjectMolecular Biology and Genetics
dc.subjectCytogenetic
dc.subjectNatural Sciences
dc.subjectYaşam Bilimleri (LIFE)
dc.subjectBİYOKİMYA VE MOLEKÜLER BİYOLOJİ
dc.subjectLife Sciences (LIFE)
dc.subjectMOLECULAR BIOLOGY & GENETICS
dc.subjectBIOCHEMISTRY & MOLECULAR BIOLOGY
dc.subjectBiyokimya
dc.subjectMoleküler Biyoloji
dc.subjectBiyokimya, Genetik ve Moleküler Biyoloji (çeşitli)
dc.subjectBiochemistry
dc.subjectMolecular Biology
dc.subjectBiochemistry, Genetics and Molecular Biology (miscellaneous)
dc.subjectcancer transcriptomics
dc.subjectcorrelation analysis
dc.subjectdata normalization and harmonization
dc.subjectgene expression
dc.subjectmicroarray hybridization
dc.subjectplatform bias
dc.subjectRNA sequencing
dc.subjecttranscriptional profiles
dc.subjectgene expression
dc.subjecttranscriptional profiles
dc.subjectRNA sequencing
dc.subjectmicroarray hybridization
dc.subjectdata normalization and harmonization
dc.subjectplatform bias
dc.subjectcancer transcriptomics
dc.subjectcorrelation analysis
dc.titleUniformly shaped harmonization combines human transcriptomic data from different platforms while retaining their biological properties and differential gene expression patterns
dc.typearticle
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
file.pdf
Size:
3.9 MB
Format:
Adobe Portable Document Format