Publication:
Comparison of estimation methods for missing value imputation of gene expression data [Gen ifade verilerinde eksik deǧerleri düzelten kestirim yöntemlerinin karşilaştirilmasi]

dc.contributor.authorsSarikas A., Odabasioglu N., Altay G.
dc.date.accessioned2022-03-15T02:12:45Z
dc.date.accessioned2026-01-11T17:56:56Z
dc.date.available2022-03-15T02:12:45Z
dc.date.issued2017
dc.description.abstractControl and correction process of missing values (imputation of MVs) is the first stage of the preprocessing of microarray datasets. This paper focuses on a comparison of most reliable and up to date estimation methods to control and correct the missing values. Imputation of MVs has a very high priority because of its impact on next pre-processing and post-processing stages of microarray data analysis namely, quality control, normalization, differential gene expression, classification, clustering, and pathway analysis, etc. Normalized root mean square error (NRMSE) value is used to evaluate the performances of most popular five methods (k-nearest neighbors, Bayesian principal component analysis, local least squares, mean and median). When NRMSE values of methods were compared, it has observed that local least squares (LLS) and Bayesian principal component analysis (BPCA) methods outperformed all other methods in all percentages of MVs (1%, 5%, 10%, and 20%). BPCA method has given the best results in all percentages of MVs over the number of probes or genes, whereas LLS method has given the best results in all percentages of MVs over the number of samples. The advantage of these two methods over others is that they are least affected by the complexity of the data set. © 2016 IEEE.
dc.identifier.doi10.1109/TIPTEKNO.2016.7863090
dc.identifier.isbn9781509023868
dc.identifier.urihttps://hdl.handle.net/11424/247820
dc.language.isotur
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof2016 Medical Technologies National Conference, TIPTEKNO 2016
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectBayesian principal component analysis
dc.subjectgene expression data
dc.subjectk-nearest neighbor
dc.subjectlocal least squares
dc.subjectmean
dc.subjectmedian
dc.subjectmicroarray
dc.subjectmissing data imputation
dc.subjectmissing value estimation
dc.titleComparison of estimation methods for missing value imputation of gene expression data [Gen ifade verilerinde eksik deǧerleri düzelten kestirim yöntemlerinin karşilaştirilmasi]
dc.typeconferenceObject
dspace.entity.typePublication
oaire.citation.title2016 Medical Technologies National Conference, TIPTEKNO 2016

Files