Publication:
Predicting the Soft Error Vulnerability of Parallel Applications Using Machine Learning

dc.contributor.authorARSLAN YILMAZ, SANEM
dc.contributor.authorsOz, Isil; Arslan, Sanem
dc.date.accessioned2022-03-12T22:55:49Z
dc.date.accessioned2026-01-10T21:51:23Z
dc.date.available2022-03-12T22:55:49Z
dc.date.issued2021
dc.description.abstractWith the widespread use of the multicore systems having smaller transistor sizes, soft errors become an important issue for parallel program execution. Fault injection is a prevalent method to quantify the soft error rates of the applications. However, it is very time consuming to perform detailed fault injection experiments. Therefore, prediction-based techniques have been proposed to evaluate the soft error vulnerability in a faster way. In this work, we present a soft error vulnerability prediction approach for parallel applications using machine learning algorithms. We define a set of features including thread communication, data sharing, parallel programming, and performance characteristics; and train our models based on three ML algorithms. This study uses the parallel programming features, as well as the combination of all features for the first time in vulnerability prediction of parallel programs. We propose two models for the soft error vulnerability prediction: (1) A regression model with rigorous feature selection analysis that estimates correct execution rates, (2) A novel classification model that predicts the vulnerability level of the target programs. We get maximum prediction accuracy rate of 73.2% for the regression-based model, and achieve 89% F-score for our classification model.
dc.identifier.doi10.1007/s10766-021-00707-0
dc.identifier.eissn1573-7640
dc.identifier.issn0885-7458
dc.identifier.urihttps://hdl.handle.net/11424/236836
dc.identifier.wosWOS:000633744600001
dc.language.isoeng
dc.publisherSPRINGER/PLENUM PUBLISHERS
dc.relation.ispartofINTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectSoft error analysis
dc.subjectFault injection
dc.subjectParallel programming
dc.subjectMachine Learning
dc.subjectSPLASH-2
dc.titlePredicting the Soft Error Vulnerability of Parallel Applications Using Machine Learning
dc.typearticle
dspace.entity.typePublication
oaire.citation.endPage439
oaire.citation.issue3
oaire.citation.startPage410
oaire.citation.titleINTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING
oaire.citation.volume49

Files