Publication:
Thread vulnerability in parallel applications

dc.contributor.authorTOPCUOĞLU, HALUK RAHMİ
dc.contributor.authorsOz, Isil; Topcuoglu, Haluk Rahmi; Kandemir, Mahmut; Tosun, Oguz
dc.date.accessioned2022-03-12T18:07:11Z
dc.date.available2022-03-12T18:07:11Z
dc.date.issued2012
dc.description.abstractContinuously reducing transistor sizes and aggressive low power operating modes employed by modern architectures tend to increase transient error rates. Concurrently, multicore machines are dominating the architectural spectrum today in various application domains. These two trends require a fresh look at resiliency of multithreaded applications against transient errors from a software perspective. In this paper, we propose and evaluate a new metric called the Thread Vulnerability Factor (TVF). A distinguishing characteristic of TVF is that its calculation for a given thread (which is typically one of the threads of a multithreaded application) does not depend on its code alone, but also on the codes of the threads that share resources and data with that thread. As a result, we decompose TVF of a thread into two complementary parts: local and remote. While the former captures the TVF induced by the code of the target thread, the latter represents the vulnerability impact of the threads that interact with the target thread. We quantify the local and remote TVF values for three architectural components (register file, ALUs, and caches) using a set of ten multithreaded applications from the Parsec and Splash-2 benchmark suites. Our experimental evaluation shows that TVF values tend to increase as the number of cores increases, which means the system becomes more vulnerable as the core count rises. We further discuss how TVF metric can be employed to explore performance-reliability tradeoffs in multicores. Reliability-based analysis of compiler optimizations and redundancy-based fault tolerance are also mentioned as potential usages of our TVF metric. (C) 2012 Elsevier Inc. All rights reserved.
dc.identifier.doi10.1016/j.jpdc.2012.05.002
dc.identifier.eissn1096-0848
dc.identifier.issn0743-7315
dc.identifier.urihttps://hdl.handle.net/11424/230993
dc.identifier.wosWOS:000308276400002
dc.language.isoeng
dc.publisherACADEMIC PRESS INC ELSEVIER SCIENCE
dc.relation.ispartofJOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectReliability
dc.subjectThread Vulnerability
dc.subjectFault tolerance
dc.subjectMulticores
dc.subjectTVF
dc.titleThread vulnerability in parallel applications
dc.typearticle
dspace.entity.typePublication
local.avesis.idd5b21ae8-4352-4040-9c89-3115eface95c
local.import.packageSS17
local.indexed.atWOS
local.indexed.atSCOPUS
local.journal.numberofpages15
oaire.citation.endPage1185
oaire.citation.issue10
oaire.citation.startPage1171
oaire.citation.titleJOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING
oaire.citation.volume72
relation.isAuthorOfPublication54c6a927-2146-44b3-90ee-33dac6503317
relation.isAuthorOfPublication.latestForDiscovery54c6a927-2146-44b3-90ee-33dac6503317

Files

Collections