Publication:
Quantifying the impact of data replication on error propagation

dc.contributor.author: ÖZTÜRK, ZUHAL
dc.contributor.author: TOPCUOĞLU, HALUK RAHMİ
dc.contributor.authors: ÖZTÜRK Z., TOPCUOĞLU H. R., Kandemir M. T.
dc.date.accessioned: 2022-09-26T09:01:50Z
dc.date.available: 2022-09-26T09:01:50Z
dc.date.issued: 2022-09-01
dc.description.abstract: Various technological developments in the microprocessor world make modern computing systems more vulnerable to soft errors than in the past, and consequently fault tolerance techniques are becoming increasingly important in various application domains. While fault tolerance methods are generally known to achieve high levels of reliability, they can also introduce significant performance, energy, and memory overheads, which can be reduced by employing such techniques selectively rather than indiscriminately. Data replication is used to prevent error propagation across hardware components and application program data structures by replicating the application program's data. When using data replication, many factors need to be taken into account, including which data structures/elements to replicate, how many times to replicate a given data element, and which threads to protect (in a multithreaded application). These and similar factors define what can be termed the "replication space". This study defines a replication space and systematically explores protection techniques of various strengths/degrees, quantifying their impacts on memory consumption, performance, and error propagation. Our experimental analysis reveals that different degrees of protection bring different outcomes depending on the application specifics. In particular, while data replication limits error propagation to a certain extent in multithreaded applications where the threads do not communicate or share data much, errors can propagate across threads quite quickly in applications where threads are more tightly coupled. Additionally, our results indicate that in certain cases where error propagation is low, the effect of data replication on error propagation can be negligible.
dc.identifier.citation: ÖZTÜRK Z., TOPCUOĞLU H. R., Kandemir M. T., "Quantifying the impact of data replication on error propagation", CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022
dc.identifier.doi: 10.1007/s10586-022-03726-9
dc.identifier.issn: 1386-7857
dc.identifier.uri: https://hdl.handle.net/11424/281767
dc.language.iso: eng
dc.relation.ispartof: CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS
dc.rights: info:eu-repo/semantics/openAccess
dc.subject: Soft errors
dc.subject: Error propagation
dc.subject: Data replication
dc.subject: Memory and performance overheads
dc.subject: Multithreading
dc.title: Quantifying the impact of data replication on error propagation
dc.type: article
dspace.entity.type: Publication
local.avesis.id: 7adf9220-7dfb-40c7-a5d9-d7c3b7750725
relation.isAuthorOfPublication: 29f18775-a2c9-4297-b730-4ad46a280e6e
relation.isAuthorOfPublication: 54c6a927-2146-44b3-90ee-33dac6503317
relation.isAuthorOfPublication.latestForDiscovery: 29f18775-a2c9-4297-b730-4ad46a280e6e

Files

Original bundle
Name: 7.pdf
Size: 1.61 MB
Format: Adobe Portable Document Format
