Publication
HICSS 1986
Conference paper
EFFECTIVE CONCURRENT RECOVERY MECHANISMS FOR SOFT ERRORS IN MINs.
Abstract
In parallel computer systems, an interconnection network is used to share memory between processors, to exchange information between them, or both. As a result, much of the system's data and control information travels across this network. To avoid severe performance degradation, the network must therefore be resilient to soft errors (transient and intermittent errors). In this paper we propose mechanisms for recovery from soft errors in multistage interconnection networks (MINs). To reduce the work done by these mechanisms, we propose localized concurrent error detection and recovery.