Reliable distributed sorting through the application-oriented fault tolerance paradigm

Bruce M. McMillin*, Lionel M. Ni

*Corresponding author for this work

Research output: Contribution to conference, Conference Paper, peer-review

1 Citation (Scopus)

Abstract

The design and implementation of a reliable version of the distributed bitonic sorting algorithm using the application-oriented fault tolerance paradigm on a commercial multicomputer is described. Sorting assertions in general are discussed and the bitonic sort algorithm is introduced. Faulty behavior is discussed and a fault-tolerant parallel bitonic sort developed using this paradigm is presented. The error coverage and the response of the fault-tolerant algorithm to faulty behavior are presented. Both asymptotic complexity and the results of run-time experimental measurements on an Ncube multicomputer are given. The authors demonstrate that the application-oriented fault tolerance paradigm is applicable to problems of a noniterative nature.
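As an illustration of the two ideas the abstract combines, the following is a minimal sequential sketch of a bitonic sorting network with a generic output assertion. It is not the authors' implementation: the abstract does not state the paper's actual assertions or its message-passing structure, so the function names `bitonic_sort` and `check_sorted_output` are hypothetical, and the check shown (output is non-decreasing and is a permutation of the input) is only the most general kind of sorting assertion.

```python
from collections import Counter


def bitonic_sort(values):
    """Sort a list whose length is a power of two with a bitonic network.

    Sequential simulation only: on a multicomputer each index i would
    live on its own node and exchange values with node i ^ j.
    """
    n = len(values)
    if n == 0 or n & (n - 1) != 0:
        raise ValueError("length must be a nonzero power of two")
    a = list(values)
    k = 2
    while k <= n:              # size of the bitonic sequences being merged
        j = k // 2
        while j >= 1:          # compare-exchange distance for this step
            for i in range(n):
                partner = i ^ j
                if partner > i:
                    ascending = (i & k) == 0
                    # Swap when the pair is out of order for its direction.
                    if (a[i] > a[partner]) == ascending:
                        a[i], a[partner] = a[partner], a[i]
            j //= 2
        k *= 2
    return a


def check_sorted_output(original, result):
    """Generic sorting assertion (an assumption, not the paper's test).

    Passes only if the result is non-decreasing and contains exactly
    the same multiset of values as the original input.
    """
    ordered = all(x <= y for x, y in zip(result, result[1:]))
    same_items = Counter(original) == Counter(result)
    return ordered and same_items
```

A corrupted result, such as one where a value was overwritten during an exchange, fails the permutation half of the check even when the output still looks sorted. That is the kind of faulty behavior an application-level assertion is meant to expose.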

Original language: English
Pages: 508-515
Number of pages: 8
Publication status: Published - Jun 1989
Externally published: Yes
Event: 9th International Conference on Distributed Computing Systems - Newport Beach, CA, USA
Duration: 5 Jun 1989 to 9 Jun 1989

Conference

Conference: 9th International Conference on Distributed Computing Systems
City: Newport Beach, CA, USA
Period: 5/06/89 to 9/06/89
