A Generic Implementation Approach to Concurrent Fault-Tolerant Software (2000)

Author(s): Xu J, Randell B, Romanovsky A

    Abstract: Abstract: This paper addresses the practical implementation of means of tolerating residual software faults in complex software systems, especially concurrent and distributed ones. There are several inherent difficulties in implementing such fault-tolerant software systems, including the controlled use of extra redundancy and the mixture of different design concerns. In an attempt to minimize these difficulties, we present a generic implementation approach, composed of a multi-layered reference architecture, a configuration method and an architectural pattern. We evaluate our implementation approach using an industrial control application whose control software we equip with the ability to tolerate a variety of software faults. The preliminary evidence from this case study shows that our approach can simplify the implementation process, reduce repetitive development effort and provide high flexibility through a generic interface for a wide range of fault tolerance schemes

      • Series Title: Department of Computing Science Technical Report Series
      • Pages: 25
      • Institution: Department of Computing Science, University of Newcastle upon Tyne
      • Publication type: Report
      • Bibliographic status: Published

      Keywords: architectural patterns, concurrent and distributed systems, coordinated atomic (CA) actions, fault-tolerant software, object orientation, safety-critical applications


      Professor Alexander Romanovsky
      Prof of Computing Science