Designing Fault-Tolerant Mobile Systems (2003)

Author(s): Di Marzo G, Romanovsky A

    Abstract: The purpose of this paper is to investigate how several innovative techniques, not all initially intended for fault-tolerance, can be applied in providing fault tolerance of complex mobile agent systems. Due to their roaming nature, mobile agents usually run on Java-based platforms, which ensures full portability of mobile code. The first part of the paper discusses specific characteristics of mobile systems, outlines the application areas benefiting from code mobility, and shows why the existing error recovery techniques are not suitable for mobile systems. In the next part of the paper we present evaluation criteria for fault tolerance techniques, and propose several possible solutions for error recovery at the application level: meta-agent, Coordinated Atomic actions, asynchronous resolution, self-repair, and proof carrying code. The intention is to allow system developers to choose the approach which is suited best to the characteristics of the mobile agent application to be designed. To this end we discuss the advantages and disadvantages of each technique, as well as situations in which it provides the most benefit. A simple example, based on Internet shopping, is used throughout the paper to demonstrate the techniques.

      • Date: 28-29 November 2002
      • Conference Name: International Workshop on Scientific Engineering for Distributed Java Applications (FIDJI 2002)
      • Volume: 2604
      • Pages: 185-201
      • Publisher: Springer-Verlag
      • Publication type: Conference Proceedings (inc. abstract)
      • Bibliographic status: Published

      Keywords: Mobile agents, system structuring, fault tolerance, exception handling, software engineering.

      Staff

      Professor Alexander Romanovsky
      Prof of Computing Science