Exploring Microbial Genome Sequences to Identify Protein Families on the Grid (2005)

Author(s): Sun Y, Wipat A, Pocock M, Lee P, Flanagan K, Worthington J

    Abstract: The analysis of microbial genome sequences can identify protein families that provide potential drug targets for new antibiotics. With the rapid accumulation of newly sequenced genomes, the analysis of complete genome sequences has become a computationally and data-intensive problem that is intractable on common computer systems. This paper presents the Microbase project, which has developed a Grid-based system to support large-scale comparative analysis of complete microbial genome sequences and the identification of protein families based on that analysis. The system integrates Grid computing with genomic databases to provide a high-performance environment for efficient genome comparison, analysis and protein family search. A pre-computed dataset of sequence similarities and homologous protein families has been generated, which can assist the discovery of new therapeutic agents and provide leads for drug development.

      • Date: October 2005
      • Series Title: School of Computing Science Technical Report Series
      • Pages: 8
      • Institution: School of Computing Science, University of Newcastle upon Tyne
      • Publication type: Report
      • Bibliographic status: Published

      Keywords: Genome analysis, Grid, microbial genomes, protein families


      Professor Anil Wipat
      Professor of Integrative Bioinformatics