Runtime prediction of failure modes from system error logs

Atef Mohamed, Mohammad Zulkernine

Research output: Contribution to book or proceedingConference articlepeer-review

5 Scopus citations

Abstract

Predicting potential failure occurrences during runtime is important to achieve system resilience and avoid hazardous consequences of failures. Existing failure prediction techniques in software systems involve forecasting failure counts, effects, and occurrences. Most of these techniques predict failures that may occur in future runtime intervals and only few techniques predict them at runtime. However, they do not estimate the failure modes and they require extensive instrumentation of source code. In this paper, we provide an approach for predicting failure occurrences and modes during system runtime. Our methodology utilizes system error log records to craft runtime error-spread signature. Using system error log history, we determine a predictive function (estimator) for each failure mode based on these signatures. This estimator can be used to predict a failure mode eventuality measure (a probability of failure mode occurrence) from system error log during system runtime. An experimental evaluation using PostgreSQL opensource database is provided. Our results show high accuracy of failure occurrence and mode predictions.

Original languageEnglish
Title of host publicationProceedings - 2013 International Conference on Engineering of Complex Computer Systems, ICECCS 2013
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages232-241
Number of pages10
ISBN (Print)9780769550077
DOIs
StatePublished - 2013
Event18th International Conference on Engineering of Complex Computer Systems, ICECCS 2013 - Singapore, Singapore
Duration: Jul 17 2013Jul 19 2013

Publication series

NameProceedings of the IEEE International Conference on Engineering of Complex Computer Systems, ICECCS
ISSN (Print)2770-8527
ISSN (Electronic)2770-8535

Conference

Conference18th International Conference on Engineering of Complex Computer Systems, ICECCS 2013
Country/TerritorySingapore
CitySingapore
Period07/17/1307/19/13

Keywords

  • failure mode
  • failure prediction
  • regression analysis
  • runtime error log
  • software reliability

Fingerprint

Dive into the research topics of 'Runtime prediction of failure modes from system error logs'. Together they form a unique fingerprint.

Cite this