ABSTRACT:
The introduction of computers has been a huge plus to human life in its entirety because it provides both the world of business and private an easy and fast means to process, generate and exchange information. However, the proliferation of networked devices, internet services and the amount of data being generated frequently is enormous. This poses a major challenge, to the procurement cost of high performing computers and servers capable of processing and housing the big data. This called for the migration of organizational and/or institutional data upload to the cloud for high level of productivity at a low cost. Therefore, with high demand for cloud services and resources by users who migrated to the cloud, cloud computing systems have experienced an increase in outages or failures in real-time cloud computing environment and thereby affecting its reliability and availability. This paper proposes and simulates a system comprising four components: the user, task controller, fault detector and fault tolerance layers to mitigate the occurrence of fault combining checkpointing and replication techniques using cloud simulator (CloudSim).
Keywords:
Fault tolerance, task controller, CloudSim, cloud computing, checkpointing, fault detector, mitigate failure replication, reliability.