Difference between High Availability and Fault Tolerance

Evidian SafeKit

What is the difference between high availability and fault tolerance?

Overview

This article explores the pros and cons of a high availability cluster versus a fault tolerant system by looking at hardware constraints, software failures, RTO, RPO...

The following comparative tables explain in detail the difference between a fault tolerant system and SafeKit, a software high availability cluster.

What is high availability?

A high availability cluster is based on two servers with restart of the critical application in the event of hardware or software failures. There are 2 types of clusters: hardware clusters and software clusters.

Hardware clusters are based on shared disks resulting in dependencies between servers and their connections to shared disk arrays.

Software clusters like Evidian SafeKit are based on real-time data replication and are hardware-agnostic: they can be deployed on physical or virtual servers or in the cloud.

What is fault tolerance?

A fault tolerant system relies on either specialized hardware or specialized hypervisor to detect a hardware failure and instantly switch to a redundant hardware component without application restart.

Fault-tolerant systems only deal with hardware failures and not software failures, by far the most common reason for system downtime.

Pros and cons of high availability and fault tolerance

Software high availability cluster

Active active high availability

Fault-tolerant system

Fault tolerance with lockstep CPU

Product
SafeKit on Windows and Linux Fault tolerant products
Hardware
No dedicated server.

Each server can be the failover server of the other one for multiple applications.

Dedicated hardware.

The secondary server is dedicated to the execution of the same application synchronized at the instruction level.

Software failure
Software failure supported with restart in another OS environment. Software exception on both servers at the same time on the same OS.
Smooth upgrage/fix of application and OS
Yes

Smooth upgrade/fix of application and OS possible server by server.

N and N+1 versions can coexist.

No

Same application and OS image on both servers.

RTO/RPO
The recovery time with SafeKit (RTO) depends on the time to detect and to restart the application (about 1 minute).

The data loss with SafeKit (RPO) is zero as the replication is synchronous.

The recovery time (RTO) of a fault tolerant system is zero.

The application is not restarted in case of failure and continue its execution on the secondary server.

The data loss (RPO) is also zero.

Flexibility
Can run on any type of server with standard Windows and Linux OS Depends on specific hardware or on specific hypervisors
Suited for
Software editors which want to add a simple high availability option to their application Environment where hardware failures is the main concern

SafeKit High Availability Differentiators against Competition

SafeKit: an ideal solution for a partner application

This platform agnostic solution is ideal for a partner with a critical application and who wants to provide a high availability option easy to deploy to many customers.

This clustering solution is also recognized as the simplest to implement by our partners.

Demonstrations of SafeKit High Availability Software

SafeKit Webinar

This webinar presents in 2 minutes Evidian SafeKit.

In this webinar, you will understand SafeKit mirror and farm clusters.

Microsoft SQL Server Cluster

This video shows a mirror module configuration with synchronous real-time replication and failover.

The file replication and the failover are configured for Microsoft SQL Server but it works in the same manner for other databases.

Free trial here

Apache Cluster

This video shows a farm module configuration with load balancing and failover.

The load balancing and the failover are configured for Apache but it works in the same manner for other web services.

Free trial here

Hyper-V Cluster

This video shows a Hyper-V cluster with full replications of virtual machines.

Virtual machines can run on both Hyper-V servers and they are restarted in case of failure.

Free trial here

SafeKit Modules for Plug&Play High Availability Solutions

SafeKit Modules for Plug&Play High Availability Solutions

Network load balancing and failover

Windows farm

Linux farm

Generic Windows farm   > Generic Linux farm   >
Microsoft IIS   > -
NGINX   >
Apache   >
Amazon AWS farm   >
Microsoft Azure farm   >
Google GCP farm   >
Other cloud   >

Advanced clustering architectures

Several modules can be deployed on the same cluster. Thus, advanced clustering architectures can be implemented:

SafeKit Training

Introduction

  1. Overview / pptx

    • Features
    • Architectures
    • Distinctive advantages
  2. Competition / pptx

    • Hardware vs software cluster
    • Synchronous vs asynchronous replication
    • File vs disk replication
    • High availability vs fault tolerance
    • Hardware vs software load balancing
    • Virtual machine vs application HA

Installation, Console, CLI

  1. Install and setup / pptx

    • Package installation
    • Nodes setup
    • Cluster configuration
    • Upgrade
  2. Web console / pptx

    • Cluster configuration
    • Configuration tab
    • Control tab
    • Monitor tab
    • Advanced Configuration tab
  3. Command line / pptx

    • Silent installation
    • Cluster administration
    • Module administration
    • Command line interface

Advanced configuration

  1. Mirror module / pptx

    • userconfig.xml + restart scripts
    • Heartbeat (<hearbeat>)
    • Virtual IP address (<vip>)
    • Real-time file replication (<rfs>)
  2. Farm  module / pptx

    • userconfig.xml + restart scripts
    • Farm configuration (<farm>)
    • Virtual IP address (<vip>)
  3. Checkers / pptx

    • Failover machine (<failover>)
    • Process monitoring (<errd>)
    • Network and duplicate IP checkers
    • Custom checker (<custom>)
    • Split brain checker (<splitbrain>)
    • TCP, ping, module checkers

Support

  1. Support tools / pptx

    • Analyze snaphots
  2. Evidian support / pptx

    • Get permanent license key
    • Register on support.evidian.com
    • Call desk

Documentation

  1. Technical documentation

  2. Presales documentation