Business Continuity and Disaster Recovery Toolkit

Collection(s): EDUCAUSE Working Group

Testing and Training

Regular testing and exercises should be part of an ongoing BC/DR plan maintenance process. The frequency of testing can vary depending on the organization's needs and risk profile, but it's typically done annually or whenever significant changes occur within the organization or its environment.

Testing Methods

  1. Plan Review: The BC/DR team looks over the plan, identifying any parts that need improvement or updates or have gaps that need to be filled.
  2. Tabletop Exercise: A tabletop exercise is a role-playing activity in which players respond to scenarios presented by one or more facilitators. Participants usually play their own role (such as CEO, IT lead, or communications rep), but they can also play other roles to fill in gaps.
  3. DR Scenario Simulations: This is an actual execution of the DR plan into a non-production DR environment. It usually is limited to specific workloads, systems, and applications and does not include the entire environment.
  4. Full DR Simulation: This is similar to the previous method, but you're attempting to recover everything in a scenario where there is a total loss of operations and location. These exercises can certainly provide the most benefit but can be burdensome and expensive.

Why to Test a BCP

  1. Verify Effectiveness: Testing helps determine whether the BCP is effective in achieving its objectives. Testing helps identify weaknesses, gaps, and areas for improvement in the plan.
  2. Compliance: Many industries and regulatory bodies require organizations to have a BCP and conduct regular testing as part of compliance, such as with HIPAA, FERPA, and GLBA).
  3. Assurance: Testing provides assurance to stakeholders, including employees, customers, partners, and investors, that the organization is prepared to maintain critical operations in the face of disruptions.
  4. Learning and Training: Testing allows employees to familiarize themselves with their roles and responsibilities in a crisis, building confidence to respond more effectively when a real disaster occurs, and develops organizational trust in the plan.
  5. Continuous Improvement: Identifying areas for improvement through testing helps organizations refine and enhance their BCP over time.

What to Test in a BCP

  1. Critical Functions and Processes: Test the core functions and processes that are vital to your organization's operations. This includes IT systems, communication, supply chain, and customer service.
  2. Response and Recovery Procedures: Evaluate the effectiveness of the procedures outlined in the BCP for responding to and recovering from specific scenarios. This includes actions such as data backup, employee evacuation, vendor services, and resource allocation.
  3. Communication Plans: Test the communication channels and protocols established in the BCP for internal and external stakeholders. Ensure that communication is timely and accurate.
  4. Resource Availability: Assess the availability of essential resources, such as personnel, equipment, and facilities, during a crisis. This includes testing backup sites and supply chain resilience.
  5. Dependencies: Identify and test dependencies between different functions and processes. Determine how disruptions in one area can impact others and how these dependencies are managed.
  6. Timeliness: Evaluate the timeliness of responses and recovery efforts. Determine whether objectives for recovery time objectives (RTOs) and recovery point objectives (RPOs) are met.
  7. Employee Training and Awareness: Ensure that employees are adequately trained and aware of their roles and responsibilities during a crisis. Conduct drills and exercises to reinforce their readiness. All new employees should be briefed on their roles and responsibilities.
  8. Documentation and Record Keeping: Verify that documentation and record-keeping procedures are in place and available during a crisis. This includes having updated contact lists, recovery plans, and incident logs.
  9. Scalability: Test the plan's scalability by simulating different levels of disruption, from minor incidents to large-scale disasters, to see how well the plan adapts.
  10. Feedback and Improvement: After each test, gather feedback from participants and stakeholders. Use this feedback to make necessary adjustments and improvements to the BCP.