Skip to main content

CRSD in Oracle Clusterware – Your RAC's Silent Orchestrator

๐Ÿ”ง CRSD in Oracle Clusterware – Your RAC's Silent Orchestrator

In Oracle RAC (Real Application Clusters), high availability is not just a goal—it's a requirement. Behind the scenes, CRSD (Cluster Ready Services Daemon) is the component ensuring that every essential service, instance, and resource is up, running, and properly managed.

In this post, we will explore:

  • What is CRSD?
  • Its core responsibilities and architecture
  • Interaction with other clusterware components
  • Failover behavior and how it handles resource recovery
  • Diagnostic commands and logs

๐Ÿ“˜ What is CRSD?

The Cluster Ready Services Daemon (CRSD) is a core component of Oracle Clusterware. It is responsible for:

  • Managing high availability (HA) resources
  • Starting and stopping databases, listeners, ASM, VIPs, and services
  • Orchestrating failover and restarts when a node or resource fails

CRSD runs as root and ensures that critical Oracle and non-Oracle applications are managed according to the policies defined in the OCR.

⚙️ CRSD Responsibilities

Function Description
๐Ÿš€ Resource Management Starts/stops Oracle databases, listeners, services, and ASM
๐Ÿ” Failover Handling Relocates resources during node or service failure
๐Ÿง  Dependency Tracking Maintains relationships (e.g., listener must start before DB)
๐Ÿ“ก VIP & SCAN Management Controls Virtual IPs and Single Client Access Names
๐Ÿ“’ OCR Interaction Reads and writes to the Oracle Cluster Registry for resource state/config

๐Ÿงฑ CRSD Architecture

CRSD is part of the Oracle High Availability Service stack. It spawns and manages several subcomponents:

  • crsd.bin – Main daemon running as root
  • oraagent.bin – User-mode agent to handle database, ASM, and listener operations
  • crsagent.bin – Manages internal cluster resources

How it works:

crsd.bin
   ├── oraagent.bin     ➝ manages Oracle resources
   ├── crsagent.bin     ➝ manages internal infra like SCAN/VIP
   └── OCR               ➝ reads and stores configuration

๐Ÿ”„ What Happens During Resource Failures?

  • If a service or DB crashes, CRSD attempts a local restart.
  • If unsuccessful, the resource is relocated to another node.
  • Failover logic is determined by target/placement policies.

If the CRSD itself fails:

  • The node may be evicted by CSSD if it fails health checks.
  • CRSD is auto-restarted by the OHAS daemon.

๐Ÿ” What Resources Does CRSD Manage?

  • ๐Ÿ“ก SCAN Listeners and SCAN VIPs
  • ๐Ÿงฑ Oracle Databases (RAC, RAC One Node)
  • ๐Ÿ”ง ASM Instances
  • ๐Ÿ’ป Node VIPs
  • ๐Ÿ–ฅ️ Application resources registered by srvctl
  • ๐Ÿงฉ Custom resources via action scripts

๐Ÿงช Diagnostic Commands

Use these to check the health and behavior of CRSD:

crsctl check crs
crsctl status resource -t
crsctl start resource ora.dbname.db
crsctl stop crs

๐Ÿ“‚ Important Log Files

Component Log Path
CRSD Logs $GRID_HOME/log/<hostname>/crsd/crsd.log
Agent Logs $GRID_HOME/log/<hostname>/agent/
OCR Interaction $GRID_HOME/log/<hostname>/crsd/ocrdump.log

๐Ÿ›ก Best Practices

  • ✅ Monitor crsd.log regularly for failover events
  • ๐Ÿ” Avoid manual resource manipulation—always use crsctl or srvctl
  • ๐Ÿ”ง Register resources with proper dependencies and policies
  • ๐Ÿ“’ Keep OCR and voting disks healthy to prevent node eviction

๐Ÿงญ Conclusion

CRSD is the brain of high availability in Oracle RAC. It automates resource startup, monitors health, and performs complex orchestration with little human interaction. Understanding its architecture and log files can make troubleshooting more predictable and help ensure your cluster runs smoothly.

When RAC behaves like magic—CRSD is the magician behind the curtain.

Comments

Popular posts from this blog

๐Ÿš€ Automating Oracle Database Patching with Ansible: A Complete Guide

Oracle database patching has long been the bane of DBAs everywhere. It's a critical task that requires precision, expertise, and often results in extended maintenance windows. What if I told you that you could automate this entire process, reducing both risk and downtime while ensuring consistency across your Oracle estate? ๐Ÿ’ก In this comprehensive guide, I'll walk you through a production-ready Ansible playbook that completely automates Oracle patch application using OPatch. Whether you're managing a single Oracle instance or hundreds of databases across your enterprise, this solution will transform your patch management strategy! ๐ŸŽฏ ๐Ÿ”ฅ The Challenge: Why Oracle Patching is Complex Before diving into the solution, let's understand why Oracle patching is so challenging: ๐Ÿ”— Multiple dependencies : OPatch versions, Oracle Home configurations, running processes ⚠️ Risk of corruption : Incorrect patch application can render databases unusable ⏰ Downtime requirements : Da...

Oracle RAC Switchover & Switchback: Step-by-Step Guide

 Ensuring business continuity requires regular Disaster Recovery (DR) drills. This guide covers the Switchover and Switchback process between Primary (DC) and Standby (DR) databases . Pre-checks Before Performing Switchover Before starting the activity, ensure there are no active sessions in the database. If any are found, share the session details with the application team, get their confirmation, and terminate the sessions. Primary Database Name: PRIMARY Standby Database Name: STANDBY  Identify Active Sessions set lines 999 pages 999 col machine for a30 col username for a30 col program for a30 compute sum of count on report break on report select inst_id,username,osuser,machine,program,status,count(1) "count" from gv$session where inst_id=1 and program like 'JDBC%' group by inst_id,username,osuser,machine,program,status order by 1,2; select inst_id,username,osuser,machine,program,status,count(1) "count" from gv$session where inst_id=2 and program lik...

Mastering Oracle RAC with SRVCTL Commands

Oracle Real Application Clusters (RAC) provide high availability, scalability, and manageability for databases. One of the most powerful tools for managing RAC databases is srvctl , a command-line utility that allows administrators to control various database services. This blog explores essential srvctl commands to help you efficiently manage Oracle RAC environments. 1. Checking Database Configuration and Status  List all available databases on the host:                  srvctl config database   Check the status of a specific database and its instances:                    srvctl status database -d <database_name>   Retrieve detailed status information about a database, including its instances and states:                    srvctl status database -d <database_name> -v 2. Stopping and Starting Databases   ...