RAC 12cR1 installation failing on 2nd node with CRS-8503

PS_orclNerd, Feb 24 2016 (edited Feb 25 2016)

Hi,

My 2-node RAC installation (12.1.0.2 on OL 6.5) is failing with the error below. Each time I retry the installation, an incident is created under crs/rac2-pub/crs/incident (for example, incdir_1).

Trace file /u01/app/oracle/diag/crs/rac2-pub/crs/trace/ocssd.trc

Oracle Database 12c Clusterware Release 12.1.0.2.0 - Production Copyright 1996, 2014 Oracle. All rights reserved.

DDE: Flood control is not active

Incident 17 created, dump file: /u01/app/oracle/diag/crs/rac2-pub/crs/incident/incdir_17/ocssd_i17.trc

CRS-8503 [] [] [] [] [] [] [] [] [] [] [] []

CLSB:3324278528: Oracle Clusterware infrastructure error in OCSSD (OS PID 23200): Fatal signal 6 has occurred in program ocssd thread 3324278528; nested signal count is 1
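
For readers decoding the CLSB line above: on Linux, signal 6 is SIGABRT, which is typically raised when a process calls abort() on itself. A minimal Python sketch for pulling the program, PID and signal out of such a line follows. The regular expression and the function name `parse_fatal_signal` are illustrative assumptions based only on the single line quoted above, not part of any Oracle tool.

```python
import re
import signal

# Hypothetical pattern, derived only from the one CLSB line quoted in this post.
FATAL_RE = re.compile(
    r"\(OS PID (\d+)\): Fatal signal (\d+) has occurred in program (\w+)"
)

def parse_fatal_signal(line):
    """Return (program, pid, signal number, signal name), or None if no match."""
    m = FATAL_RE.search(line)
    if m is None:
        return None
    pid, signum, program = int(m.group(1)), int(m.group(2)), m.group(3)
    try:
        # Map the number to the name known to the OS running this script.
        name = signal.Signals(signum).name
    except ValueError:
        name = "unknown"
    return program, pid, signum, name

line = (
    "CLSB:3324278528: Oracle Clusterware infrastructure error in OCSSD "
    "(OS PID 23200): Fatal signal 6 has occurred in program ocssd "
    "thread 3324278528; nested signal count is 1"
)
print(parse_fatal_signal(line))
```

The signal name is resolved on the machine that runs the script, so it should be run on Linux to match the cluster nodes.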

My GI install output shows it too; at the end it reports that ocssd could not be started:

More Details

Execution of GI Install script is successful on nodes : [rac1-pub] 

Execution of GI Install script is failed on nodes : [rac2-pub] 

Exception details  - PRCZ-2009 : Failed to execute command "/u01/app/12.1.0/grid/root.sh" as root within 3,600 seconds on nodes "rac2-pub"

 

Execution status of failed node:rac2-pub

Standard output

...CRS-2672: Attempting to start 'ora.diskmon' on 'rac2-pub'

CRS-2676: Start of 'ora.diskmon' on 'rac2-pub' succeeded

CRS-2883: Resource 'ora.cssd' failed during Clusterware stack start.

CRS-4406: Oracle High Availability Services synchronous start failed.

CRS-4000: Command Start failed, or completed with errors.

2016/02/24 17:43:47 CLSRSC-117: Failed to start Oracle Clusterware stack

Died at /u01/app/12.1.0/grid/crs/install/crsinstall.pm line 914.

The command '/u01/app/12.1.0/grid/perl/bin/perl -I/u01/app/12.1.0/grid/perl/lib -I/u01/app/12.1.0/grid/crs/install /u01/app/12.1.0/grid/crs/install/rootcrs.pl -auto -lang=en_US.UTF-8' execution failed
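
When triaging output like the above, it can help to list the distinct message codes in the order they appear, so each one can be looked up separately. The following is a minimal Python sketch. The pattern (uppercase prefix, hyphen, 3 to 5 digits) and the function name `message_codes` are assumptions inferred from the codes visible in this output, not an official format definition.

```python
import re

# Assumed pattern: an uppercase facility prefix, a hyphen, and a 3-5 digit number,
# as seen in the output above (CRS-2883, CLSRSC-117, PRCZ-2009).
CODE_RE = re.compile(r"\b[A-Z]{3,6}-\d{3,5}\b")

def message_codes(text):
    """Return each distinct message code in order of first appearance."""
    seen = []
    for code in CODE_RE.findall(text):
        if code not in seen:
            seen.append(code)
    return seen

# Sample input copied from the installer output quoted in this post.
output = """\
Exception details  - PRCZ-2009 : Failed to execute command "/u01/app/12.1.0/grid/root.sh" as root within 3,600 seconds on nodes "rac2-pub"
...CRS-2672: Attempting to start 'ora.diskmon' on 'rac2-pub'
CRS-2676: Start of 'ora.diskmon' on 'rac2-pub' succeeded
CRS-2883: Resource 'ora.cssd' failed during Clusterware stack start.
CRS-4406: Oracle High Availability Services synchronous start failed.
CRS-4000: Command Start failed, or completed with errors.
2016/02/24 17:43:47 CLSRSC-117: Failed to start Oracle Clusterware stack
"""
print(message_codes(output))
```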

When I check the incident, there are checksum failures on my ASMLib disks; these disks belong to my GRID disk group:

2016-02-24 17:45:44.394179 :CLSF:3300083456: checksum failed for disk:ORCL:DISK4:

2016-02-24 17:45:44.394180 :CLSF:3300083456: Error: obj 2147483648 blk 0 name 'hard_kfbh' flags 0x65 first 1

2016-02-24 17:45:44.394181 :CLSF:3300083456:   0: expected 1 actual 1

2016-02-24 17:45:44.394182 :CLSF:3300083456:   1: expected 2 actual 130

2016-02-24 17:45:44.394183 :CLSF:3300083456:   2: expected 1 actual 1

2016-02-24 17:45:44.394184 :CLSF:3300083456:   3: expected 0 actual 0

2016-02-24 17:45:44.394185 :CLSF:3300083456:   4: expected 0 actual 0

2016-02-24 17:45:44.394185 :CLSF:3300083456:   5: expected 1 actual 1

2016-02-24 17:45:44.394186 :CLSF:3300083456:   6: expected 2622102583 actual 2622102583

2016-02-24 17:45:44.394187 :CLSF:3300083456: bh: ptr 0x7f25b0244c00 size 512

2016-02-24 17:45:44.394189 :SKGFD:3300083456: bh:  dump of 0x0x7f25b0244c00, len 512

...

2016-02-24 17:45:44.394301 :CLSF:3300083456: Read ASM header off dev:ORCL:DISK4:56:64

2016-02-24 17:45:44.394305 :SKGFD:3300083456: Lib :ASM:ASM Library - Generic Linux, version 2.0.12 (KABI_V2): closing handle 0x7f25b0244780 for disk :ORCL:DISK4:

..

2016-02-24 17:46:01.403166 :CSSD:2793887488: clssnmSendingThread: Connection pending for node rac1-pub, number 1, flags 0x00000002

2016-02-24 17:46:01.908095 :CSSD:3309287168: clssscWaitOnEventValue: after CmInfo State  val 3, eval 1 waited 1000 with cvtimewait status 4294967186

2016-02-24 17:46:01.915773 :CSSD:2808080128: clssnmvDHBValidateNCopy: node 1, rac1-pub, has a disk HB, but no network HB, DHB has rcfg 351799176, wrtcnt, 8392, LATS 4662844, lastSeqNo 8389, uniqueness 1456329487, timestamp 1456332361/5237444

2016-02-24 17:46:02.368549 :CSSD:3320321792: clsssc_CLSFAInit_CB: System not ready for CLSFA initialization

2016-02-24 17:46:02.368554 :CSSD:3320321792: clsssc_CLSFAInit_CB: clsfa fencing not ready yet
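
To make the checksum dump above easier to read, the "expected ... actual ..." lines can be compared mechanically. The Python sketch below reports only the fields that differ, together with the XOR of the two values. In this trace, only field 1 differs (expected 2, actual 130), and the two values differ in a single bit, 0x80. The function name `find_mismatches` and the regular expression are illustrative assumptions; the trace does not say what each field index represents, and the sketch does not assume it either.

```python
import re

# Assumed line shape, taken from the CLSF lines quoted above:
#   "<timestamp> :CLSF:<thread>:   <index>: expected <n> actual <m>"
FIELD_RE = re.compile(r"\b(\d+): expected (\d+) actual (\d+)")

def find_mismatches(trace_lines):
    """Return (index, expected, actual, expected XOR actual) for differing fields."""
    mismatches = []
    for line in trace_lines:
        m = FIELD_RE.search(line)
        if m is None:
            continue  # not a field comparison line
        index, expected, actual = (int(g) for g in m.groups())
        if expected != actual:
            mismatches.append((index, expected, actual, expected ^ actual))
    return mismatches

# Field comparison lines copied from the trace in this post.
trace = [
    "2016-02-24 17:45:44.394181 :CLSF:3300083456:   0: expected 1 actual 1",
    "2016-02-24 17:45:44.394182 :CLSF:3300083456:   1: expected 2 actual 130",
    "2016-02-24 17:45:44.394183 :CLSF:3300083456:   2: expected 1 actual 1",
    "2016-02-24 17:45:44.394184 :CLSF:3300083456:   3: expected 0 actual 0",
    "2016-02-24 17:45:44.394185 :CLSF:3300083456:   4: expected 0 actual 0",
    "2016-02-24 17:45:44.394185 :CLSF:3300083456:   5: expected 1 actual 1",
    "2016-02-24 17:45:44.394186 :CLSF:3300083456:   6: expected 2622102583 actual 2622102583",
]

for index, expected, actual, diff in find_mismatches(trace):
    print(f"field {index}: expected {expected}, actual {actual}, differing bits 0x{diff:02x}")
```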

Has anyone experienced this problem? Maybe I need to update my oracleasmlib binaries. I only downloaded the oracleasmlib RPM, installed it, and configured it as described on oracle.com. On Metalink I found a note saying that the newest PSU should fix a similar error.

Is it possible to patch GI on both nodes and then rerun root.sh on the second node? What invokes the last step (Configure Oracle GI for a Cluster) in a GI installation?

This is only a practice RAC that I built to prepare for certifications.

This post has been answered by PS_orclNerd on Feb 25 2016
Locked on Mar 24 2016
Added on Feb 24 2016