ORA-07445 and ORA-00602 caused instance 1 to crash
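962699, Sep 18 2012 (edited Sep 18 2012)
Liu, could you please take a look at the following problem:
Two-node RAC, 10.2.0.5, on HP-UX 11.31.
I found a similar case in MetaLink note ID 1281101.1, but I can't be sure that the crash is caused by the bug described in that note. Also, is there a tool, similar to trca for formatting 10046 trace files, that can format a trace file such as sczh1_pmon_7653.trc?
Here is part of the alert log: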
Tue Sep 18 12:41:52 EAT 2012
Thread 1 advanced to log sequence 64886 (LGWR switch)
Current log# 1 seq# 64886 mem# 0: +ZHDATA01/sczh/onlinelog/group_1.258.727538987
Current log# 1 seq# 64886 mem# 1: +ZHDATA01/sczh/onlinelog/group_1.259.727538989
Tue Sep 18 12:42:25 EAT 2012
Errors in file /oracle/product/admin/sczh/udump/sczh1_ora_10442.trc:
ORA-07445: exception encountered: core dump [kssdct()+144] [SIGSEGV] [Address not mapped to object] [0x000019416] [] []
Tue Sep 18 12:42:27 EAT 2012
Trace dumping is performing id=[cdmp_20120918124227]
Tue Sep 18 12:42:46 EAT 2012
Errors in file /oracle/product/admin/sczh/bdump/sczh1_pmon_7653.trc:
ORA-07445: exception encountered: core dump [$cold_kssdch_stage()+320] [SIGSEGV] [Address not mapped to object] [0x000019426] [] []
Tue Sep 18 12:42:47 EAT 2012
Errors in file /oracle/product/admin/sczh/bdump/sczh1_pmon_7653.trc:
ORA-00602: internal programming exception
ORA-07445: exception encountered: core dump [$cold_kssdch_stage()+320] [SIGSEGV] [Address not mapped to object] [0x000019426] [] []
Tue Sep 18 12:42:47 EAT 2012
PMON: terminating instance due to error 602
Tue Sep 18 12:42:49 EAT 2012
Shutting down instance (abort)
License high water mark = 328
Tue Sep 18 12:42:53 EAT 2012
Instance terminated by PMON, pid = 7653
Tue Sep 18 12:42:54 EAT 2012
Instance terminated by USER, pid = 19627
Tue Sep 18 12:43:04 EAT 2012
Starting ORACLE instance (normal)
Tue Sep 18 12:43:16 EAT 2012
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Interface type 1 lan0 192.168.1.0 configured from OCR for use as a cluster interconnect
Interface type 1 lan2 10.64.1.0 configured from OCR for use as a public interface
Picked latch-free SCN scheme 3
LICENSE_MAX_USERS = 0
SYS auditing is disabled
ksdpec: called for event 13740 prior to event group initialization
Starting up ORACLE RDBMS Version: 10.2.0.5.0.
System parameters with non-default values:
processes = 500
sessions = 555
sga_max_size = 17179869184
lock_sga = TRUE
__shared_pool_size = 2147483648
shared_pool_size = 2147483648
__large_pool_size = 134217728
large_pool_size = 134217728
__java_pool_size = 50331648
__streams_pool_size = 16777216
backup_tape_io_slaves = TRUE
sga_target = 17179869184
control_files = ZHDATA01/sczh/controlfile/current.256.727538987, ZHDATA01/sczh/controlfile/current.257.727538987
db_block_size = 8192
__db_cache_size = 14814281728
compatible = 10.2.0.5.0
log_archive_dest_1 = LOCATION=/zharch1/ alternate=log_archive_dest_2 reopen=5 max_failure=6
log_archive_dest_2 = LOCATION=/zhnasarch1/
log_archive_dest_state_1 = ENABLE
log_archive_dest_state_2 = ALTERNATE
db_files = 512
db_file_multiblock_read_count= 128
cluster_database = TRUE
cluster_database_instances= 1
db_create_file_dest = +ZHDATA01
db_create_online_log_dest_1= +ZHDATA01
db_create_online_log_dest_2= +ZHDATA01
thread = 1
instance_number = 1
undo_management = AUTO
undo_tablespace = undotbs1
remote_login_passwordfile= EXCLUSIVE
db_domain =
instance_name = sczh1
dispatchers = (PROTOCOL=TCP) (SERVICE=sczh1XDB)
local_listener = (ADDRESS = (PROTOCOL = TCP)(HOST = 10.64.1.47)(PORT = 1521))
remote_listener = LISTENERS_SCZH
job_queue_processes = 10
background_dump_dest = /oracle/product/admin/sczh/bdump
user_dump_dest = /oracle/product/admin/sczh/udump
core_dump_dest = /oracle/product/admin/sczh/cdump
audit_file_dest = /oracle/product/admin/sczh/adump
db_name = sczh
open_cursors = 500
pga_aggregate_target = 8589934592
aq_tm_processes = 0
Cluster communication is configured to use the following interface(s) for this instance
192.168.1.5
Tue Sep 18 12:43:17 EAT 2012
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
PMON started with pid=2, OS id=19898
DIAG started with pid=3, OS id=19925
PSP0 started with pid=4, OS id=19927
LMON started with pid=5, OS id=19929
LMD0 started with pid=6, OS id=19931
LMS0 started with pid=7, OS id=19933
LMS1 started with pid=8, OS id=19935
LMS2 started with pid=9, OS id=19937
MMAN started with pid=10, OS id=19939
DBW0 started with pid=11, OS id=19941
DBW1 started with pid=12, OS id=19943
LGWR started with pid=13, OS id=19950
CKPT started with pid=14, OS id=19952
SMON started with pid=15, OS id=19954
RECO started with pid=16, OS id=19956
CJQ0 started with pid=17, OS id=19958
MMON started with pid=18, OS id=19960
Tue Sep 18 12:43:23 EAT 2012
starting up 1 dispatcher(s) for network address '(ADDRESS=(PARTIAL=YES)(PROTOCOL=TCP))'...
MMNL started with pid=19, OS id=19962
Tue Sep 18 12:43:23 EAT 2012
starting up 1 shared server(s) ...
Tue Sep 18 12:43:26 EAT 2012
lmon registered with NM - instance id 1 (internal mem no 0)
Tue Sep 18 12:43:26 EAT 2012
Reconfiguration started (old inc 0, new inc 24)
List of nodes:
0 1
Global Resource Directory frozen
* allocate domain 0, invalid = TRUE
Communication channels reestablished
* domain 0 valid according to instance 1
* domain 0 valid = 1 according to instance 1
Tue Sep 18 12:43:27 EAT 2012
Master broadcasted resource hash value bitmaps
Non-local Process blocks cleaned out
Tue Sep 18 12:43:27 EAT 2012
LMS 0: 0 GCS shadows cancelled, 0 closed
Tue Sep 18 12:43:27 EAT 2012
LMS 1: 0 GCS shadows cancelled, 0 closed
Tue Sep 18 12:43:27 EAT 2012
LMS 2: 0 GCS shadows cancelled, 0 closed
Set master node info
Submitted all remote-enqueue requests
Dwn-cvts replayed, VALBLKs dubious
All grantable enqueues granted
Tue Sep 18 12:43:28 EAT 2012
LMS 0: 0 GCS shadows traversed, 0 replayed
Tue Sep 18 12:43:28 EAT 2012
LMS 1: 0 GCS shadows traversed, 0 replayed
Tue Sep 18 12:43:28 EAT 2012
LMS 2: 0 GCS shadows traversed, 0 replayed
Tue Sep 18 12:43:28 EAT 2012
Submitted all GCS remote-cache requests
Post SMON to start 1st pass IR
Fix write in gcs resources
Reconfiguration complete
LCK0 started with pid=22, OS id=20031
Tue Sep 18 12:43:31 EAT 2012
ALTER DATABASE MOUNT
Tue Sep 18 12:43:31 EAT 2012
Starting background process ASMB
ASMB started with pid=24, OS id=20040
Starting background process RBAL
RBAL started with pid=25, OS id=20044
Tue Sep 18 12:43:35 EAT 2012
SUCCESS: diskgroup ZHDATA01 was mounted
Tue Sep 18 12:43:39 EAT 2012
Setting recovery target incarnation to 1
Tue Sep 18 12:43:39 EAT 2012
Successful mount of redo thread 1, with mount id 255777838
Tue Sep 18 12:43:39 EAT 2012
Database mounted in Shared Mode (CLUSTER_DATABASE=TRUE)
Completed: ALTER DATABASE MOUNT
Tue Sep 18 12:43:40 EAT 2012
ALTER DATABASE OPEN
Tue Sep 18 12:43:40 EAT 2012
Picked broadcast on commit scheme to generate SCNs
Tue Sep 18 12:43:44 EAT 2012
LGWR: STARTING ARCH PROCESSES
ARC0 started with pid=23, OS id=20316
ARC1 started with pid=28, OS id=20324
Tue Sep 18 12:43:44 EAT 2012
ARC0: Archival started
ARC1: Archival started
LGWR: STARTING ARCH PROCESSES COMPLETE
Thread 1 opened at log sequence 64887
Current log# 2 seq# 64887 mem# 0: +ZHDATA01/sczh/onlinelog/group_2.260.727538993
Current log# 2 seq# 64887 mem# 1: +ZHDATA01/sczh/onlinelog/group_2.261.727538995
Successful open of redo thread 1
Tue Sep 18 12:43:44 EAT 2012
MTTR advisory is disabled because FAST_START_MTTR_TARGET is not set
Tue Sep 18 12:43:44 EAT 2012
SMON: enabling cache recovery
Tue Sep 18 12:43:44 EAT 2012
ARC1: Becoming the 'no FAL' ARCH
ARC1: Becoming the 'no SRL' ARCH
Tue Sep 18 12:43:44 EAT 2012
ARC0: Becoming the heartbeat ARCH
Tue Sep 18 12:43:45 EAT 2012
Successfully onlined Undo Tablespace 1.
Tue Sep 18 12:43:45 EAT 2012
SMON: enabling tx recovery
Tue Sep 18 12:43:45 EAT 2012
Database Characterset is ZHS16GBK
Opening with internal Resource Manager plan
replication_dependency_tracking turned off (no async multimaster replication found)
WARNING: AQ_TM_PROCESSES is set to 0. System operation might be adversely affected.
Completed: ALTER DATABASE OPEN
Tue Sep 18 13:00:36 EAT 2012
Thread 1 advanced to log sequence 64888 (LGWR switch)
Current log# 3 seq# 64888 mem# 0: +ZHDATA01/sczh/onlinelog/group_3.262.727538997
Current log# 3 seq# 64888 mem# 1: +ZHDATA01/sczh/onlinelog/group_3.263.727539001
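For reading an alert log like this one, here is a minimal Python sketch that lists each ORA- error with its timestamp and trace file. It does not format the trace files themselves. In the alert log as originally written, the message text of the first ORA-07445 entry came out as `M-x` escapes (the notation `cat -v` uses for bytes with the high bit set), so the sketch also turns such escapes back into characters. That step assumes the bytes are GBK, which is a guess based on the ZHS16GBK database character set reported in the log. The names `decode_meta`, `summarize` and `SAMPLE` are made up for this illustration, and the sample text is a few entries copied from the log.

```python
import re

# Timestamp lines look like "Tue Sep 18 12:42:25 EAT 2012".
TIMESTAMP = re.compile(r"^\w{3} \w{3} +\d{1,2} \d{2}:\d{2}:\d{2} \w+ \d{4}$")
TRACE_FILE = re.compile(r"^Errors in file (\S+?):?$")
ORA_ERROR = re.compile(r"\bORA-\d{5}\b")
# 'M-x' with printable x stands for the byte ord(x) + 0x80 (0xA0..0xFE).
# Control-style escapes such as 'M-^X' are not handled in this sketch.
META = re.compile(r"M-([ -~])")

SAMPLE = '''\
Tue Sep 18 12:42:25 EAT 2012
Errors in file /oracle/product/admin/sczh/udump/sczh1_ora_10442.trc:
ORA-07445: M-3M-vM-OM-VM-RM-lM-3M-#M-4M-mM-NM-s: M-:M-KM-PM-DM-WM-*M-4M-" [kssdct()+144] [SIGSEGV] [Address not mapped to object] [0x000019416] [] []
Tue Sep 18 12:42:47 EAT 2012
Errors in file /oracle/product/admin/sczh/bdump/sczh1_pmon_7653.trc:
ORA-00602: internal programming exception
ORA-07445: exception encountered: core dump [$cold_kssdch_stage()+320] [SIGSEGV] [Address not mapped to object] [0x000019426] [] []
'''


def decode_meta(text, encoding="gbk"):
    """Rebuild the raw bytes behind 'M-x' escapes and decode them.

    Assumption: the escaped bytes are GBK. A literal "M-" in the text
    would be misread as an escape.
    """
    raw = bytearray()
    pos = 0
    for match in META.finditer(text):
        raw += text[pos:match.start()].encode(encoding, errors="replace")
        raw.append(ord(match.group(1)) + 0x80)
        pos = match.end()
    raw += text[pos:].encode(encoding, errors="replace")
    return raw.decode(encoding, errors="replace")


def summarize(alert_text):
    """Return a list of (timestamp, trace_file, [ORA codes]) per error line."""
    events = []
    stamp = trace = None
    for line in alert_text.splitlines():
        line = line.strip()
        if TIMESTAMP.match(line):
            stamp, trace = line, None   # a new log entry starts here
            continue
        found = TRACE_FILE.match(line)
        if found:
            trace = found.group(1)
            continue
        codes = ORA_ERROR.findall(line)
        if codes:
            events.append((stamp, trace, codes))
    return events


if __name__ == "__main__":
    for stamp, trace, codes in summarize(decode_meta(SAMPLE)):
        print(stamp, trace, codes)
```

To run it against a real log, read the file as text and pass its contents to `summarize`, calling `decode_meta` first only if the log actually contains `M-` escapes.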