Hello, we are on 10.2.0.3 (I know, time to upgrade, just cannot right now). Data Guard is in place, in MAX AVAILABILITY. Prod job that was running for 3 years once a week with no issues started to take 10+ hours. It used to take 2 hrs max. Checked everything, nothing changed. Ran AWR report and see that the top wait event is RELIABLE MESSAGE:
Top 5 Timed Events
Event Waits Time(s) Avg Wait(ms) % Total Call Time Wait Class
CPU time 45,270 97.5
*reliable message 2,655 534 201 1.1 Other*
db file sequential read 199,278 366 2 .8 User I/O
db file scattered read 122,914 287 2 .6 User I/O
LGWR wait on LNS 35,511 209 6 .4 Network
Is this Data Guard? What can we do to fix this? I have brought down standby, tried to re-run this job and still got high reliable message:
Top 5 Timed Events
Event Waits Time(s) Avg Wait(ms) % Total Call Time Wait Class
CPU time 11,490 98.7
reliable message 804 77 96 .7 Other
db file scattered read 92,969 34 0 .3 User I/O
db file sequential read 17,596 25 1 .2 User I/O
log file parallel write 13,268 11 1 .1 System I/O
I don't understand, not much on Metalink. If I brought down standby, logs are not shipping there anymore, shouldn't that have improved performance? Or should I disable configuration to test no Data Guard scenario in this case? In the past disabeling configuration required re-buillding DG though. Plz advise. What is reliable message? Thank you!