2017-03-27Oracle故障gc buffer busy acquire导致数据库不可用
实施反馈系统有20分钟不可用,然后又自动恢复了。先查看alert日志,看到打开文件数不够,系统已经运行几年了,怎么可能呢。 Non critical error ORA-48180 caught while writing to trace file "/u01/app/ora/diag/rdbms/nwzcdb/nwzcdb2/trace/nwzcdb2_ora_195339.trc" 检查数据库服务器的配置,ulimit -a ,发现oracle hard nofile 65536,应该是足够大的。 查看问题时段的数据库报告,发现数据库过载了。
| |
Snap Id | Snap Time | Sessions | Cursors/Session | Instances | Begin Snap: 13158 24-3月 -17 09:00:27 1531 7.5 2 End Snap: 13159 24-3月 -17 10:00:38 1810 10.0 2 Elapsed:Event | Waits | Total Wait Time (sec) | Wait Avg(ms) | % DB time | Wait Class | gc buffer busy acquire 288,206 568.5K 1972 30.5 Cluster enq: TX - index contention 11,889 325K 27335 17.5 Concurrency buffer busy waits 54,302 217.9K 4012 11.7 Concurrency gc buffer busy release 228,772 200.8K 878 10.8 Cluster gc current grant busy 251,301 73.6K 293 4.0 Cluster gc current block congested 109,356 71.2K 651 3.8 Cluster gc cr block congested 25,922 69.2K 2669 3.7 Cluster gc cr grant congested 30,967 42.1K 1360 2.3 ClusterOwner | Tablespace Name | Object Name | Subobject Name | Obj. Type | GC Buffer Busy | % of Capture | LCSC LCSC_DATA INDEX_LOG_UO_OPERATE_TIME INDEX 266,048 36.39 LCSC LCSC_DATA SS_SECURITY_RESPONSIBILITYFunction Name | Reads: Data | Reqs per sec | Data per sec | Writes: Data | Waits: Count | Avg Tm(ms) | Direct Reads 475.2G 209.64 134.921 1.8G 3.97 .503M 0Physical Reads | %Total | LCSC LCSC_DATA FUNCTION_LOCATION P_DEFAULT_SUB_P_0502 TABLE SUBPARTITION 12,612,706 18.74 LCSC LCSC_DATA FUNCTION_LOCATION P_DEFAULT_SUB_P_0501 TABLE SUBPARTITION 7,703,111 11.44 P_DEFAULT_SUB_P_0505 7,258,626 10.78 LCSC LCSC_DATA FUNCTION_LOCATION P_DEFAULT_SUB_P_0503 TABLE SUBPARTITION 6,005,657 8.92 P_DEFAULT_SUB_P_0507 4,008,989 5.96
---|