在rac环境中,HACMP的vg为unconcurrent状态,是多么糟糕的一件事,而这个不幸就在某生产系统上发生了。环境介绍:AIX6.1的系统,使用的是EMCCLARiiON存储,oracl
环境介绍:
AIX 6.1的系统,使用的是EMC CLARiiON存储,oracle10.2.0.5
问题状况:
先看下各个卷组的状态
data03vg
lsvg data03vg VOLUME GROUP:data03vgVG IDENTIFIER: 00f79d1100004c00000001386f00edfb VG STATE:activePP SIZE:128 megabyte (s) VG PERMISSION:read/writeTOTAL PPs:5315 (680320 megabytes) MAX LVs:512FREE PPs:711 (91008 megabytes) LVs:78USED PPs:4604 (589312 megabytes) OPEN LVs:0QUORUM:3 (Enabled) TOTAL PVs:5VG DESCRIPTORS: 5 STALE PVs:0STALE PPs:0 ACTIVE PVs:5AUTO ON:no Concurrent:Enhanced-CapableAuto-Concurrent: Disabled VG Mode:Non-Concurrent MAX PPs per VG:130048 MAX PPs per PV:2032MAX PVs:64 LTG size (Dynamic): 1024 kilobyte(s)AUTO SYNC:no HOT SPARE:noBB POLICY:relocatable PV RESTRICTION:noneINFINITE RETRY: no
data01vg
lsvg data01vg VOLUME GROUP:data01vgVG IDENTIFIER: 00f79d1100004c00000001386effcc48 VG STATE:activePP SIZE:128 megabyte (s) VG PERMISSION:read/writeTOTAL PPs:6378 (816384 megabytes) MAX LVs:512FREE PPs:1146 (146688 megabytes) LVs:88USED PPs:5232 (669696 megabytes) OPEN LVs:0QUORUM:4 (Enabled) TOTAL PVs:6VG DESCRIPTORS: 6 STALE PVs:0STALE PPs:0 ACTIVE PVs:6AUTO ON:no Concurrent:Enhanced-CapableAuto-Concurrent: Disabled VG Mode:Non-Concurrent MAX PPs per VG:130048 MAX PPs per PV:2032MAX PVs:64 LTG size (Dynamic): 1024 kilobyte(s)AUTO SYNC:no HOT SPARE:noBB POLICY:relocatable PV RESTRICTION:noneINFINITE RETRY: no
data02vg
lsvg data02vg VOLUME GROUP:data02vgVG IDENTIFIER: 00f79d1100004c00000001386f007c90 VG STATE:activePP SIZE:128 megabyte (s) VG PERMISSION:read/writeTOTAL PPs:2126 (272128 megabytes) MAX LVs:512FREE PPs:18 (2304 megabytes) LVs:39USED PPs:2108 (269824 megabytes) OPEN LVs:0QUORUM:2 (Enabled) TOTAL PVs:2VG DESCRIPTORS: 3 STALE PVs:0STALE PPs:0 ACTIVE PVs:2AUTO ON:no Concurrent:Enhanced-CapableAuto-Concurrent: Disabled VG Mode:Non-Concurrent MAX PPs per VG:130048 MAX PPs per PV:2032MAX PVs:64 LTG size (Dynamic): 1024 kilobyte(s)AUTO SYNC:no HOT SPARE:noBB POLICY:relocatable PV RESTRICTION:noneINFINITE RETRY: no
vg中pv的状态:
data03vg
lsvg -p data03vg data03vg: PV_NAMEPV STATETOTAL PPs FREE PPs FREE DISTRIBUTION hdiskpower11active106343 01..00..00..00..42 hdiskpower17removed1063167 21..00..00..00..146 hdiskpower18removed1063167 21..00..00..00..146 hdiskpower19removed1063167 21..00..00..00..146 hdiskpower20removed1063167 21..00..00..00..146
data01vg
lsvg -p data01vg data01vg: PV_NAMEPV STATETOTAL PPs FREE PPs FREE DISTRIBUTION hdiskpower7active10630 00..00..00..00..00 hdiskpower8active106320 00..00..00..00..20 hdiskpower9active106324 02..00..00..00..22 hdiskpower10active10630 00..00..00..00..00 hdiskpower16missing1063551 21..00..105..212..213 hdiskpower21missing1063551
data02vg
lsvg -p data02vg data02vg: PV_NAMEPV STATETOTAL PPs FREE PPs FREE DISTRIBUTION hdiskpower0active10630 00..00..00..00..00 hdiskpower24active106318 00..00..00..00..18
有好多盘不是missing就是removed的,数据库日志报错为:
Thu Mar 21 17:53:58 BEIST 2013
UP简历
基于AI技术的免费在线简历制作工具
128 查看详情
Errors in
file /oracle/app/oracle/admin/ctsdb/bdump/ctsdb2_m000_19595456.trc:
ORA-27072: File I/O error
IBM AIX RISC System/6000 Error: 5: I/O error
odmget HACMPdisktype HACMPdisktype:PdDvLn = “disk/pseudo/power”ghostdisks = “SCSI3″checkres = “SCSI_TUR”breakres = “/usr/lpp/EMC/Symmetrix/bin/emcpowerreset”parallel = “false”makedev = “MKDEV”reserved1 = “”reserved2 = “”reserved3 = “”lssrc –a | grep cl clcomdcaa7929856active clcomdESclcomdES9633858active clstrmgrEScluster9240596active gsclvmdinoperative clinfoEScluster17104944active clconfdcaainoperative nimshnimclientinoperative
两节点的gsclvmd 都是inoperative,看来只能重启hacmp来把gsclvmd给拉起来。
解决过程:
1.先进行数据库的备份,,然后停库:
节点1: su – oracle srvctl stop listener –n ctscrm1 ps –ef | grep “LOCAL=NO”| grep –v grep | awk ‘{print $2}’|xargs kill -9 oracle> alter system switch logfile; oracle> alter system checkpoint; srvctl stop instance –d ctsdb –I ctsdb1
节点2:su – oracle srvctl stop listener –n ctscrm2 ps –ef | grep “LOCAL=NO”| grep –v grep | awk ‘{print $2}’|xargs kill -9 oracle> alter system switch logfile; oracle> alter system checkpoint; srvctl stop instance –d ctsdb –I ctsdb2
关闭crs:节点1和节点2
crsctlstop crs
2.重启hacmp
smit clstop
版权声明:本文内容由互联网用户自发贡献,该文观点仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。
如发现本站有涉嫌抄袭侵权/违法违规的内容, 请发送邮件至 chuangxiangniao@163.com 举报,一经查实,本站将立刻删除。
发布者:程序猿,转转请注明出处:https://www.chuangxiangniao.com/p/561166.html
微信扫一扫
支付宝扫一扫