| Author |
Message |
Bart
Guest
|
Posted:
Wed Dec 29, 2004 7:30 pm Post subject:
Can no longer failover Cluster Group |
|
|
Hi,
for years now, we have an Exchange Cluster running (W2K).
Since a few days now, we cannot bring our Cluster Group to the 1st
node anymore.
Our 2 other (application) groups can failover without any problem.
We don't think it is caused by connection problems, because we have no
messages that say that "the node lost communication..." in the Event
Log.
First, we thought it was a SCSI problem (see cluster.log below), but
our tools show no errors on the SCSI devices.
Anyway, the problem lies with the Physical Disk Q resource. But what
??
It's especially the following messages that bother me:
SCSI, error attaching to signature b183c136, error 2.
Physical Disk <Disk Q:>: Arbitrate: Unable to attach to signature
b183c136. Error: 2.
I get the following messages in the cluster.log of the 1st node (the
problem node)
-----------------------------------------------------------------------------
[FM] FmsTakeGroupRequest: To take group
'7677a8de-5ec6-474d-bff8-513ea772d3a5'.
[FM] FmpTakeGroupRequest: To take group
'7677a8de-5ec6-474d-bff8-513ea772d3a5'.
[MM] MmSetQuorumOwner(1,1), old owner 0.
Physical Disk <Disk Q:>: [DiskArb]Wait for offline thread to
complete...
Physical Disk <Disk Q:>: [DiskArb]------- DisksArbitrate -------.
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH20)>: [EXRES] calling EcStoreIsServerAlive()
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH20)>: [EXRES] EcStoreIsServerAlive() returned error
0, fIsAlive=TRUE
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH10)>: [EXRES] calling EcStoreIsServerAlive()
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH10)>: [EXRES] EcStoreIsServerAlive() returned error
0, fIsAlive=TRUE
Physical Disk <Disk Q:>: SCSI, error attaching to signature b183c136,
error 2.
Physical Disk <Disk Q:>: Arbitrate: Unable to attach to signature
b183c136. Error: 2.
[MM] MmSetQuorumOwner(0,0), old owner 1.
[FM] FmpTakeGroupRequest: MM did not select local node 1 as the
arbitration winner...
[FM] FmpTakeGroupRequest: Exit for group
<7677a8de-5ec6-474d-bff8-513ea772d3a5>, Status = 1237...
-----------------------------------------------------------------------------
I get the following messages in the cluster.log of the 2nd node (the
ok node)
-----------------------------------------------------------------------------
[FM] FmpOfflineResourceList: Bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:>
<db7a4bbd-cbfa-4887-8ab9-b4fb54552781>
[FM] FmpOfflineResource: MSDTC depends on Disk Q:. Shut down first.
[MM] MmSetQuorumOwner(0,1), old owner 2.
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[LM] LmRemoveTimerActivity:: Entry 0x00000380
[LM] :ReSyncTimerHandles Entry.
[LM] ResyncTimerHandles::removed Timer 0x00000380
[LM] :ReSyncTimerHandles Exit gdwNumHandles=3
[LM] LmRemoveTimerActivity:: Exit
[LM] LogClose : Entry LogFile=0x0014ea68
[LM] LmRemoveTimerActivity:: Entry 0x000003a8
[LM] :ReSyncTimerHandles Entry.
[LM] ResyncTimerHandles::removed Timer 0x000003a8
[LM] :ReSyncTimerHandles Exit gdwNumHandles=2
[LM] LmRemoveTimerActivity:: Exit
[LM] LogClose : Exit returning success
[FM] FmpRmOfflineResource: RmOffline() for
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 returned error 997
Physical Disk <Disk Q:>: Stop watching disk b183c136
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[GUM] GumSendUpdate: Locker waiting type 0 context 8
[GUM] Thread 0xb60 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216551 type 0 context 8
[GUM] GumSendUpdate: Dispatching seq 216551 type 0 context 8 to node 1
[GUM] GumSendUpdate: Locker updating seq 216551 type 0 context 8
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216551 type 0 context 8
[FM] FmpPropagateResourceState: resource
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 pending event.
[FM] FmpMoveGroup: Exit group <Cluster Group>, status = 997
[GUM] GumSendUpdate: Locker waiting type 0 context 9
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216552 type 0 context 9
[GUM] GumSendUpdate: Dispatching seq 216552 type 0 context 9 to node 1
[GUM] GumSendUpdate: Locker updating seq 216552 type 0 context 9
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216552 type 0 context 9
[FM] FmpPropagateGroupState: Group
7677a8de-5ec6-474d-bff8-513ea772d3a5 state = 1, persistent state = 0
[FM] FmpDoMoveGroup: Exit, status = 997
[FM] FmpMovePendingThread Entry.
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216553 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216553 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216553 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216553 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216554 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216554 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216554 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216554 type 0 context 11
[FM] CompleteMoveGroup: Entry for <Cluster Group>
[FM] CompleteMoveGroup: Completing the move for group Cluster Group to
node 1 (1)
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster IP Address>
<c22bcb35-6ea3-4c1c-b321-628f7e0d9e45>
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster Name>
<0cffd3dc-ed73-499b-b27f-0412470a56f9>
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <MSDTC>
<9e9495a6-9700-41cf-ac1a-fdb08e40e7b3>
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Legato IP Address
(cluxch10)> <fac8212e-b455-4702-a263-461a6045cbe7>
[FM] FmpOfflineResourceList: Bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:>
<db7a4bbd-cbfa-4887-8ab9-b4fb54552781>
[FM] FmpOfflineResource: Offline resource <Disk Q:> returned pending
[FM] FmpCompleteMoveGroup: Exit, status = 997
[GUM] GumSendUpdate: Locker waiting type 0 context 11
Physical Disk: PnP Event GUID_IO_VOLUME_LOCK for 764928 received
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216555 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216555 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216555 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216555 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216556 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216556 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216556 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216556 type 0 context 11
Physical Disk <Disk Q:>: Offline, Dismounting volume
\Device\Harddisk7\Partition2.
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT for 764928 received
Physical Disk: PnP Event GUID_IO_VOLUME_UNLOCK for 764928 received
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
called.
Physical Disk <Disk Q:>: [DiskArb] CompletionRoutine, status 0.
Physical Disk: PnP Event GUID_IO_VOLUME_MOUNT for ? (Partition1)
received.
Physical Disk: PnP Event GUID_IO_VOLUME_MOUNT for Q (Partition2)
received.
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT for 934056 received
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT_FAILED for 934056
received
Physical Disk <Disk Q:>: [DiskArb]Successful read (sector 12)
[CLNXCH20:724020] (0,f571d2c0:01c4df7d).
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT for 764928 received
Physical Disk <Disk Q:>: [DiskArb]Successful write (sector 12) [:0]
(0,00000000:00000000).
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
complete.
Physical Disk <Disk Q:>: DisksDismountDrives: letter mask is 00010000.
[RM] RmpSetResourceStatus, Posting state 3 notification for resource
<Disk Q:>
[FM] NotifyCallBackRoutine: enqueuing event
[FM] FmpCreateResStateChangeHandler: Entry
[FM] FmpCreateResStateChangeHandler: Exit, status 0
[FM] FmpHandleResStateChangeProc: Entry...
[CP] CppResourceNotify for resource Disk Q:
[FM] FmpHandleResourceTransition: Resource Name =
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 old state=130 new state=3
[GUM] GumSendUpdate: Locker waiting type 0 context 8
[GUM] Thread 0x109c UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216557 type 0 context 8
[GUM] GumSendUpdate: Dispatching seq 216557 type 0 context 8 to node 1
[GUM] GumSendUpdate: Locker updating seq 216557 type 0 context 8
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216557 type 0 context 8
[FM] FmpPropagateResourceState: resource
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 offline event.
[FM] FmpOfflineWaitingTree: Entry for <Disk Q:>.
[FM] OfflineWaitingResourceTree: Exit, status=0 for <Disk Q:>.
[FM] FmpOfflineWaitingTree: Quorum resource is in the same
group,Moving list=0x00117348
[FM] FmpOfflineWaitingTree: bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:>
<db7a4bbd-cbfa-4887-8ab9-b4fb54552781>
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[MM] MmSetQuorumOwner(0,0), old owner 0.
Physical Disk <Disk Q:>: Stop watching disk b183c136
Physical Disk <Disk Q:>: RemoveDisk: disk b183c136 not found
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
called.
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
complete.
[CP] CppResourceNotify for resource Disk Q:
[FM] RmTerminateResource: db7a4bbd-cbfa-4887-8ab9-b4fb54552781 is now
offline
[FM] FmpOfflineWaitingTree: returned status 0 for <Disk Q:>.
[FM] FmpHandleResStateChangeProc: Exit...
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216558 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216558 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216558 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216558 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216559 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216559 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216559 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216559 type 0 context 11
[FM] CompleteMoveGroup: Entry for <Cluster Group>
[FM] CompleteMoveGroup: Completing the move for group Cluster Group to
node 1 (1)
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster IP Address>
<c22bcb35-6ea3-4c1c-b321-628f7e0d9e45>
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster Name>
<0cffd3dc-ed73-499b-b27f-0412470a56f9>
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <MSDTC>
<9e9495a6-9700-41cf-ac1a-fdb08e40e7b3>
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Legato IP Address
(cluxch10)> <fac8212e-b455-4702-a263-461a6045cbe7>
[FM] FmpOfflineResourceList: Bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:>
<db7a4bbd-cbfa-4887-8ab9-b4fb54552781>
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[MM] MmSetQuorumOwner(0,0), old owner 0.
Physical Disk <Disk Q:>: Stop watching disk b183c136
Physical Disk <Disk Q:>: RemoveDisk: disk b183c136 not found
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
called.
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
complete.
[CP] CppResourceNotify for resource Disk Q:
[FM] RmTerminateResource: db7a4bbd-cbfa-4887-8ab9-b4fb54552781 is now
offline
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x10bc UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216560 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216560 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216560 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216560 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 13
[GUM] Thread 0x10bc UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216561 type 0 context 13
[GUM] GumSendUpdate: Dispatching seq 216561 type 0 context 13 to node
1
[GUM] GumSendUpdate: Locker updating seq 216561 type 0 context 13
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216561 type 0 context 13
[FM] FmpCompleteMoveGroup: Take group
7677a8de-5ec6-474d-bff8-513ea772d3a5 request to remote node 1
[FM] FmpCompleteMoveGroup: Remote node asked us to resend take group
request for group 7677a8de-5ec6-474d-bff8-513ea772d3a5 to another node
....
[MM] MmSetQuorumOwner(2,1), old owner 0.
Physical Disk <Disk Q:>: [DiskArb]Wait for offline thread to
complete...
Physical Disk <Disk Q:>: [DiskArb]------- DisksArbitrate -------.
----------------------------------------------------------------------------
There is no direct impact, because everything runs fine on our 2nd
node, but if something happens to that 2nd node...
Kind Regards,
Bart |
|
| Back to top |
|
 |
Scott Schnoll [MSFT]
Guest
|
Posted:
Wed Dec 29, 2004 11:22 pm Post subject:
Re: Can no longer failover Cluster Group |
|
|
Hi Bart,
Error code 2 translates to "The system cannot find the file specified." The
error in this case may mean that it cannot find the disk, or that, because
of some kind of problem, it cannot locate the quorum logfile that should be
on the disk. Do you know if the drive was reformatted after it was in
production for a while? Also, are the drivers and firmware for the SCSI
controllers the same on both nodes?
That said, this could indicate a faulty drive. I know you said that the
tools show no errors, but were these hardware-level tools?
Also, are you using multipath I/O software for the storage in this cluster?
--
Scott Schnoll
This posting is provided "AS IS" with no warranties, and confers no
rights. Please do not send email directly to this alias. This alias is for
newsgroup
purposes only.
"Bart" <bvanneste077_nspm@yahoo.co.uk> wrote in message
news:a6c200e7.0412290530.59ffb8fe@posting.google.com...
| Quote: | Hi,
for years now, we have an Exchange Cluster running (W2K).
Since a few days now, we cannot bring our Cluster Group to the 1st
node anymore.
Our 2 other (application) groups can failover without any problem.
We don't think it is caused by connection problems, because we have no
messages that say that "the node lost communication..." in the Event
Log.
First, we thought it was a SCSI problem (see cluster.log below), but
our tools show no errors on the SCSI devices.
Anyway, the problem lies with the Physical Disk Q resource. But what
??
It's especially the following messages that bother me:
SCSI, error attaching to signature b183c136, error 2.
Physical Disk <Disk Q:>: Arbitrate: Unable to attach to signature
b183c136. Error: 2.
I get the following messages in the cluster.log of the 1st node (the
problem node)
-----------------------------------------------------------------------------
[FM] FmsTakeGroupRequest: To take group
'7677a8de-5ec6-474d-bff8-513ea772d3a5'.
[FM] FmpTakeGroupRequest: To take group
'7677a8de-5ec6-474d-bff8-513ea772d3a5'.
[MM] MmSetQuorumOwner(1,1), old owner 0.
Physical Disk <Disk Q:>: [DiskArb]Wait for offline thread to
complete...
Physical Disk <Disk Q:>: [DiskArb]------- DisksArbitrate -------.
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH20)>: [EXRES] calling EcStoreIsServerAlive()
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH20)>: [EXRES] EcStoreIsServerAlive() returned error
0, fIsAlive=TRUE
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH10)>: [EXRES] calling EcStoreIsServerAlive()
Microsoft Exchange Information Store <Exchange Information Store
Instance - (SRVXCH10)>: [EXRES] EcStoreIsServerAlive() returned error
0, fIsAlive=TRUE
Physical Disk <Disk Q:>: SCSI, error attaching to signature b183c136,
error 2.
Physical Disk <Disk Q:>: Arbitrate: Unable to attach to signature
b183c136. Error: 2.
[MM] MmSetQuorumOwner(0,0), old owner 1.
[FM] FmpTakeGroupRequest: MM did not select local node 1 as the
arbitration winner...
[FM] FmpTakeGroupRequest: Exit for group
7677a8de-5ec6-474d-bff8-513ea772d3a5>, Status = 1237...
-----------------------------------------------------------------------------
I get the following messages in the cluster.log of the 2nd node (the
ok node)
-----------------------------------------------------------------------------
[FM] FmpOfflineResourceList: Bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:
db7a4bbd-cbfa-4887-8ab9-b4fb54552781
[FM] FmpOfflineResource: MSDTC depends on Disk Q:. Shut down first.
[MM] MmSetQuorumOwner(0,1), old owner 2.
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[LM] LmRemoveTimerActivity:: Entry 0x00000380
[LM] :ReSyncTimerHandles Entry.
[LM] ResyncTimerHandles::removed Timer 0x00000380
[LM] :ReSyncTimerHandles Exit gdwNumHandles=3
[LM] LmRemoveTimerActivity:: Exit
[LM] LogClose : Entry LogFile=0x0014ea68
[LM] LmRemoveTimerActivity:: Entry 0x000003a8
[LM] :ReSyncTimerHandles Entry.
[LM] ResyncTimerHandles::removed Timer 0x000003a8
[LM] :ReSyncTimerHandles Exit gdwNumHandles=2
[LM] LmRemoveTimerActivity:: Exit
[LM] LogClose : Exit returning success
[FM] FmpRmOfflineResource: RmOffline() for
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 returned error 997
Physical Disk <Disk Q:>: Stop watching disk b183c136
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[GUM] GumSendUpdate: Locker waiting type 0 context 8
[GUM] Thread 0xb60 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216551 type 0 context 8
[GUM] GumSendUpdate: Dispatching seq 216551 type 0 context 8 to node 1
[GUM] GumSendUpdate: Locker updating seq 216551 type 0 context 8
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216551 type 0 context 8
[FM] FmpPropagateResourceState: resource
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 pending event.
[FM] FmpMoveGroup: Exit group <Cluster Group>, status = 997
[GUM] GumSendUpdate: Locker waiting type 0 context 9
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216552 type 0 context 9
[GUM] GumSendUpdate: Dispatching seq 216552 type 0 context 9 to node 1
[GUM] GumSendUpdate: Locker updating seq 216552 type 0 context 9
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216552 type 0 context 9
[FM] FmpPropagateGroupState: Group
7677a8de-5ec6-474d-bff8-513ea772d3a5 state = 1, persistent state = 0
[FM] FmpDoMoveGroup: Exit, status = 997
[FM] FmpMovePendingThread Entry.
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216553 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216553 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216553 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216553 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216554 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216554 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216554 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216554 type 0 context 11
[FM] CompleteMoveGroup: Entry for <Cluster Group
[FM] CompleteMoveGroup: Completing the move for group Cluster Group to
node 1 (1)
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster IP Address
c22bcb35-6ea3-4c1c-b321-628f7e0d9e45
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster Name
0cffd3dc-ed73-499b-b27f-0412470a56f9
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <MSDTC
9e9495a6-9700-41cf-ac1a-fdb08e40e7b3
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Legato IP Address
(cluxch10)> <fac8212e-b455-4702-a263-461a6045cbe7
[FM] FmpOfflineResourceList: Bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:
db7a4bbd-cbfa-4887-8ab9-b4fb54552781
[FM] FmpOfflineResource: Offline resource <Disk Q:> returned pending
[FM] FmpCompleteMoveGroup: Exit, status = 997
[GUM] GumSendUpdate: Locker waiting type 0 context 11
Physical Disk: PnP Event GUID_IO_VOLUME_LOCK for 764928 received
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216555 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216555 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216555 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216555 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216556 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216556 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216556 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216556 type 0 context 11
Physical Disk <Disk Q:>: Offline, Dismounting volume
\Device\Harddisk7\Partition2.
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT for 764928 received
Physical Disk: PnP Event GUID_IO_VOLUME_UNLOCK for 764928 received
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
called.
Physical Disk <Disk Q:>: [DiskArb] CompletionRoutine, status 0.
Physical Disk: PnP Event GUID_IO_VOLUME_MOUNT for ? (Partition1)
received.
Physical Disk: PnP Event GUID_IO_VOLUME_MOUNT for Q (Partition2)
received.
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT for 934056 received
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT_FAILED for 934056
received
Physical Disk <Disk Q:>: [DiskArb]Successful read (sector 12)
[CLNXCH20:724020] (0,f571d2c0:01c4df7d).
Physical Disk: PnP Event GUID_IO_VOLUME_DISMOUNT for 764928 received
Physical Disk <Disk Q:>: [DiskArb]Successful write (sector 12) [:0]
(0,00000000:00000000).
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
complete.
Physical Disk <Disk Q:>: DisksDismountDrives: letter mask is 00010000.
[RM] RmpSetResourceStatus, Posting state 3 notification for resource
Disk Q:
[FM] NotifyCallBackRoutine: enqueuing event
[FM] FmpCreateResStateChangeHandler: Entry
[FM] FmpCreateResStateChangeHandler: Exit, status 0
[FM] FmpHandleResStateChangeProc: Entry...
[CP] CppResourceNotify for resource Disk Q:
[FM] FmpHandleResourceTransition: Resource Name =
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 old state=130 new state=3
[GUM] GumSendUpdate: Locker waiting type 0 context 8
[GUM] Thread 0x109c UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216557 type 0 context 8
[GUM] GumSendUpdate: Dispatching seq 216557 type 0 context 8 to node 1
[GUM] GumSendUpdate: Locker updating seq 216557 type 0 context 8
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216557 type 0 context 8
[FM] FmpPropagateResourceState: resource
db7a4bbd-cbfa-4887-8ab9-b4fb54552781 offline event.
[FM] FmpOfflineWaitingTree: Entry for <Disk Q:>.
[FM] OfflineWaitingResourceTree: Exit, status=0 for <Disk Q:>.
[FM] FmpOfflineWaitingTree: Quorum resource is in the same
group,Moving list=0x00117348
[FM] FmpOfflineWaitingTree: bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:
db7a4bbd-cbfa-4887-8ab9-b4fb54552781
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[MM] MmSetQuorumOwner(0,0), old owner 0.
Physical Disk <Disk Q:>: Stop watching disk b183c136
Physical Disk <Disk Q:>: RemoveDisk: disk b183c136 not found
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
called.
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
complete.
[CP] CppResourceNotify for resource Disk Q:
[FM] RmTerminateResource: db7a4bbd-cbfa-4887-8ab9-b4fb54552781 is now
offline
[FM] FmpOfflineWaitingTree: returned status 0 for <Disk Q:>.
[FM] FmpHandleResStateChangeProc: Exit...
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216558 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216558 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216558 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216558 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x678 UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216559 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216559 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216559 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216559 type 0 context 11
[FM] CompleteMoveGroup: Entry for <Cluster Group
[FM] CompleteMoveGroup: Completing the move for group Cluster Group to
node 1 (1)
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster IP Address
c22bcb35-6ea3-4c1c-b321-628f7e0d9e45
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Cluster Name
0cffd3dc-ed73-499b-b27f-0412470a56f9
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <MSDTC
9e9495a6-9700-41cf-ac1a-fdb08e40e7b3
[FM] FmpOfflineResourceList: Bring non quorum resource offline
[FM] FmpOfflineResource: Offline resource <Legato IP Address
(cluxch10)> <fac8212e-b455-4702-a263-461a6045cbe7
[FM] FmpOfflineResourceList: Bring quorum resource offline
[FM] FmpOfflineResource: Offline resource <Disk Q:
db7a4bbd-cbfa-4887-8ab9-b4fb54552781
[DM] DmpQuoObjNotifyCb: Quorum resource
offline/offlinepending/preoffline
[MM] MmSetQuorumOwner(0,0), old owner 0.
Physical Disk <Disk Q:>: Stop watching disk b183c136
Physical Disk <Disk Q:>: RemoveDisk: disk b183c136 not found
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
called.
Physical Disk <Disk Q:>: [DiskArb] StopPersistentReservations is
complete.
[CP] CppResourceNotify for resource Disk Q:
[FM] RmTerminateResource: db7a4bbd-cbfa-4887-8ab9-b4fb54552781 is now
offline
[GUM] GumSendUpdate: Locker waiting type 0 context 11
[GUM] Thread 0x10bc UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216560 type 0 context 11
[GUM] GumSendUpdate: Dispatching seq 216560 type 0 context 11 to node
1
[GUM] GumSendUpdate: Locker updating seq 216560 type 0 context 11
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216560 type 0 context 11
[GUM] GumSendUpdate: Locker waiting type 0 context 13
[GUM] Thread 0x10bc UpdateLock wait on Type 0
[GUM] DoLockingUpdate successful, lock granted to 2
[GUM] GumSendUpdate: Locker dispatching seq 216561 type 0 context 13
[GUM] GumSendUpdate: Dispatching seq 216561 type 0 context 13 to node
1
[GUM] GumSendUpdate: Locker updating seq 216561 type 0 context 13
[GUM] GumpDoUnlockingUpdate releasing lock ownership
[GUM] GumSendUpdate: completed update seq 216561 type 0 context 13
[FM] FmpCompleteMoveGroup: Take group
7677a8de-5ec6-474d-bff8-513ea772d3a5 request to remote node 1
[FM] FmpCompleteMoveGroup: Remote node asked us to resend take group
request for group 7677a8de-5ec6-474d-bff8-513ea772d3a5 to another node
...
[MM] MmSetQuorumOwner(2,1), old owner 0.
Physical Disk <Disk Q:>: [DiskArb]Wait for offline thread to
complete...
Physical Disk <Disk Q:>: [DiskArb]------- DisksArbitrate -------.
----------------------------------------------------------------------------
There is no direct impact, because everything runs fine on our 2nd
node, but if something happens to that 2nd node...
Kind Regards,
Bart |
|
|
| Back to top |
|
 |
Guest
|
Posted:
Fri Dec 31, 2004 8:12 pm Post subject:
Re: Can no longer failover Cluster Group |
|
|
Hi Scott,
No format.
Drivers & firmware are equal.
No hardware-level tools.
Yes, multipath software: Data Duplex Manager (Fujitsu Siemens)
A reboot of the faulty node solved the problem... for now....
This could mean that the problem lies with the OS, not with the
hardware ?
Regards,
Bart |
|
| Back to top |
|
 |
Scott Schnoll [MSFT]
Guest
|
Posted:
Fri Dec 31, 2004 10:54 pm Post subject:
Re: Can no longer failover Cluster Group |
|
|
Perhaps. It could also be with the MPIO software. You might check to see
if there is an update available for DDM. Aside from that, I would keep a
close watch on things for a while to make sure all nodes are healthy.
--
Scott Schnoll
This posting is provided "AS IS" with no warranties, and confers no
rights. Please do not send email directly to this alias. This alias is for
newsgroup
purposes only.
<bvanneste077_nspm@yahoo.co.uk> wrote in message
news:1104502328.094960.241090@c13g2000cwb.googlegroups.com...
| Quote: | Hi Scott,
No format.
Drivers & firmware are equal.
No hardware-level tools.
Yes, multipath software: Data Duplex Manager (Fujitsu Siemens)
A reboot of the faulty node solved the problem... for now....
This could mean that the problem lies with the OS, not with the
hardware ?
Regards,
Bart
|
|
|
| Back to top |
|
 |
|
|
|
|