Flashcopy, ghost hdisk ??

Hi all,

I'm getting some errors on AIX regarding Flashcopy and volume group hard disks.

The script that activates FlashCopy showed these errors:

Recreating Flashcopy for lun01_A1
Performing syntax check...
Syntax check complete.
Executing script...
Script execution complete.
SMcli completed successfully.
Recreating Flashcopy for lun02_B1
Performing syntax check...
Syntax check complete.
Executing script...
Error occurred while attempting to recreate flashcopy logical diskDrive lun02_B1 at line 1.
Error 1011 - A management connection to the controller in slot B must be defined to complete this operation.

If you are managing this controller directly through the Ethernet (out-of-band), you must define its management connection (IP address or
DNS/Network name) using the Add Storage Subsystem option.

If you are managing this controller through a host agent (in-band), verify that all physical paths to the controller are connected and
operational. Then run the hot_add utility on the affected host, and then Refresh or Rescan the Host.
The command at line 1 that caused the error is:

recreate FlashCopy logicalDrive ["lun02_B1"];
Script execution halted due to error.
SMcli failed.
LIST OF DISKS = hdisk10 hdisk11
hdisk10 changed
hdisk10 changed
Removing old definitions for flashcopyvg"

In the Storage Manager Client the FlashCopy was shown as active, but its filesystems were not visible or mounted on AIX.
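Judging by the log, the script appears to have carried on with the hdisk steps even after "SMcli failed." for lun02_B1. A minimal sketch of a guard, assuming the script can capture the SMcli output as text; the function name smcli_ok is hypothetical, and the marker strings are simply the ones visible in the log above:

```shell
#!/bin/sh
# Return 0 only if the captured SMcli output reports success and no error.
# The marker strings are taken from the log in this post, nothing else.
smcli_ok() {
    # $1: full text captured from one SMcli invocation
    case "$1" in
        *"SMcli failed."*|*"Script execution halted due to error."*) return 1 ;;
    esac
    case "$1" in
        *"SMcli completed successfully."*) return 0 ;;
    esac
    return 1   # neither marker found: treat as a failure to be safe
}

# Hypothetical usage inside the flashcopy script (the real SMcli
# arguments are not shown in this post, so they are left out here):
#   out=$(SMcli <arguments> 2>&1)
#   smcli_ok "$out" || { echo "recreate failed, leaving hdisks untouched" >&2; exit 1; }
```

Stopping at that point would at least avoid removing the old flashcopyvg definitions when one of the two FlashCopies was not recreated.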

Running cfgmgr afterwards produced this output:

>cfgmgr
Method error (/usr/lib/methods/cfgefscsi -l fscsi2 ):
        0514-061 Cannot find a child device.
Method error (/etc/methods/cfgfcparray -l hdisk11 ):
        0514-051 Device to be configured does not match the physical
                 device at the specified connection location.

lspv showed this output (hdisk10, which was part of flashcopyvg, is missing):

........
hdisk11 none None 
hdisk19 000c92f2cc0c9e6c datavg concurrent
..........

flashcopyvg originally had hdisk10 and hdisk11 assigned.

lsvg -p datavg
datavg:
PV_NAME PV STATE TOTAL PPs FREE PPs FREE DISTRIBUTION
hdisk8 active 799 7 00..00..00..00..07
hdisk9 active 799 7 00..00..00..00..07
#lspv hdisk19  (exactly the same as lspv hdisk8)
PHYSICAL VOLUME: hdisk8 VOLUME GROUP: data01vg
PV IDENTIFIER: 000c92f2cc0c9e6c VG IDENTIFIER 000c92f20000d90000000121cc0c9f7b
PV STATE: active 
STALE PARTITIONS: 0 ALLOCATABLE: yes
PP SIZE: 128 megabyte(s) LOGICAL VOLUMES: 3
TOTAL PPs: 799 (102272 megabytes) VG DESCRIPTORS: 2
FREE PPs: 7 (896 megabytes) HOT SPARE: no
USED PPs: 792 (101376 megabytes) MAX REQUEST: 1 megabyte
FREE DISTRIBUTION: 00..00..00..00..07 
USED DISTRIBUTION: 160..160..159..160..153
#fget_config -Av
..........
hdisk8 dac0 4 data_lun1_A 
hdisk9 dac1 5 data_lun2_B 
...........
hdisk11 dac1 7 lun02_B1 Snapshot Volume
hdisk19 dac0 6 lun01_A1 Snapshot Volume

Why has hdisk10 disappeared and been replaced by hdisk19, which did not exist before and which is shown as identical to hdisk8 (same PVID)?

Thanks

OK, I found a solution; let's hope it is permanent and the error doesn't happen again.
The steps I took were:
1) umount -t flashcopy
2) varyoffvg flashcopyvg
3) rmdev -dl hdisk10;rmdev -dl hdisk11; rmdev -dl hdisk19
4) cfgmgr
5) varyonvg flashcopyvg
6) mount -t flashcopy
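
The same six steps as a small script, in case it has to be repeated. This is only a sketch: the function names and the DRY_RUN switch are my own additions, it prints the commands by default instead of running them, and like the manual steps it keeps going if an rmdev or cfgmgr call fails.

```shell
#!/bin/sh
# Dry-run by default: commands are printed, not executed.
# Set DRY_RUN=0 to really run them (as root, on the affected AIX host).
DRY_RUN=${DRY_RUN:-1}

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@" || { echo "FAILED: $*" >&2; return 1; }
    fi
}

# $1: volume group, $2: filesystem type used with mount -t,
# remaining arguments: hdisks to delete and rediscover
reset_flashcopy_disks() {
    vg=$1; fstype=$2; shift 2
    run umount -t "$fstype" || return 1
    run varyoffvg "$vg"     || return 1
    for d in "$@"; do
        run rmdev -dl "$d" || true   # a disk may already be gone, keep going
    done
    run cfgmgr || true               # the manual steps did not check cfgmgr either
    run varyonvg "$vg"      || return 1
    run mount -t "$fstype"
}

reset_flashcopy_disks flashcopyvg flashcopy hdisk10 hdisk11 hdisk19
```

With DRY_RUN left at 1 it just lists the eight commands in order, which is a cheap way to check the disk names before doing it for real.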

The hdisk mapping was originally:

datavg    <-->    flashcopyvg
hdisk8  <-->    hdisk10
hdisk9    <-->    hdisk11

Then it changed to:

datavg    <-->    flashcopyvg
hdisk8  <-->    hdisk19
hdisk9    <-->    hdisk11

and hdisk19 had the same PVID as hdisk8.

After the six steps above, the mapping is back to what it was originally.
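
To spot this kind of ghost disk earlier, the lspv output can be checked for PVIDs that show up on more than one hdisk. A rough sketch, assuming the usual lspv column layout (disk name first, PVID second); dup_pvids is a name I made up:

```shell
#!/bin/sh
# Read `lspv` output on stdin and print every PVID that more than one
# hdisk reports, followed by those hdisks.
# Disks without a PVID (shown as "none") are skipped.
dup_pvids() {
    awk 'NF >= 2 && $2 != "none" {
             disks[$2] = (disks[$2] == "" ? $1 : disks[$2] " " $1)
             count[$2]++
         }
         END {
             for (p in count)
                 if (count[p] > 1) print p ": " disks[p]
         }'
}

# On the AIX host:
#   lspv | dup_pvids
```

For the broken state described above, where hdisk8 and hdisk19 both report PVID 000c92f2cc0c9e6c, it should print "000c92f2cc0c9e6c: hdisk8 hdisk19"; when every PVID is unique it prints nothing.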

Regards