Monday, September 17, 2012

Server not booting after luactivate to Patch ID Generic_147440-02 and above on T2000

Problem:

Rebooting with command: boot
Boot device: disk1:a  File and args:
SunOS Release 5.10 Version Generic_147440-10 64-bit
Copyright (c) 1983, 2011, Oracle and/or its affiliates. All rights reserved.
sorry, variable 'tcp_conn_hash_size' is not defined in the 'tcp' module
Hostname: sq......104
NOTICE: VxVM vxdmp V-5-0-34 added disk array 604033, datype = EMC
NOTICE: VxVM vxdmp V-5-0-34 added disk array DISKS, datype = Disk
NOTICE: VxVM vxdmp V-5-3-1700 dmpnode 287/0x0 has migrated from enclosure FAKE_ENCLR_SNO to enclosure DISKS
Cross trap sync timeout:  at cpu_sync.xword[1]: 0x1010panic: failed to stop cpu12
panic: failed to stop cpu13
panic: failed to stop cpu14
panic: failed to stop cpu15
panic[cpu20]/thread=2a1020cdca0: xt_sync: timeout
000002a1020cce90 unix:xt_sync+370 (10bcc00, 1, 2a1020ccfa8, 626475a3f8, 1, 1913c08)
  %l0-3: 0000000000000008 00000062647575c4 00000062abfc3028 00000062abfc2ff8
  %l4-7: 000002a1020ccfb0 0000000000000020 0000000000000000 00000000010bcd28
000002a1020cd1b0 unix:hat_unload_callback+7d4 (7fc00, 2a1020cd348, 0, 2a1020cd448, 0, 30002c77b40)
  %l0-3: 00000300090fe000 fffffffffffffff8 0000000000000001 0000000000000001
  %l4-7: 0000000000000000 000003000907b010 ffffffffffffffff 0000030002c77b48
000002a1020cd590 swrand:physmem_ent_gen+210 (1a25348, 700000ccd80, 0, 0, 0, 1000)
  %l0-3: 0000000000001d9b 0000000000000000 000002a1020cd68c 0000000000001fff
  %l4-7: 0000000000002000 0000000000001000 0000000003b36000 000000000000000d
000002a1020cd6f0 swrand:rnd_handler+14 (0, 2a1020cdca0, 512c90a521, 60035591bf2, 1a25338, 1a25000)
  %l0-3: 00000600371d1b80 8000000000000000 0000000000000001 0000000000080000
  %l4-7: 0000060035591be0 0000000000010000 00000000fffeffff 000000007bfedea4
000002a1020cd7a0 genunix:callout_list_expire+5c (60033bb0940, 60035a4be80, 80000000, 0, bfffffffffffffff, 4000000000000000)
  %l0-3: 00000600371d1b80 8000000000000000 0000000000000001 0000000000080000
  %l4-7: 0000060035591be0 0000000000010000 00000000fffeffff 000000007bfedea4
000002a1020cd850 genunix:callout_expire+1c (60033bb0940, 60033bb09c0, 512c90a521, 60035591bf2, 0, 60035591bf4)
  %l0-3: 0000060035a4be80 0000060035591bf0 0000000000000001 0000000000080000
  %l4-7: 0000060035591be0 0000000000010000 00000000fffeffff 0000060035591be8
000002a1020cd900 genunix:callout_execute+c (60033bb0940, 60034a84708, 10e5138, 0, 10d5400, 0)
  %l0-3: 0000060034a84708 0000060035591bf0 0000000000000001 0000000000080000
  %l4-7: 0000060035591be0 0000000000010000 00000000fffeffff 0000060035591be8
000002a1020cd9b0 genunix:taskq_thread+3b8 (60035591c28, 60035591bc0, 512c90a521, 60035591bf2, 51fe5f7609, 60035591bf4)
  %l0-3: 0000060034a84708 0000060035591bf0 0000000000000001 0000000000080000
  %l4-7: 0000060035591be0 0000000000010000 00000000fffeffff 0000060035591be8
syncing file systems... done
skipping system dump - no dump device configured
rebooting...
SC Alert: Host System has Reset
SC Alert: Indicator SYS/ACT is now SLOW BLINK
SC Alert: Indicator SYS/ACT is now STANDBY BLINK
SC Alert: Host system has shut down.
SC Alert: Indicator SYS/ACT is now SLOW BLINK
|
SC Alert: Indicator SYS/ACT is now ON

Sun Fire T200, No Keyboard
Copyright 2006 Sun Microsystems, Inc.  All rights reserved.
OpenBoot 4.25.0, 32760 MB memory available, Serial #75381276.
Ethernet address 0:14:4f:7e:3a:1c, Host ID: 847e3a1c.

Rebooting with command: boot

Solution :

Hi Friends,

I am back on Blogging after a gap of couple of years. If you are trying to upgrade a T2000 running Solaris 10 from u8 to u9 and install patch 147440-02 or above and your current firmware level is ver old, your server will never boot from the new BE. This is a Solaris Bug and the only solution to resolve it is to upgrade the Firware to  139434-09

Here in my server below was the current firmware version:

sc> showsc version -v
Advanced Lights Out Manager CMT v1.3.2
SC Firmware version: CMT 1.3.2
SC Bootmon version: CMT 1.3.2
VBSC 1.3.1
VBSC firmware built Feb  5 2007, 21:11:38
SC Bootmon Build Release: 01
SC bootmon checksum: F44BD111
SC Bootmon built Feb  5 2007, 21:18:50
SC Build Release: 01
SC firmware checksum: 48C51947
SC firmware built Feb  5 2007, 21:19:05
SC firmware flashupdate THU MAY 10 19:00:22 2007
SC System Memory Size: 32 MB
SC NVRAM Version = 12
SC hardware type: 4
FPGA Version: 4.2.4.7


I upgraded it to below:


sc> showsc version -v

Advanced Lights Out Manager CMT v1.7.11
SC Firmware version: CMT 1.7.11
SC Bootmon version: CMT 1.7.11

VBSC 1.7.3.d
VBSC firmware built Jul  6 2011, 19:27:17

SC Bootmon Build Release: 01
SC bootmon checksum: 4CB78FC8
SC Bootmon built Jul  6 2011, 19:37:05

SC Build Release: 01
SC firmware checksum: C41F3325

SC firmware built Jul  6 2011, 19:37:18
SC firmware flashupdate THU AUG 30 09:42:18 2012

SC System Memory Size: 32 MB
SC NVRAM Version = 14
SC hardware type: 4
FPGA Version: 4.2.4.7

After Firmware upgrade when i tried booted from the new BE. It worked absoluely fine.

Good Luck.

Yogesh

 

Monday, August 23, 2010

VxVM vxassist ERROR V-5-1-436 Cannot allocate space to grow volume to 1958928768 blocks

Hi All,

# vxdg -g oradgh free
DISK DEVICE TAG OFFSET LENGTH FLAGS
emcpower67 emcpower67s2 emcpower67 0 230395648 -

# df -h /fh01 /bfh01
Filesystem size used avail capacity Mounted on
/dev/vx/dsk/oradgh/fh01vol 824G 771G 50G 94% /fh01
/dev/vx/dsk/shadowdg/fh01vol 824G 228G 560G 29% /bfh01root on cidcshlspora01:

# /etc/vx/bin/vxresize -g oradgh fh01vol +230395648 emcpower67 &
9080
root on cidcshlspora01:
# VxVM vxassist ERROR V-5-1-436 Cannot allocate space to grow volume to 1958928768 blocks
VxVM vxresize ERROR V-5-1-4703 Problem running vxassist command for volume fh01vol, in diskgroup oradgh
vxtask list
TASKID PTID TYPE/STATE PCT PROGRESS
root on cidcshlspora01:

In this kind of case it may happen the block size selected for the FS while its creation is different, so you can grow the FS by taking the size from( vxassist -g dgname maxsize diskname) command as mentioned below:

# vxassist -g oradgh maxsize emcpower67
Maximum volume size: 230391808 (112496Mb)

# /etc/vx/bin/vxresize -g oradgh fh01vol +230391808 emcpower67
root on cidcshlspora01:
# df -h /fh01
Filesystem size used avail capacity Mounted on
/dev/vx/dsk/oradgh/fh01vol 934G 771G 153G 84% /fh01
root on cidcshlspora01:

So the FS grown as above.. Please let meknow if any doubts..

Thanks,
Yogesh

Sunday, August 22, 2010

metastat: host: data1: must be owner of the set for this command

Hi All,

For this error, Please run teh below command. don't worry if it is not a cluster set as well. But if it is a cluster set then confirm that it is not imported on other server.

#metaset -s data1 -C take
It would import the diskset and problem would be resolved.

Thanks,
Yogesh

Sunday, August 15, 2010

Local zone state changed after Patching

Hi All,

This is my first post to forum and the problem is related to Solaris Zones:
Issue: Solaris Local zone state changed to incomplete after patching Solaris recommended patch Cluster.
Troubleshooting and Resolution:
# zoneadm list -ivc
ID NAME STATUS PATH BRAND IP
0 global running / native shared
- seedzone incomplete /rpool/seedzone native shared
I moved to /rpool/seedzone and saw a extra file SUNWdetached.xml under that. I removed that file and also change the state of the local zone from incomplete to installed in /etc/zones/index. The issue got resolved. I rebooted the server and issue never came back.
Thanks,
Yogesh