List Info

Thread: re: 10.2.0.3 - root.sh fails when adding 2nd node ?




re: 10.2.0.3 - root.sh fails when adding 2nd node ?
user name
2007-08-10 08:38:34
Folks,
    I'm trying to install a 2nd node on my 2 node test
cluster and I
can't seem to
    get past running the root.sh script on the 2nd node. 
Whenever I
execute the root.sh
    on the newly added node (this is the very last step of
the
installer), the CSS deamon
    doesn't come up and eventually it reboots my original
node.

    From the ocssd.log files, I can tell that it has
something to do
with the 2 nodes speaking
    to each other ... either via the ocr/vote disks or
network connectivity.

    I've setup my raw partitions via fdisk, bound them in
/etc/raw and
setup permissions in udev.permissions.
    I've even cksum'd all raw devices from both nodes .. and
it all
looks good.

    Could I be missing something else? Any ideas?

    Here is that the ocssd.log complains about.

[    CSSD]2007-08-09 15:40:27.547 >USER:    CSS daemon
log for node
sdbe3, number 2, in cluster oracm_crs
[  clsdmt]Listening to
(ADDRESS=(PROTOCOL=ipc)(KEY=sdbe3DBG_CSSD))
[    CSSD]2007-08-09 15:40:27.642 [2546082016] >TRACE:  
clssscmain:
local-only set to false
[    CSSD]2007-08-09 15:40:34.506 [2546082016] >TRACE:  
clssnmReadNodeInfo: added node 1 (sdbe1) to cluster
[    CSSD]2007-08-09 15:40:34.543 [2546082016] >TRACE:  
clssnmReadNodeInfo: added node 2 (sdbe3) to cluster
[    CSSD]2007-08-09 15:40:34.548 [1082145120] >TRACE:  
clssnm_skgxnmon: skgxn init failed, rc 1
[    CSSD]2007-08-09 15:40:34.548 [2546082016] >TRACE:  
clssnm_skgxnonline: Using vacuous skgxn monitor
[    CSSD]2007-08-09 15:40:37.912 [2546082016] >TRACE:  
clssnmInitNMInfo: misscount set to 60
[    CSSD]2007-08-09 15:40:37.918 [2546082016] >TRACE:  
clssnmDiskStateChange: state from 1 to 2 disk
(0//dev/raw/raw1)
[    CSSD]2007-08-09 15:40:37.979 [2546082016] >TRACE:  
clssnmDiskStateChange: state from 1 to 2 disk
(1//dev/raw/raw3)
[    CSSD]2007-08-09 15:40:37.981 [2546082016] >TRACE:  
clssnmDiskStateChange: state from 1 to 2 disk
(2//dev/raw/raw5)
[    CSSD]2007-08-09 15:40:40.816 [1084246368] >TRACE:  
clssnmDiskStateChange: state from 2 to 4 disk
(1//dev/raw/raw3)
[    CSSD]2007-08-09 15:40:40.825 [1082145120] >TRACE:  
clssnmDiskStateChange: state from 2 to 4 disk
(0//dev/raw/raw1)
[    CSSD]2007-08-09 15:40:40.830 [1084246368] >TRACE:  
clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(483)
LATS(0)
Disk lastSeqNo(483)
[    CSSD]2007-08-09 15:40:40.837 [1082145120] >TRACE:  
clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(483)
LATS(0)
Disk lastSeqNo(483)
[    CSSD]2007-08-09 15:40:41.767 [1086347616] >TRACE:  
clssnmDiskStateChange: state from 2 to 4 disk
(2//dev/raw/raw5)
[    CSSD]2007-08-09 15:40:41.779 [1086347616] >TRACE:  
clssnmReadDskHeartbeat: node(1) is down. rcfg(2) wrtcnt(484)
LATS(0)
Disk lastSeqNo(484)
[    CSSD]2007-08-09 15:40:41.797 [2546082016] >TRACE:  
clssscSclsFatal: read value of disable
[    CSSD]2007-08-09 15:40:41.797 [1090550112] >TRACE:  
clssnmFatalThread: spawned
[    CSSD]2007-08-09 15:40:41.797 [2546082016] >TRACE:  
clssscSclsFatal: read value of disable
[    CSSD]2007-08-09 15:40:41.798 [1092651360] >TRACE:  
clssnmconnect:
connecting to node 2, flags 0x0001, connector 1
[    CSSD]2007-08-09 15:40:41.798 [1092651360] >TRACE:  
clssnmconnect:
connecting to node 0, flags 0x0000, connector 1
[    CSSD]2007-08-09 15:40:41.799 [1092651360] >TRACE:  
clssnmconnect:
connecting to node 1, flags 0x0001, connector 0
[    CSSD]2007-08-09 15:40:41.801 [1094752608] >TRACE:  
clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)
     (KEY=Oracle_CSS_LclLstnr_oracm_crs_2))
[    CSSD]2007-08-09 15:40:41.801 [1094752608] >TRACE:  
clssgmclientlsnr: listening on
(ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_sdbe3_oracm_crs))
[    CSSD]2007-08-09 15:40:42.832 [1092651360] >TRACE:  
clssnmConnComplete: connected to node 1 (con 0x2a981016c0),
     state 3 birth 0, unique 1186687891/1186687891 
prevConuni(0)
[    CSSD]2007-08-09 15:40:43.307 [1105258848] >TRACE:  
clssnmSendingThread: Connection complete
[    CSSD]2007-08-09 15:40:43.307 [1103157600] >TRACE:  
clssnmPollingThread: Connection complete
[    CSSD]2007-08-09 15:40:43.307 [1107360096] >TRACE:  
clssnmRcfgMgrThread: Connection complete
[    CSSD]2007-08-09 15:40:43.307 [1107360096] >TRACE:  
clssnmRcfgMgrThread: Local Join
[    CSSD]2007-08-09 15:40:43.307 [1107360096] >TRACE:  
clssnmLocalJoinEvent: set node(1) inactive
[    CSSD]2007-08-09 15:40:43.307 [1107360096] >WARNING:
clssnmLocalJoinEvent: takeover aborted due to UNKNOWN nodes
[    CSSD]2007-08-09 15:40:43.992 [1092651360] >TRACE:  
clssnmHandleSync: Acknowledging sync: src[1] srcName[sdbe1]
seq[5] sync[2]
[    CSSD]2007-08-09 15:40:44.309 [1107360096] >TRACE:  
clssnmRcfgMgrThread: lastleader(1) unique(1186688418)
[    CSSD]2007-08-09 15:40:44.994 [1092651360] >TRACE:  
clssnmSendVoteInfo: node(1) syncSeqNo(2)
[    CSSD]2007-08-09 15:40:46.998 [1092651360] >TRACE:  
clssnmUpdateNodeState: node 0, state (0/0) unique (0/0)
prevConuni(0)
birth (0/0)
     (old/new)
[    CSSD]2007-08-09 15:40:46.998 [1092651360] >TRACE:  
clssnmDeactivateNode: node 0 () left cluster

[    CSSD]2007-08-09 15:40:46.998 [1092651360] >TRACE:  
clssnmUpdateNodeState: node 1, state (4/3) unique
(1186687891/1186687891)
      prevConuni(0) birth (0/1) (old/new)
[    CSSD]2007-08-09 15:40:46.998 [1092651360] >TRACE:  
clssnmUpdateNodeState: node 2, state (1/2) unique
(1186688418/1186688418)
     prevConuni(0) birth (0/2) (old/new)
[    CSSD]2007-08-09 15:40:46.998 [1092651360] >USER:   
clssnmHandleUpdate: SYNC(2) from node(1) completed
[    CSSD]2007-08-09 15:40:46.998 [1092651360] >USER:   
clssnmHandleUpdate: NODE 1 (sdbe1) IS ACTIVE MEMBER OF
CLUSTER
[    CSSD]2007-08-09 15:40:46.998 [1092651360] >USER:   
clssnmHandleUpdate: NODE 2 (sdbe3) IS ACTIVE MEMBER OF
CLUSTER
[    CSSD]2007-08-09 15:40:47.002 [2546082016] >USER:   
NMEVENT_SUSPEND
[00][00][00][00]
[    CSSD]2007-08-09 15:40:47.003 [1109461344] >TRACE:  
clssgmReconfigThread:  started for reconfig (2)
[    CSSD]2007-08-09 15:40:47.003 [1109461344] >USER:   
NMEVENT_RECONFIG [00][00][00][06]
[    CSSD]2007-08-09 15:40:47.003 [1109461344] >TRACE:  
clssgmEstablishConnections: 2 nodes in cluster incarn 2
[    CSSD]2007-08-09 15:40:47.075 [1101056352] >TRACE:  
clssgmInitialRecv: (0x774770) accepted a new
      connection from node 1 born at 1 active (2, 2), vers
(10,3,1,2)
[    CSSD]2007-08-09 15:40:47.075 [1101056352] >TRACE:  
clssgmInitialRecv: conns done (2/2)
[    CSSD]2007-08-09 15:40:47.075 [1109461344] >TRACE:  
clssgmEstablishMasterNode: MASTER for 2 is node(1) birth(1)
[    CSSD]2007-08-09 15:40:47.075 [1109461344] >TRACE:  
clssgmChangeMasterNode: requeued 0 RPCs
[    CSSD]2007-08-09 15:40:47.590 [1084246368] >TRACE:  
clssnmvFatalCheck: extra node 1
[    CSSD]2007-08-09 15:40:47.590 [1084246368] >TRACE:  
clssnmvFatalCheck: fatal 1, sclsfatal 0
[    CSSD]2007-08-09 15:40:47.593 [1086347616] >TRACE:  
clssnmvFatalCheck: extra node 1
[    CSSD]2007-08-09 15:40:47.593 [1086347616] >TRACE:  
clssnmvFatalCheck: fatal 1, sclsfatal 0
[    CSSD]2007-08-09 15:40:47.600 [1082145120] >TRACE:  
clssnmvFatalCheck: extra node 1
[    CSSD]2007-08-09 15:40:47.600 [1082145120] >TRACE:  
clssnmvFatalCheck: fatal 1, sclsfatal 0
[    CSSD]2007-08-09 15:40:47.824 [1090550112] >TRACE:  
clssnmFatalThread: Fatal mode enabled
[    CSSD]2007-08-09 15:40:48.045 [1092651360] >TRACE:  
clssnmSendFatalOn: req to syncLeader(1)
[    CSSD]2007-08-09 15:40:51.322 [1103157600] >TRACE:  
clssnmPollingThread: node sdbe3 (2) missed(2) checkin(s)
[    CSSD]2007-08-09 15:41:17.132 [1109461344] >ERROR:  
clssgmSlaveCMSync: reconfig timeout on master 1

[    CSSD]2007-08-09 15:41:17.132 [1109461344] >TRACE:  
clssgmReconfigThread:  completed for reconfig(2), with
status(0)
[    CSSD]2007-08-09 15:41:17.190 [2546082016] >ERROR:  
clssgmStartNMMon: reconfig incarn 2 failed. Retrying.



-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: re: 10.2.0.3 - root.sh fails when adding 2nd node ?
user name
2007-08-10 14:33:15
How are you logged in to the 2nd node?  If you use vnc, it
is possible 
to encounter problems when you run the root.sh and you
didn't start the 
vncserver as root.

Peter Santos wrote:
> Folks,
>     I'm trying to install a 2nd node on my 2 node test
cluster and I
> can't seem to
>     get past running the root.sh script on the 2nd
node.  Whenever I
> execute the root.sh
>     on the newly added node (this is the very last step
of the
> installer), the CSS deamon
>     doesn't come up and eventually it reboots my
original node.
>
>     From the ocssd.log files, I can tell that it has
something to do
> with the 2 nodes speaking
>     to each other ... either via the ocr/vote disks or
network connectivity.
>
>     I've setup my raw partitions via fdisk, bound them
in /etc/raw and
> setup permissions in udev.permissions.
>     I've even cksum'd all raw devices from both nodes
.. and it all
> looks good.
>
>     Could I be missing something else? Any ideas?
>
>     Here is that the ocssd.log complains about.
>
> [    CSSD]2007-08-09 15:40:27.547 >USER:    CSS
daemon log for node
> sdbe3, number 2, in cluster oracm_crs
> [  clsdmt]Listening to
(ADDRESS=(PROTOCOL=ipc)(KEY=sdbe3DBG_CSSD))
> [    CSSD]2007-08-09 15:40:27.642 [2546082016]
>TRACE:   clssscmain:
> local-only set to false
> [    CSSD]2007-08-09 15:40:34.506 [2546082016]
>TRACE:  
> clssnmReadNodeInfo: added node 1 (sdbe1) to cluster
> [    CSSD]2007-08-09 15:40:34.543 [2546082016]
>TRACE:  
> clssnmReadNodeInfo: added node 2 (sdbe3) to cluster
> [    CSSD]2007-08-09 15:40:34.548 [1082145120]
>TRACE:  
> clssnm_skgxnmon: skgxn init failed, rc 1
> [    CSSD]2007-08-09 15:40:34.548 [2546082016]
>TRACE:  
> clssnm_skgxnonline: Using vacuous skgxn monitor
> [    CSSD]2007-08-09 15:40:37.912 [2546082016]
>TRACE:  
> clssnmInitNMInfo: misscount set to 60
> [    CSSD]2007-08-09 15:40:37.918 [2546082016]
>TRACE:  
> clssnmDiskStateChange: state from 1 to 2 disk
(0//dev/raw/raw1)
> [    CSSD]2007-08-09 15:40:37.979 [2546082016]
>TRACE:  
> clssnmDiskStateChange: state from 1 to 2 disk
(1//dev/raw/raw3)
> [    CSSD]2007-08-09 15:40:37.981 [2546082016]
>TRACE:  
> clssnmDiskStateChange: state from 1 to 2 disk
(2//dev/raw/raw5)
> [    CSSD]2007-08-09 15:40:40.816 [1084246368]
>TRACE:  
> clssnmDiskStateChange: state from 2 to 4 disk
(1//dev/raw/raw3)
> [    CSSD]2007-08-09 15:40:40.825 [1082145120]
>TRACE:  
> clssnmDiskStateChange: state from 2 to 4 disk
(0//dev/raw/raw1)
> [    CSSD]2007-08-09 15:40:40.830 [1084246368]
>TRACE:  
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
> Disk lastSeqNo(483)
> [    CSSD]2007-08-09 15:40:40.837 [1082145120]
>TRACE:  
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
> Disk lastSeqNo(483)
> [    CSSD]2007-08-09 15:40:41.767 [1086347616]
>TRACE:  
> clssnmDiskStateChange: state from 2 to 4 disk
(2//dev/raw/raw5)
> [    CSSD]2007-08-09 15:40:41.779 [1086347616]
>TRACE:  
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(484) LATS(0)
> Disk lastSeqNo(484)
> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:  
> clssscSclsFatal: read value of disable
> [    CSSD]2007-08-09 15:40:41.797 [1090550112]
>TRACE:  
> clssnmFatalThread: spawned
> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:  
> clssscSclsFatal: read value of disable
> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 2, flags 0x0001, connector 1
> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 0, flags 0x0000, connector 1
> [    CSSD]2007-08-09 15:40:41.799 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 1, flags 0x0001, connector 0
> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:  
> clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)
>      (KEY=Oracle_CSS_LclLstnr_oracm_crs_2))
> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:  
> clssgmclientlsnr: listening on
> (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_sdbe3_oracm_crs))
> [    CSSD]2007-08-09 15:40:42.832 [1092651360]
>TRACE:  
> clssnmConnComplete: connected to node 1 (con
0x2a981016c0),
>      state 3 birth 0, unique 1186687891/1186687891 
prevConuni(0)
> [    CSSD]2007-08-09 15:40:43.307 [1105258848]
>TRACE:  
> clssnmSendingThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1103157600]
>TRACE:  
> clssnmPollingThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:  
> clssnmRcfgMgrThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:  
> clssnmRcfgMgrThread: Local Join
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:  
> clssnmLocalJoinEvent: set node(1) inactive
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>WARNING:
> clssnmLocalJoinEvent: takeover aborted due to UNKNOWN
nodes
> [    CSSD]2007-08-09 15:40:43.992 [1092651360]
>TRACE:  
> clssnmHandleSync: Acknowledging sync: src[1]
srcName[sdbe1] seq[5] sync[2]
> [    CSSD]2007-08-09 15:40:44.309 [1107360096]
>TRACE:  
> clssnmRcfgMgrThread: lastleader(1) unique(1186688418)
> [    CSSD]2007-08-09 15:40:44.994 [1092651360]
>TRACE:  
> clssnmSendVoteInfo: node(1) syncSeqNo(2)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:  
> clssnmUpdateNodeState: node 0, state (0/0) unique (0/0)
prevConuni(0)
> birth (0/0)
>      (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:  
> clssnmDeactivateNode: node 0 () left cluster
>
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:  
> clssnmUpdateNodeState: node 1, state (4/3) unique
(1186687891/1186687891)
>       prevConuni(0) birth (0/1) (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:  
> clssnmUpdateNodeState: node 2, state (1/2) unique
(1186688418/1186688418)
>      prevConuni(0) birth (0/2) (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:   
> clssnmHandleUpdate: SYNC(2) from node(1) completed
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:   
> clssnmHandleUpdate: NODE 1 (sdbe1) IS ACTIVE MEMBER OF
CLUSTER
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:   
> clssnmHandleUpdate: NODE 2 (sdbe3) IS ACTIVE MEMBER OF
CLUSTER
> [    CSSD]2007-08-09 15:40:47.002 [2546082016]
>USER:    NMEVENT_SUSPEND
> [00][00][00][00]
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:  
> clssgmReconfigThread:  started for reconfig (2)
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>USER:   
> NMEVENT_RECONFIG [00][00][00][06]
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:  
> clssgmEstablishConnections: 2 nodes in cluster incarn
2
> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:  
> clssgmInitialRecv: (0x774770) accepted a new
>       connection from node 1 born at 1 active (2, 2),
vers (10,3,1,2)
> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:  
> clssgmInitialRecv: conns done (2/2)
> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:  
> clssgmEstablishMasterNode: MASTER for 2 is node(1)
birth(1)
> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:  
> clssgmChangeMasterNode: requeued 0 RPCs
> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:  
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:  
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:  
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:  
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:  
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:  
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.824 [1090550112]
>TRACE:  
> clssnmFatalThread: Fatal mode enabled
> [    CSSD]2007-08-09 15:40:48.045 [1092651360]
>TRACE:  
> clssnmSendFatalOn: req to syncLeader(1)
> [    CSSD]2007-08-09 15:40:51.322 [1103157600]
>TRACE:  
> clssnmPollingThread: node sdbe3 (2) missed(2)
checkin(s)
> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>ERROR:  
> clssgmSlaveCMSync: reconfig timeout on master 1
>
> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>TRACE:  
> clssgmReconfigThread:  completed for reconfig(2), with
status(0)
> [    CSSD]2007-08-09 15:41:17.190 [2546082016]
>ERROR:  
> clssgmStartNMMon: reconfig incarn 2 failed. Retrying.
>
>
>
>   


-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: re: 10.2.0.3 - root.sh fails when adding 2nd node ?
user name
2007-08-10 14:56:08
Check List:
-- are all IP resolved on second node and on the fist 1 to
the corerct IP:

 host1, host1-priv, host1-vip
 host2, host2-priv, hgost2-vip
(use your names)

-- Are privare anbd public on the same interface on all
nodes?

-- Are all jost names short enough (I'd better avoid names
> 8 symbols).

-- For OCRFile, is output of 'od <your-OCR-file> |
head -100' the same on 
all nodes.

- For CSS File, the same

-- Is CSSFile writable by Oracle user on all nodes ('disk'
group is not 
enough).

-- Is linux the same (uname -a')?

-- can you ping host1-priv from host-2? Vice versa?

-- can oracle slogin from host1 to host2? Vice versa?

(I dont remember exact command, but check OCRFile
configuration and CSSFile 
configuration on all nodes).


----- Original Message ----- 
From: "Peter Santos" <psantoscheetahmail.com>
To: <suse-oraclesuse.com>
Sent: Friday, August 10, 2007 6:38 AM
Subject: [suse-oracle] re: 10.2.0.3 - root.sh fails when
adding 2nd node ?


> Folks,
>    I'm trying to install a 2nd node on my 2 node test
cluster and I
> can't seem to
>    get past running the root.sh script on the 2nd node.
 Whenever I
> execute the root.sh
>    on the newly added node (this is the very last step
of the
> installer), the CSS deamon
>    doesn't come up and eventually it reboots my
original node.
>
>    From the ocssd.log files, I can tell that it has
something to do
> with the 2 nodes speaking
>    to each other ... either via the ocr/vote disks or
network 
> connectivity.
>
>    I've setup my raw partitions via fdisk, bound them
in /etc/raw and
> setup permissions in udev.permissions.
>    I've even cksum'd all raw devices from both nodes ..
and it all
> looks good.
>
>    Could I be missing something else? Any ideas?
>
>    Here is that the ocssd.log complains about.
>
> [    CSSD]2007-08-09 15:40:27.547 >USER:    CSS
daemon log for node
> sdbe3, number 2, in cluster oracm_crs
> [  clsdmt]Listening to
(ADDRESS=(PROTOCOL=ipc)(KEY=sdbe3DBG_CSSD))
> [    CSSD]2007-08-09 15:40:27.642 [2546082016]
>TRACE:   clssscmain:
> local-only set to false
> [    CSSD]2007-08-09 15:40:34.506 [2546082016]
>TRACE:
> clssnmReadNodeInfo: added node 1 (sdbe1) to cluster
> [    CSSD]2007-08-09 15:40:34.543 [2546082016]
>TRACE:
> clssnmReadNodeInfo: added node 2 (sdbe3) to cluster
> [    CSSD]2007-08-09 15:40:34.548 [1082145120]
>TRACE:
> clssnm_skgxnmon: skgxn init failed, rc 1
> [    CSSD]2007-08-09 15:40:34.548 [2546082016]
>TRACE:
> clssnm_skgxnonline: Using vacuous skgxn monitor
> [    CSSD]2007-08-09 15:40:37.912 [2546082016]
>TRACE:
> clssnmInitNMInfo: misscount set to 60
> [    CSSD]2007-08-09 15:40:37.918 [2546082016]
>TRACE:
> clssnmDiskStateChange: state from 1 to 2 disk
(0//dev/raw/raw1)
> [    CSSD]2007-08-09 15:40:37.979 [2546082016]
>TRACE:
> clssnmDiskStateChange: state from 1 to 2 disk
(1//dev/raw/raw3)
> [    CSSD]2007-08-09 15:40:37.981 [2546082016]
>TRACE:
> clssnmDiskStateChange: state from 1 to 2 disk
(2//dev/raw/raw5)
> [    CSSD]2007-08-09 15:40:40.816 [1084246368]
>TRACE:
> clssnmDiskStateChange: state from 2 to 4 disk
(1//dev/raw/raw3)
> [    CSSD]2007-08-09 15:40:40.825 [1082145120]
>TRACE:
> clssnmDiskStateChange: state from 2 to 4 disk
(0//dev/raw/raw1)
> [    CSSD]2007-08-09 15:40:40.830 [1084246368]
>TRACE:
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
> Disk lastSeqNo(483)
> [    CSSD]2007-08-09 15:40:40.837 [1082145120]
>TRACE:
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
> Disk lastSeqNo(483)
> [    CSSD]2007-08-09 15:40:41.767 [1086347616]
>TRACE:
> clssnmDiskStateChange: state from 2 to 4 disk
(2//dev/raw/raw5)
> [    CSSD]2007-08-09 15:40:41.779 [1086347616]
>TRACE:
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(484) LATS(0)
> Disk lastSeqNo(484)
> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:
> clssscSclsFatal: read value of disable
> [    CSSD]2007-08-09 15:40:41.797 [1090550112]
>TRACE:
> clssnmFatalThread: spawned
> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:
> clssscSclsFatal: read value of disable
> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 2, flags 0x0001, connector 1
> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 0, flags 0x0000, connector 1
> [    CSSD]2007-08-09 15:40:41.799 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 1, flags 0x0001, connector 0
> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:
> clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)
>     (KEY=Oracle_CSS_LclLstnr_oracm_crs_2))
> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:
> clssgmclientlsnr: listening on
> (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_sdbe3_oracm_crs))
> [    CSSD]2007-08-09 15:40:42.832 [1092651360]
>TRACE:
> clssnmConnComplete: connected to node 1 (con
0x2a981016c0),
>     state 3 birth 0, unique 1186687891/1186687891 
prevConuni(0)
> [    CSSD]2007-08-09 15:40:43.307 [1105258848]
>TRACE:
> clssnmSendingThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1103157600]
>TRACE:
> clssnmPollingThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
> clssnmRcfgMgrThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
> clssnmRcfgMgrThread: Local Join
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
> clssnmLocalJoinEvent: set node(1) inactive
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>WARNING:
> clssnmLocalJoinEvent: takeover aborted due to UNKNOWN
nodes
> [    CSSD]2007-08-09 15:40:43.992 [1092651360]
>TRACE:
> clssnmHandleSync: Acknowledging sync: src[1]
srcName[sdbe1] seq[5] sync[2]
> [    CSSD]2007-08-09 15:40:44.309 [1107360096]
>TRACE:
> clssnmRcfgMgrThread: lastleader(1) unique(1186688418)
> [    CSSD]2007-08-09 15:40:44.994 [1092651360]
>TRACE:
> clssnmSendVoteInfo: node(1) syncSeqNo(2)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmUpdateNodeState: node 0, state (0/0) unique (0/0)
prevConuni(0)
> birth (0/0)
>     (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmDeactivateNode: node 0 () left cluster
>
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmUpdateNodeState: node 1, state (4/3) unique
(1186687891/1186687891)
>      prevConuni(0) birth (0/1) (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmUpdateNodeState: node 2, state (1/2) unique
(1186688418/1186688418)
>     prevConuni(0) birth (0/2) (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
> clssnmHandleUpdate: SYNC(2) from node(1) completed
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
> clssnmHandleUpdate: NODE 1 (sdbe1) IS ACTIVE MEMBER OF
CLUSTER
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
> clssnmHandleUpdate: NODE 2 (sdbe3) IS ACTIVE MEMBER OF
CLUSTER
> [    CSSD]2007-08-09 15:40:47.002 [2546082016]
>USER:    NMEVENT_SUSPEND
> [00][00][00][00]
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:
> clssgmReconfigThread:  started for reconfig (2)
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>USER:
> NMEVENT_RECONFIG [00][00][00][06]
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:
> clssgmEstablishConnections: 2 nodes in cluster incarn
2
> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:
> clssgmInitialRecv: (0x774770) accepted a new
>      connection from node 1 born at 1 active (2, 2),
vers (10,3,1,2)
> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:
> clssgmInitialRecv: conns done (2/2)
> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:
> clssgmEstablishMasterNode: MASTER for 2 is node(1)
birth(1)
> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:
> clssgmChangeMasterNode: requeued 0 RPCs
> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.824 [1090550112]
>TRACE:
> clssnmFatalThread: Fatal mode enabled
> [    CSSD]2007-08-09 15:40:48.045 [1092651360]
>TRACE:
> clssnmSendFatalOn: req to syncLeader(1)
> [    CSSD]2007-08-09 15:40:51.322 [1103157600]
>TRACE:
> clssnmPollingThread: node sdbe3 (2) missed(2)
checkin(s)
> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>ERROR:
> clssgmSlaveCMSync: reconfig timeout on master 1
>
> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>TRACE:
> clssgmReconfigThread:  completed for reconfig(2), with
status(0)
> [    CSSD]2007-08-09 15:41:17.190 [2546082016]
>ERROR:
> clssgmStartNMMon: reconfig incarn 2 failed. Retrying.
>
>
>
> -- 
> To unsubscribe, email: suse-oracle-unsubscribesuse.com
> For additional commands, email: suse-oracle-helpsuse.com
> Please see http://www.suse.com/oracl
e/ before posting
>
> 


-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


SuSe and Oracle11- why SLES9 is not supported while RHEL4 (and OL4) are supported?
user name
2007-08-10 15:43:07
http://download.oracle.c
om/docs/cd/B28359_01/install.111/b32002/pre_install.htm#CIHF
ICFD

Compatibility:

RHEL4 and RHEL5 (reasonable - old and solid RHEL4, and next
generation 
RHEL5)
Oracle Linux 4 and 5 (the same)
AsiaLinux 2 and 3 (the same)
SLES10 and no SLES9 !

Now let's notice ## of problemss with Oracle  SLES10
(starting with Oracle9 
incompatibility, network problems with StandBy, terrible
installation system 
and so on). If I use RHEL or OL, I can use old solid and
well compatible 
system (I don't need ANY new features of SHES10 on DB
servers; no ANY!!!); 
if I use SLES9, I am enforced to switch onto Oracle Linux 4
or 5.

Is it because of Oracle or because of SuSe? We see the same
problem with 
preinstalled systems - you can order DELl with RHEL4 or
RHEL5, but can't 
with SLES9 (SLES10 only).



----- Original Message ----- 
From: "Peter Santos" <psantoscheetahmail.com>
To: <suse-oraclesuse.com>
Sent: Friday, August 10, 2007 6:38 AM
Subject: [suse-oracle] re: 10.2.0.3 - root.sh fails when
adding 2nd node ?


> Folks,
>    I'm trying to install a 2nd node on my 2 node test
cluster and I
> can't seem to
>    get past running the root.sh script on the 2nd node.
 Whenever I
> execute the root.sh
>    on the newly added node (this is the very last step
of the
> installer), the CSS deamon
>    doesn't come up and eventually it reboots my
original node.
>
>    From the ocssd.log files, I can tell that it has
something to do
> with the 2 nodes speaking
>    to each other ... either via the ocr/vote disks or
network 
> connectivity.
>
>    I've setup my raw partitions via fdisk, bound them
in /etc/raw and
> setup permissions in udev.permissions.
>    I've even cksum'd all raw devices from both nodes ..
and it all
> looks good.
>
>    Could I be missing something else? Any ideas?
>
>    Here is that the ocssd.log complains about.
>
> [    CSSD]2007-08-09 15:40:27.547 >USER:    CSS
daemon log for node
> sdbe3, number 2, in cluster oracm_crs
> [  clsdmt]Listening to
(ADDRESS=(PROTOCOL=ipc)(KEY=sdbe3DBG_CSSD))
> [    CSSD]2007-08-09 15:40:27.642 [2546082016]
>TRACE:   clssscmain:
> local-only set to false
> [    CSSD]2007-08-09 15:40:34.506 [2546082016]
>TRACE:
> clssnmReadNodeInfo: added node 1 (sdbe1) to cluster
> [    CSSD]2007-08-09 15:40:34.543 [2546082016]
>TRACE:
> clssnmReadNodeInfo: added node 2 (sdbe3) to cluster
> [    CSSD]2007-08-09 15:40:34.548 [1082145120]
>TRACE:
> clssnm_skgxnmon: skgxn init failed, rc 1
> [    CSSD]2007-08-09 15:40:34.548 [2546082016]
>TRACE:
> clssnm_skgxnonline: Using vacuous skgxn monitor
> [    CSSD]2007-08-09 15:40:37.912 [2546082016]
>TRACE:
> clssnmInitNMInfo: misscount set to 60
> [    CSSD]2007-08-09 15:40:37.918 [2546082016]
>TRACE:
> clssnmDiskStateChange: state from 1 to 2 disk
(0//dev/raw/raw1)
> [    CSSD]2007-08-09 15:40:37.979 [2546082016]
>TRACE:
> clssnmDiskStateChange: state from 1 to 2 disk
(1//dev/raw/raw3)
> [    CSSD]2007-08-09 15:40:37.981 [2546082016]
>TRACE:
> clssnmDiskStateChange: state from 1 to 2 disk
(2//dev/raw/raw5)
> [    CSSD]2007-08-09 15:40:40.816 [1084246368]
>TRACE:
> clssnmDiskStateChange: state from 2 to 4 disk
(1//dev/raw/raw3)
> [    CSSD]2007-08-09 15:40:40.825 [1082145120]
>TRACE:
> clssnmDiskStateChange: state from 2 to 4 disk
(0//dev/raw/raw1)
> [    CSSD]2007-08-09 15:40:40.830 [1084246368]
>TRACE:
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
> Disk lastSeqNo(483)
> [    CSSD]2007-08-09 15:40:40.837 [1082145120]
>TRACE:
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
> Disk lastSeqNo(483)
> [    CSSD]2007-08-09 15:40:41.767 [1086347616]
>TRACE:
> clssnmDiskStateChange: state from 2 to 4 disk
(2//dev/raw/raw5)
> [    CSSD]2007-08-09 15:40:41.779 [1086347616]
>TRACE:
> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(484) LATS(0)
> Disk lastSeqNo(484)
> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:
> clssscSclsFatal: read value of disable
> [    CSSD]2007-08-09 15:40:41.797 [1090550112]
>TRACE:
> clssnmFatalThread: spawned
> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:
> clssscSclsFatal: read value of disable
> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 2, flags 0x0001, connector 1
> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 0, flags 0x0000, connector 1
> [    CSSD]2007-08-09 15:40:41.799 [1092651360]
>TRACE:   clssnmconnect:
> connecting to node 1, flags 0x0001, connector 0
> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:
> clssgmclientlsnr: listening on (ADDRESS=(PROTOCOL=ipc)
>     (KEY=Oracle_CSS_LclLstnr_oracm_crs_2))
> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:
> clssgmclientlsnr: listening on
> (ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_sdbe3_oracm_crs))
> [    CSSD]2007-08-09 15:40:42.832 [1092651360]
>TRACE:
> clssnmConnComplete: connected to node 1 (con
0x2a981016c0),
>     state 3 birth 0, unique 1186687891/1186687891 
prevConuni(0)
> [    CSSD]2007-08-09 15:40:43.307 [1105258848]
>TRACE:
> clssnmSendingThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1103157600]
>TRACE:
> clssnmPollingThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
> clssnmRcfgMgrThread: Connection complete
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
> clssnmRcfgMgrThread: Local Join
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
> clssnmLocalJoinEvent: set node(1) inactive
> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>WARNING:
> clssnmLocalJoinEvent: takeover aborted due to UNKNOWN
nodes
> [    CSSD]2007-08-09 15:40:43.992 [1092651360]
>TRACE:
> clssnmHandleSync: Acknowledging sync: src[1]
srcName[sdbe1] seq[5] sync[2]
> [    CSSD]2007-08-09 15:40:44.309 [1107360096]
>TRACE:
> clssnmRcfgMgrThread: lastleader(1) unique(1186688418)
> [    CSSD]2007-08-09 15:40:44.994 [1092651360]
>TRACE:
> clssnmSendVoteInfo: node(1) syncSeqNo(2)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmUpdateNodeState: node 0, state (0/0) unique (0/0)
prevConuni(0)
> birth (0/0)
>     (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmDeactivateNode: node 0 () left cluster
>
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmUpdateNodeState: node 1, state (4/3) unique
(1186687891/1186687891)
>      prevConuni(0) birth (0/1) (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
> clssnmUpdateNodeState: node 2, state (1/2) unique
(1186688418/1186688418)
>     prevConuni(0) birth (0/2) (old/new)
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
> clssnmHandleUpdate: SYNC(2) from node(1) completed
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
> clssnmHandleUpdate: NODE 1 (sdbe1) IS ACTIVE MEMBER OF
CLUSTER
> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
> clssnmHandleUpdate: NODE 2 (sdbe3) IS ACTIVE MEMBER OF
CLUSTER
> [    CSSD]2007-08-09 15:40:47.002 [2546082016]
>USER:    NMEVENT_SUSPEND
> [00][00][00][00]
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:
> clssgmReconfigThread:  started for reconfig (2)
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>USER:
> NMEVENT_RECONFIG [00][00][00][06]
> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:
> clssgmEstablishConnections: 2 nodes in cluster incarn
2
> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:
> clssgmInitialRecv: (0x774770) accepted a new
>      connection from node 1 born at 1 active (2, 2),
vers (10,3,1,2)
> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:
> clssgmInitialRecv: conns done (2/2)
> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:
> clssgmEstablishMasterNode: MASTER for 2 is node(1)
birth(1)
> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:
> clssgmChangeMasterNode: requeued 0 RPCs
> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:
> clssnmvFatalCheck: extra node 1
> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:
> clssnmvFatalCheck: fatal 1, sclsfatal 0
> [    CSSD]2007-08-09 15:40:47.824 [1090550112]
>TRACE:
> clssnmFatalThread: Fatal mode enabled
> [    CSSD]2007-08-09 15:40:48.045 [1092651360]
>TRACE:
> clssnmSendFatalOn: req to syncLeader(1)
> [    CSSD]2007-08-09 15:40:51.322 [1103157600]
>TRACE:
> clssnmPollingThread: node sdbe3 (2) missed(2)
checkin(s)
> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>ERROR:
> clssgmSlaveCMSync: reconfig timeout on master 1
>
> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>TRACE:
> clssgmReconfigThread:  completed for reconfig(2), with
status(0)
> [    CSSD]2007-08-09 15:41:17.190 [2546082016]
>ERROR:
> clssgmStartNMMon: reconfig incarn 2 failed. Retrying.
>
>
>
> -- 
> To unsubscribe, email: suse-oracle-unsubscribesuse.com
> For additional commands, email: suse-oracle-helpsuse.com
> Please see http://www.suse.com/oracl
e/ before posting
>
> 


-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: SuSe and Oracle11- why SLES9 is not supported while RHEL4 (and OL4) are supported?
user name
2007-08-10 16:10:46
>>> On 8/10/2007 at 1:43 PM,
"Alexei_Roudnev" <Alexei_Roudnevexigengroup.com>
wrote:
> http://download.oracle.com/docs/cd/
B28359_01/install.111/b32002/pre_install.h 
> tm#CIHFICFD
> 
> Compatibility:
> 
> RHEL4 and RHEL5 (reasonable - old and solid RHEL4, and
next generation 
> RHEL5)
> Oracle Linux 4 and 5 (the same)
> AsiaLinux 2 and 3 (the same)
> SLES10 and no SLES9 !
...
> 
> Is it because of Oracle or because of SuSe? 

I guess Oracle made informed decision to support only SLES10
(and possibly future SLES11 release), similar to Oracle
10gR2. No surprise here.

We (Novell/SUSE) recommend customers to start adopting
SLES10 SP1 to make it even better platform 

-Arun




--
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: SuSe and Oracle11- why SLES9 is notsupported while RHEL4 (and OL4) are supported?
user name
2007-08-10 17:33:46
Problem is that adopting SLES10 makes a lot of tropubles and
dont bring any 
value.

Examples:
- iSCSI - SLES10 missed multiport support and used tricked
and very 
unconvenient open-iSCSI insrtead of well-tested and reliable
Cisco-iscsi in 
SLES8, SLES9, RHEL4 and UL4.
- DataGuard tests  SLES10 shows strange lost of SQL*Net
connection because 
of async io problem with the network;
- numerous people reports OCFSv2 problems with SLES10
- numerous reports or RAC cluster problems
- SLES10 can't work with Oracle9i. In reality you can't
migrate because of 
this, if you use Oracle9i.
- SLES10 your system is a mess.

Sysadmins are not dumb - seen all this, they select RHEL4 or
UL4 - these 2 
systems are well compatible with all Oracle versions (I can
even run 
Oracle8i on RHEL4, with some hacks) and are rock-solid.

It became more difficult to support SLES vs RHEL/UL after
Oracle11 
announcement.

PS. The same happen with SLES8 - very solid system was
abandoned by Novell - 
when RHEL and others supported new ET64T cpu-s on their
RHEL3 systems, 
Novell did not adapted SLES8 onto it.

----- Original Message ----- 
From: "Arun Singh" <Arun.Singhnovell.com>
To: <suse-oraclesuse.com>
Sent: Friday, August 10, 2007 2:10 PM
Subject: Re: [suse-oracle] SuSe and Oracle11- why SLES9 is
notsupported 
while RHEL4 (and OL4) are supported?


>>> On 8/10/2007 at 1:43 PM,
"Alexei_Roudnev" 
>>> <Alexei_Roudnevexigengroup.com>
wrote:
> http://download.oracle.com/docs/cd/
B28359_01/install.111/b32002/pre_install.h
> tm#CIHFICFD
>
> Compatibility:
>
> RHEL4 and RHEL5 (reasonable - old and solid RHEL4, and
next generation
> RHEL5)
> Oracle Linux 4 and 5 (the same)
> AsiaLinux 2 and 3 (the same)
> SLES10 and no SLES9 !
...
>
> Is it because of Oracle or because of SuSe?

I guess Oracle made informed decision to support only SLES10
(and possibly 
future SLES11 release), similar to Oracle 10gR2. No surprise
here.

We (Novell/SUSE) recommend customers to start adopting
SLES10 SP1 to make it 
even better platform 

-Arun




-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting



-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: re: 10.2.0.3 - root.sh fails when adding 2nd node ?
user name
2007-08-10 23:24:00
We finally figured it out.

Turns out that when our sa's gave us the configured machine,
the MTU on
the private interconnect was set to Jumbo Frames (MTU =
9000), and
I remember running a check where I ping'd each private
interconnect with
a ping -s 9000 <ip of interconnect> .. and it worked.
Maybe the
switch wasn't setup to handle jumbo frames... anyway when we
set the MTU
back to 1500 it all worked like a charm .. only wasted
about
3 days.

Does anyone know of the best way to ensure that Jumbo Frames
is working
.. This is not the firs time our SA's have told me that
Jumbo Frames
was configured on the switch when it wasn't. I wonder if
there is a good
way to test this without bringing up the web interface to
the
switch etc .. It seems like sending the interconnect IP's
large packets
will succeed even when Jumbo Frames is not configured?

-peter




Bart Goossens wrote:
> How are you logged in to the 2nd node?  If you use vnc,
it is possible
> to encounter problems when you run the root.sh and you
didn't start
> the vncserver as root.
>
> Peter Santos wrote:
>> Folks,
>>     I'm trying to install a 2nd node on my 2 node
test cluster and I
>> can't seem to
>>     get past running the root.sh script on the 2nd
node.  Whenever I
>> execute the root.sh
>>     on the newly added node (this is the very last
step of the
>> installer), the CSS deamon
>>     doesn't come up and eventually it reboots my
original node.
>>
>>     From the ocssd.log files, I can tell that it
has something to do
>> with the 2 nodes speaking
>>     to each other ... either via the ocr/vote disks
or network
>> connectivity.
>>
>>     I've setup my raw partitions via fdisk, bound
them in /etc/raw and
>> setup permissions in udev.permissions.
>>     I've even cksum'd all raw devices from both
nodes .. and it all
>> looks good.
>>
>>     Could I be missing something else? Any ideas?
>>
>>     Here is that the ocssd.log complains about.
>>
>> [    CSSD]2007-08-09 15:40:27.547 >USER:    CSS
daemon log for node
>> sdbe3, number 2, in cluster oracm_crs
>> [  clsdmt]Listening to
(ADDRESS=(PROTOCOL=ipc)(KEY=sdbe3DBG_CSSD))
>> [    CSSD]2007-08-09 15:40:27.642 [2546082016]
>TRACE:   clssscmain:
>> local-only set to false
>> [    CSSD]2007-08-09 15:40:34.506 [2546082016]
>TRACE: 
>> clssnmReadNodeInfo: added node 1 (sdbe1) to
cluster
>> [    CSSD]2007-08-09 15:40:34.543 [2546082016]
>TRACE: 
>> clssnmReadNodeInfo: added node 2 (sdbe3) to
cluster
>> [    CSSD]2007-08-09 15:40:34.548 [1082145120]
>TRACE: 
>> clssnm_skgxnmon: skgxn init failed, rc 1
>> [    CSSD]2007-08-09 15:40:34.548 [2546082016]
>TRACE: 
>> clssnm_skgxnonline: Using vacuous skgxn monitor
>> [    CSSD]2007-08-09 15:40:37.912 [2546082016]
>TRACE: 
>> clssnmInitNMInfo: misscount set to 60
>> [    CSSD]2007-08-09 15:40:37.918 [2546082016]
>TRACE: 
>> clssnmDiskStateChange: state from 1 to 2 disk
(0//dev/raw/raw1)
>> [    CSSD]2007-08-09 15:40:37.979 [2546082016]
>TRACE: 
>> clssnmDiskStateChange: state from 1 to 2 disk
(1//dev/raw/raw3)
>> [    CSSD]2007-08-09 15:40:37.981 [2546082016]
>TRACE: 
>> clssnmDiskStateChange: state from 1 to 2 disk
(2//dev/raw/raw5)
>> [    CSSD]2007-08-09 15:40:40.816 [1084246368]
>TRACE: 
>> clssnmDiskStateChange: state from 2 to 4 disk
(1//dev/raw/raw3)
>> [    CSSD]2007-08-09 15:40:40.825 [1082145120]
>TRACE: 
>> clssnmDiskStateChange: state from 2 to 4 disk
(0//dev/raw/raw1)
>> [    CSSD]2007-08-09 15:40:40.830 [1084246368]
>TRACE: 
>> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
>> Disk lastSeqNo(483)
>> [    CSSD]2007-08-09 15:40:40.837 [1082145120]
>TRACE: 
>> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(483) LATS(0)
>> Disk lastSeqNo(483)
>> [    CSSD]2007-08-09 15:40:41.767 [1086347616]
>TRACE: 
>> clssnmDiskStateChange: state from 2 to 4 disk
(2//dev/raw/raw5)
>> [    CSSD]2007-08-09 15:40:41.779 [1086347616]
>TRACE: 
>> clssnmReadDskHeartbeat: node(1) is down. rcfg(2)
wrtcnt(484) LATS(0)
>> Disk lastSeqNo(484)
>> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE: 
>> clssscSclsFatal: read value of disable
>> [    CSSD]2007-08-09 15:40:41.797 [1090550112]
>TRACE: 
>> clssnmFatalThread: spawned
>> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE: 
>> clssscSclsFatal: read value of disable
>> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
>> connecting to node 2, flags 0x0001, connector 1
>> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
>> connecting to node 0, flags 0x0000, connector 1
>> [    CSSD]2007-08-09 15:40:41.799 [1092651360]
>TRACE:   clssnmconnect:
>> connecting to node 1, flags 0x0001, connector 0
>> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE: 
>> clssgmclientlsnr: listening on
(ADDRESS=(PROTOCOL=ipc)
>>      (KEY=Oracle_CSS_LclLstnr_oracm_crs_2))
>> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE: 
>> clssgmclientlsnr: listening on
>>
(ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_sdbe3_oracm_crs))
>> [    CSSD]2007-08-09 15:40:42.832 [1092651360]
>TRACE: 
>> clssnmConnComplete: connected to node 1 (con
0x2a981016c0),
>>      state 3 birth 0, unique 1186687891/1186687891 
prevConuni(0)
>> [    CSSD]2007-08-09 15:40:43.307 [1105258848]
>TRACE: 
>> clssnmSendingThread: Connection complete
>> [    CSSD]2007-08-09 15:40:43.307 [1103157600]
>TRACE: 
>> clssnmPollingThread: Connection complete
>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE: 
>> clssnmRcfgMgrThread: Connection complete
>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE: 
>> clssnmRcfgMgrThread: Local Join
>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE: 
>> clssnmLocalJoinEvent: set node(1) inactive
>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>WARNING:
>> clssnmLocalJoinEvent: takeover aborted due to
UNKNOWN nodes
>> [    CSSD]2007-08-09 15:40:43.992 [1092651360]
>TRACE: 
>> clssnmHandleSync: Acknowledging sync: src[1]
srcName[sdbe1] seq[5]
>> sync[2]
>> [    CSSD]2007-08-09 15:40:44.309 [1107360096]
>TRACE: 
>> clssnmRcfgMgrThread: lastleader(1)
unique(1186688418)
>> [    CSSD]2007-08-09 15:40:44.994 [1092651360]
>TRACE: 
>> clssnmSendVoteInfo: node(1) syncSeqNo(2)
>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE: 
>> clssnmUpdateNodeState: node 0, state (0/0) unique
(0/0) prevConuni(0)
>> birth (0/0)
>>      (old/new)
>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE: 
>> clssnmDeactivateNode: node 0 () left cluster
>>
>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE: 
>> clssnmUpdateNodeState: node 1, state (4/3) unique
>> (1186687891/1186687891)
>>       prevConuni(0) birth (0/1) (old/new)
>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE: 
>> clssnmUpdateNodeState: node 2, state (1/2) unique
>> (1186688418/1186688418)
>>      prevConuni(0) birth (0/2) (old/new)
>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:  
>> clssnmHandleUpdate: SYNC(2) from node(1) completed
>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:  
>> clssnmHandleUpdate: NODE 1 (sdbe1) IS ACTIVE MEMBER
OF CLUSTER
>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:  
>> clssnmHandleUpdate: NODE 2 (sdbe3) IS ACTIVE MEMBER
OF CLUSTER
>> [    CSSD]2007-08-09 15:40:47.002 [2546082016]
>USER:    NMEVENT_SUSPEND
>> [00][00][00][00]
>> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE: 
>> clssgmReconfigThread:  started for reconfig (2)
>> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>USER:  
>> NMEVENT_RECONFIG [00][00][00][06]
>> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE: 
>> clssgmEstablishConnections: 2 nodes in cluster
incarn 2
>> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE: 
>> clssgmInitialRecv: (0x774770) accepted a new
>>       connection from node 1 born at 1 active (2,
2), vers (10,3,1,2)
>> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE: 
>> clssgmInitialRecv: conns done (2/2)
>> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE: 
>> clssgmEstablishMasterNode: MASTER for 2 is node(1)
birth(1)
>> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE: 
>> clssgmChangeMasterNode: requeued 0 RPCs
>> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE: 
>> clssnmvFatalCheck: extra node 1
>> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE: 
>> clssnmvFatalCheck: fatal 1, sclsfatal 0
>> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE: 
>> clssnmvFatalCheck: extra node 1
>> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE: 
>> clssnmvFatalCheck: fatal 1, sclsfatal 0
>> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE: 
>> clssnmvFatalCheck: extra node 1
>> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE: 
>> clssnmvFatalCheck: fatal 1, sclsfatal 0
>> [    CSSD]2007-08-09 15:40:47.824 [1090550112]
>TRACE: 
>> clssnmFatalThread: Fatal mode enabled
>> [    CSSD]2007-08-09 15:40:48.045 [1092651360]
>TRACE: 
>> clssnmSendFatalOn: req to syncLeader(1)
>> [    CSSD]2007-08-09 15:40:51.322 [1103157600]
>TRACE: 
>> clssnmPollingThread: node sdbe3 (2) missed(2)
checkin(s)
>> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>ERROR: 
>> clssgmSlaveCMSync: reconfig timeout on master 1
>>
>> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>TRACE: 
>> clssgmReconfigThread:  completed for reconfig(2),
with status(0)
>> [    CSSD]2007-08-09 15:41:17.190 [2546082016]
>ERROR: 
>> clssgmStartNMMon: reconfig incarn 2 failed.
Retrying.
>>
>>
>>
>>   
>
>

-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: re: 10.2.0.3 - root.sh fails when adding 2nd node ?
user name
2007-08-10 23:53:01
Run pings. If you can run pings of 8K between serevrs, then
more likely tcp 
and udp (RAC uses udp) will work as well. In addition, run
ftp for the big 
(1 GB) file and verify that you have bottleneck in network
bandwidth or disk 
system - so seen about 80 MB/second, if disks are very fast,
or 10 - 30 
MB/secobd, on usual speed disks. If network have problem,
you wil see more 
likely <= 1 MB/second transfer speed.

More likely, your network admins had it configured but did
not saved 
configuration (I am both, sys and network admin, so I can
control it here 
easily), or you run 9K pings before setting up 9K MTU (in
this case ping wil 
work tru fragmentation).

I always run many performance tests after systems are
network/configured but 
before RAC installation.

Btw, I saw some performance improvement with jumbo frames
(sugnificant on 
iSCSI and some on RAC interconnection).

----- Original Message ----- 
From: "Peter Santos" <psantoscheetahmail.com>
To: <suse-oraclesuse.com>
Sent: Friday, August 10, 2007 9:24 PM
Subject: Re: [suse-oracle] re: 10.2.0.3 - root.sh fails when
adding 2nd node 
?


> We finally figured it out.
>
> Turns out that when our sa's gave us the configured
machine, the MTU on
> the private interconnect was set to Jumbo Frames (MTU =
9000), and
> I remember running a check where I ping'd each private
interconnect with
> a ping -s 9000 <ip of interconnect> .. and it
worked. Maybe the
> switch wasn't setup to handle jumbo frames... anyway
when we set the MTU
> back to 1500 it all worked like a charm .. only wasted
about
> 3 days.
>
> Does anyone know of the best way to ensure that Jumbo
Frames is working
> .. This is not the firs time our SA's have told me that
Jumbo Frames
> was configured on the switch when it wasn't. I wonder
if there is a good
> way to test this without bringing up the web interface
to the
> switch etc .. It seems like sending the interconnect
IP's large packets
> will succeed even when Jumbo Frames is not configured?
>
> -peter
>
>
>
>
> Bart Goossens wrote:
>> How are you logged in to the 2nd node?  If you use
vnc, it is possible
>> to encounter problems when you run the root.sh and
you didn't start
>> the vncserver as root.
>>
>> Peter Santos wrote:
>>> Folks,
>>>     I'm trying to install a 2nd node on my 2
node test cluster and I
>>> can't seem to
>>>     get past running the root.sh script on the
2nd node.  Whenever I
>>> execute the root.sh
>>>     on the newly added node (this is the very
last step of the
>>> installer), the CSS deamon
>>>     doesn't come up and eventually it reboots
my original node.
>>>
>>>     From the ocssd.log files, I can tell that
it has something to do
>>> with the 2 nodes speaking
>>>     to each other ... either via the ocr/vote
disks or network
>>> connectivity.
>>>
>>>     I've setup my raw partitions via fdisk,
bound them in /etc/raw and
>>> setup permissions in udev.permissions.
>>>     I've even cksum'd all raw devices from both
nodes .. and it all
>>> looks good.
>>>
>>>     Could I be missing something else? Any
ideas?
>>>
>>>     Here is that the ocssd.log complains
about.
>>>
>>> [    CSSD]2007-08-09 15:40:27.547 >USER:   
CSS daemon log for node
>>> sdbe3, number 2, in cluster oracm_crs
>>> [  clsdmt]Listening to
(ADDRESS=(PROTOCOL=ipc)(KEY=sdbe3DBG_CSSD))
>>> [    CSSD]2007-08-09 15:40:27.642 [2546082016]
>TRACE:   clssscmain:
>>> local-only set to false
>>> [    CSSD]2007-08-09 15:40:34.506 [2546082016]
>TRACE:
>>> clssnmReadNodeInfo: added node 1 (sdbe1) to
cluster
>>> [    CSSD]2007-08-09 15:40:34.543 [2546082016]
>TRACE:
>>> clssnmReadNodeInfo: added node 2 (sdbe3) to
cluster
>>> [    CSSD]2007-08-09 15:40:34.548 [1082145120]
>TRACE:
>>> clssnm_skgxnmon: skgxn init failed, rc 1
>>> [    CSSD]2007-08-09 15:40:34.548 [2546082016]
>TRACE:
>>> clssnm_skgxnonline: Using vacuous skgxn
monitor
>>> [    CSSD]2007-08-09 15:40:37.912 [2546082016]
>TRACE:
>>> clssnmInitNMInfo: misscount set to 60
>>> [    CSSD]2007-08-09 15:40:37.918 [2546082016]
>TRACE:
>>> clssnmDiskStateChange: state from 1 to 2 disk
(0//dev/raw/raw1)
>>> [    CSSD]2007-08-09 15:40:37.979 [2546082016]
>TRACE:
>>> clssnmDiskStateChange: state from 1 to 2 disk
(1//dev/raw/raw3)
>>> [    CSSD]2007-08-09 15:40:37.981 [2546082016]
>TRACE:
>>> clssnmDiskStateChange: state from 1 to 2 disk
(2//dev/raw/raw5)
>>> [    CSSD]2007-08-09 15:40:40.816 [1084246368]
>TRACE:
>>> clssnmDiskStateChange: state from 2 to 4 disk
(1//dev/raw/raw3)
>>> [    CSSD]2007-08-09 15:40:40.825 [1082145120]
>TRACE:
>>> clssnmDiskStateChange: state from 2 to 4 disk
(0//dev/raw/raw1)
>>> [    CSSD]2007-08-09 15:40:40.830 [1084246368]
>TRACE:
>>> clssnmReadDskHeartbeat: node(1) is down.
rcfg(2) wrtcnt(483) LATS(0)
>>> Disk lastSeqNo(483)
>>> [    CSSD]2007-08-09 15:40:40.837 [1082145120]
>TRACE:
>>> clssnmReadDskHeartbeat: node(1) is down.
rcfg(2) wrtcnt(483) LATS(0)
>>> Disk lastSeqNo(483)
>>> [    CSSD]2007-08-09 15:40:41.767 [1086347616]
>TRACE:
>>> clssnmDiskStateChange: state from 2 to 4 disk
(2//dev/raw/raw5)
>>> [    CSSD]2007-08-09 15:40:41.779 [1086347616]
>TRACE:
>>> clssnmReadDskHeartbeat: node(1) is down.
rcfg(2) wrtcnt(484) LATS(0)
>>> Disk lastSeqNo(484)
>>> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:
>>> clssscSclsFatal: read value of disable
>>> [    CSSD]2007-08-09 15:40:41.797 [1090550112]
>TRACE:
>>> clssnmFatalThread: spawned
>>> [    CSSD]2007-08-09 15:40:41.797 [2546082016]
>TRACE:
>>> clssscSclsFatal: read value of disable
>>> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
>>> connecting to node 2, flags 0x0001, connector
1
>>> [    CSSD]2007-08-09 15:40:41.798 [1092651360]
>TRACE:   clssnmconnect:
>>> connecting to node 0, flags 0x0000, connector
1
>>> [    CSSD]2007-08-09 15:40:41.799 [1092651360]
>TRACE:   clssnmconnect:
>>> connecting to node 1, flags 0x0001, connector
0
>>> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:
>>> clssgmclientlsnr: listening on
(ADDRESS=(PROTOCOL=ipc)
>>>      (KEY=Oracle_CSS_LclLstnr_oracm_crs_2))
>>> [    CSSD]2007-08-09 15:40:41.801 [1094752608]
>TRACE:
>>> clssgmclientlsnr: listening on
>>>
(ADDRESS=(PROTOCOL=ipc)(KEY=OCSSD_LL_sdbe3_oracm_crs))
>>> [    CSSD]2007-08-09 15:40:42.832 [1092651360]
>TRACE:
>>> clssnmConnComplete: connected to node 1 (con
0x2a981016c0),
>>>      state 3 birth 0, unique
1186687891/1186687891  prevConuni(0)
>>> [    CSSD]2007-08-09 15:40:43.307 [1105258848]
>TRACE:
>>> clssnmSendingThread: Connection complete
>>> [    CSSD]2007-08-09 15:40:43.307 [1103157600]
>TRACE:
>>> clssnmPollingThread: Connection complete
>>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
>>> clssnmRcfgMgrThread: Connection complete
>>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
>>> clssnmRcfgMgrThread: Local Join
>>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>TRACE:
>>> clssnmLocalJoinEvent: set node(1) inactive
>>> [    CSSD]2007-08-09 15:40:43.307 [1107360096]
>WARNING:
>>> clssnmLocalJoinEvent: takeover aborted due to
UNKNOWN nodes
>>> [    CSSD]2007-08-09 15:40:43.992 [1092651360]
>TRACE:
>>> clssnmHandleSync: Acknowledging sync: src[1]
srcName[sdbe1] seq[5]
>>> sync[2]
>>> [    CSSD]2007-08-09 15:40:44.309 [1107360096]
>TRACE:
>>> clssnmRcfgMgrThread: lastleader(1)
unique(1186688418)
>>> [    CSSD]2007-08-09 15:40:44.994 [1092651360]
>TRACE:
>>> clssnmSendVoteInfo: node(1) syncSeqNo(2)
>>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
>>> clssnmUpdateNodeState: node 0, state (0/0)
unique (0/0) prevConuni(0)
>>> birth (0/0)
>>>      (old/new)
>>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
>>> clssnmDeactivateNode: node 0 () left cluster
>>>
>>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
>>> clssnmUpdateNodeState: node 1, state (4/3)
unique
>>> (1186687891/1186687891)
>>>       prevConuni(0) birth (0/1) (old/new)
>>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>TRACE:
>>> clssnmUpdateNodeState: node 2, state (1/2)
unique
>>> (1186688418/1186688418)
>>>      prevConuni(0) birth (0/2) (old/new)
>>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
>>> clssnmHandleUpdate: SYNC(2) from node(1)
completed
>>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
>>> clssnmHandleUpdate: NODE 1 (sdbe1) IS ACTIVE
MEMBER OF CLUSTER
>>> [    CSSD]2007-08-09 15:40:46.998 [1092651360]
>USER:
>>> clssnmHandleUpdate: NODE 2 (sdbe3) IS ACTIVE
MEMBER OF CLUSTER
>>> [    CSSD]2007-08-09 15:40:47.002 [2546082016]
>USER:    NMEVENT_SUSPEND
>>> [00][00][00][00]
>>> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:
>>> clssgmReconfigThread:  started for reconfig
(2)
>>> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>USER:
>>> NMEVENT_RECONFIG [00][00][00][06]
>>> [    CSSD]2007-08-09 15:40:47.003 [1109461344]
>TRACE:
>>> clssgmEstablishConnections: 2 nodes in cluster
incarn 2
>>> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:
>>> clssgmInitialRecv: (0x774770) accepted a new
>>>       connection from node 1 born at 1 active
(2, 2), vers (10,3,1,2)
>>> [    CSSD]2007-08-09 15:40:47.075 [1101056352]
>TRACE:
>>> clssgmInitialRecv: conns done (2/2)
>>> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:
>>> clssgmEstablishMasterNode: MASTER for 2 is
node(1) birth(1)
>>> [    CSSD]2007-08-09 15:40:47.075 [1109461344]
>TRACE:
>>> clssgmChangeMasterNode: requeued 0 RPCs
>>> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:
>>> clssnmvFatalCheck: extra node 1
>>> [    CSSD]2007-08-09 15:40:47.590 [1084246368]
>TRACE:
>>> clssnmvFatalCheck: fatal 1, sclsfatal 0
>>> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:
>>> clssnmvFatalCheck: extra node 1
>>> [    CSSD]2007-08-09 15:40:47.593 [1086347616]
>TRACE:
>>> clssnmvFatalCheck: fatal 1, sclsfatal 0
>>> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:
>>> clssnmvFatalCheck: extra node 1
>>> [    CSSD]2007-08-09 15:40:47.600 [1082145120]
>TRACE:
>>> clssnmvFatalCheck: fatal 1, sclsfatal 0
>>> [    CSSD]2007-08-09 15:40:47.824 [1090550112]
>TRACE:
>>> clssnmFatalThread: Fatal mode enabled
>>> [    CSSD]2007-08-09 15:40:48.045 [1092651360]
>TRACE:
>>> clssnmSendFatalOn: req to syncLeader(1)
>>> [    CSSD]2007-08-09 15:40:51.322 [1103157600]
>TRACE:
>>> clssnmPollingThread: node sdbe3 (2) missed(2)
checkin(s)
>>> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>ERROR:
>>> clssgmSlaveCMSync: reconfig timeout on master
1
>>>
>>> [    CSSD]2007-08-09 15:41:17.132 [1109461344]
>TRACE:
>>> clssgmReconfigThread:  completed for
reconfig(2), with status(0)
>>> [    CSSD]2007-08-09 15:41:17.190 [2546082016]
>ERROR:
>>> clssgmStartNMMon: reconfig incarn 2 failed.
Retrying.
>>>
>>>
>>>
>>>
>>
>>
>
> -- 
> To unsubscribe, email: suse-oracle-unsubscribesuse.com
> For additional commands, email: suse-oracle-helpsuse.com
> Please see http://www.suse.com/oracl
e/ before posting
>
> 


-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: SuSe and Oracle11- why SLES9 is notsupported while RHEL4 (and OL4) are supported?
user name
2007-08-11 09:50:56
On Fri, 2007-08-10 at 15:33 -0700, Alexei_Roudnev wrote:
> Sysadmins are not dumb - seen all this, they select
RHEL4 or UL4 - these 2 
> systems are well compatible with all Oracle versions (I
can even run 
> Oracle8i on RHEL4, with some hacks) and are
rock-solid.

Your right, sysadmins are not dumb, unfortunately, software
companies
are dumb.  They seem to be pushing shorter and shorter
product
lifecycles and have forgone "stable" products for
simply "current"
products.  Last time I looked Novell's official lifecycle
for SLES9 was
five years.  Oracle's support for their database products is
not much
better, although at least you can pay extra.

Now, 5 years may sound like a long time, but the problem is
that modern
software products are complex and generally full of bugs
when they are
released.  It usually takes 1-2 years before a product truly
becomes
stable.  Many companies won't even look at a product until
the 1st or
2nd service release, which is usually between 1-2 years
after a product
release.  Then it usually takes another few months before
it's approved
for deployment and another year to be deployed within the
company.  That
means that a product doesn't see major market penetration
until 2-3
years into it's lifecycle.  If the product only has a 5 year
lifecycle
that means only 2-3 years of truly usable time in most
companies.
That's far too short for critical software like databases
and OS's.

My guess is that this is what lead to RHEL4 being certified
while SLES9
is not.  Redhat has a 7 year lifecycle on it's products, but
Novell only
officially guarantees 5 years (as far as I know anyway). 
That means
that SLES9 is scheduled to be desupported on August 2009
only two years
from now, and it already feels abandoned.

RHEL4 on the other hand, will be supported until Feb 2012,
nearly 4 1/2
years from now, even though it's initial release only
trailed the SLES9
release by 6 months.  Those two years might not seem like
much, but they
are huge for most companies and have a huge TCO impact.  It
was one of
the major reasons that we switched from SUSE to Redhat
several years
ago. 

Later,
Tom


-- 
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


Re: SuSe and Oracle11- why SLES9 is notsupported while RHEL4 (and OL4) are supported?
user name
2007-08-11 10:46:58
>>> On 8/11/2007 at 7:50 AM, Tom Sightler
<tsightlerzeusinc.com> wrote:
> On Fri, 2007-08-10 at 15:33 -0700, Alexei_Roudnev
wrote:
>> Sysadmins are not dumb - seen all this, they select
RHEL4 or UL4 - these 2 
>> systems are well compatible with all Oracle
versions (I can even run 
>> Oracle8i on RHEL4, with some hacks) and are
rock-solid.
> 
> Your right, sysadmins are not dumb, unfortunately,
software companies
> are dumb.  They seem to be pushing shorter and shorter
product
> lifecycles and have forgone "stable" products
for simply "current"
> products.  Last time I looked Novell's official
lifecycle for SLES9 was
> five years.  Oracle's support for their database
products is not much
> better, although at least you can pay extra.
...
> My guess is that this is what lead to RHEL4 being
certified while SLES9
> is not.  Redhat has a 7 year lifecycle on it's
products, but Novell only
> officially guarantees 5 years (as far as I know
anyway).  That means
> that SLES9 is scheduled to be desupported on August
2009 only two years
> from now, and it already feels abandoned.
> 
> RHEL4 on the other hand, will be supported until Feb
2012, nearly 4 1/2
> years from now, even though it's initial release only
trailed the SLES9
> release by 6 months.  Those two years might not seem
like much, but they

Just for the benefits of all list members. Now, Novell/SUSE
support cycle (http://support.n
ovell.com/lifecycle/) is 7 years (It was 5).

For exacts dates, etc. Please refer: 
http://support.novell.com/lifecycle/lcSearchResults.jsp?st=S
USE&x=0&y=0&sl=-1&sg=-1&pid=1000

-Arun


--
To unsubscribe, email: suse-oracle-unsubscribesuse.com
For additional commands, email: suse-oracle-helpsuse.com
Please see http://www.suse.com/oracl
e/ before posting


[1-10] [11-20] [21]

about | contact  Other archives ( Real Estate discussion Medical topics )