OpenSolaris

Discussions Communities Projects Download Source Browser

Home » OpenSolaris Forums » ha-clusters » discuss

Thread: Error installing cluster

Welcome, Guest Help
Login Login
Guest Settings Guest Settings
Reply to this Thread Reply to this Thread Search Forum Search Forum Back to Thread List Back to Thread List

Permlink Replies: 26 - Last Post: Jun 23, 2008 10:34 PM by: tirth
abhilash tg
abhi.tg4u@gmail.com
Error installing cluster
Posted: Jun 7, 2008 10:21 AM

  Click to reply to this thread Reply

hi
 
 I installed cluster expree on solaris 5.11 b_86. . After installing and configuring the cluster and when i rebooted I got the foll error
 
panic[cpu0] FFFFFFFFFbc245a0 BADTRAP type=d dup...
 
and solaris fail to boot. it then shows
 
rebooting.....
 
and the screen is stuck
 
I dont have the GUI as the drivers do not support my graphics card. So iam working on command line.
_______________________________________________ ha-clusters-discuss mailing list ha-clusters-discuss at opensolaris dot org http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


frueauf

Posts: 241
From:

Registered: 5/29/07
Re: Error installing cluster
Posted: Jun 8, 2008 12:29 AM   in response to: abhilash tg

  Click to reply to this thread Reply

Hi Abhilash,

unfortunately you missed out the important information - can you please
provide the exact panic string with the subsequent stack trace?

Otherwise it is hard to tell what your problem might be.

Can you also say a bit more on your specific configuration? Is it a two
node cluster? Which storage are you using? Have you already been at the
point to configure a quorum device?

Greets
Thorsten

abhilash tg wrote:
> hi
>
> I installed cluster expree on solaris 5.11 b_86. . After installing and
> configuring the cluster and when i rebooted I got the foll error
>
> panic[cpu0] FFFFFFFFFbc245a0 BADTRAP type=d dup...
>
> and solaris fail to boot. it then shows
>
> rebooting.....
>
> and the screen is stuck
>
> I dont have the GUI as the drivers do not support my graphics card. So
> iam working on command line.
>
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


abhilash tg
abhi.tg4u@gmail.com
Re: Error installing cluster
Posted: Jun 21, 2008 10:18 AM   in response to: frueauf

  Click to reply to this thread Reply

Hi ,

I am attaching the /var/adm/messages

Abhilash

2008/6/8 Thorsten Frueauf <Thorsten dot Frueauf at sun dot com>:
Hi Abhilash,

unfortunately you missed out the important information - can you please provide the exact panic string with the subsequent stack trace?

Otherwise it is hard to tell what your problem might be.

Can you also say a bit more on your specific configuration? Is it a two node cluster? Which storage are you using? Have you already been at the point to configure a quorum device?

Greets
     Thorsten


abhilash tg wrote:
hi
  I installed cluster expree on solaris 5.11 b_86. . After installing and configuring the cluster and when i rebooted I got the foll error
 panic[cpu0] FFFFFFFFFbc245a0 BADTRAP type=d dup...
 and solaris fail to boot. it then shows
 rebooting.....
 and the screen is stuck
 I dont have the GUI as the drivers do not support my graphics card. So iam working on command line.


_______________________________________________ ha-clusters-discuss mailing list ha-clusters-discuss at opensolaris dot org http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 8, 2008 3:00 AM   in response to: abhilash tg
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi Abhilash,

When you hit the panic, was dhcp disabled on your machine. My guess is you are installing a single node cluster on a laptop and usually the dhcp is enabled. When you install cluster you need to disable dhcp.


Also, you need to provide the stack trace. The the bad trap message is insufficient.

Boot your host in non cluster mode. In order to do this, add -x parameter to the option in grub.

Then go to the directory where your core is saved. In order to find out which is the directory, run the command dumpadm. It will should you the savecore directory
e.g.
rock @ / $ dumpadm
Dump content: kernel pages
Dump device: /dev/dsk/c0t0d0s1 (swap)
Savecore directory: /var/carsh/rock
Savecore enabled: yes

Go inside that directory, in this case /var/crash/rock and see which core file is present. You will see files with *.0 or *.1 etc. Since this is your first panic ( i am guessing) it will be the *.0. Open this file with mdb

# mdb 0

This will open the core file.

Now inside mdb, you can do a $C to get the stack trace, send this o/p as this will help us to debug.



Thanks,
Tirthankar
http://blogs.sun.com/tirthankar

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 18, 2008 4:22 AM   in response to: tirth
To: Communities » ha-clusters » discuss
Cc: OpenSolaris » help
  Click to reply to this thread Reply

Hi Tirthankar

sorry for the late reply. I have tried doing the steps suggested by you. But I could not lboot using the -x option, So i booted into the fail safe mode. I could not find any directories called crash inside var

regards
Abhilash

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 18, 2008 5:34 AM   in response to: abhi-1
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

I guess you are dumping the whole memory and your swap device is smaller than the available RAm or your root FS is full.

Look at dumpadm man page and setup the options so that you take only kernel pages only

This is what my dumpadm o/p looks like
pocho4 @ / $ dumpadm
Dump content: kernel pages
Dump device: /dev/dsk/c0t0d0s1 (swap)
Savecore directory: /var/crash/pocho4
Savecore enabled: yes
pocho4 @ / $

For booting in non cluster mode, select (highlight) the normal solaris boot and press "e"
This will let you edit the arguments, add a -x at the end. Then boot with this option, thsi should allow you to boot in non cluster mode.

Thanks,
Tirthankar

http://blogs.sun.com/tirthankar

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 18, 2008 6:31 PM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

hi,
My partition is as follows. I have installed both windows XP and Solaris and for Solaris I have allotted 38 GB. Of this I have given 4 GB for Swap, 4 GB for globaldevices. And 20 GB for /.

I tried booting in non cluster mode but even after specifying the -x option the system goes into trap state. I can go to failsafe mode.

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 18, 2008 9:52 PM   in response to: abhi-1
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Your partition information looks ok. But that does not say how much data you have on each partition.

I am assuming that you are properly giving the -x parameter to boot and still hitting the panic, in that case you are hitting panic for some other software and not cluster.

For us to be able to help you, you have to configure the dump device, take the core file and send o/p of the stack trace.

Thanks,
Tirthankar
http://blogs.sun.com/tirthankar

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 19, 2008 9:49 PM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi,

I gave the -x parameter correctly but again it went back to panic state. I can get into failsafe mode. I went into fail safe mode and tried configuring the dump device but it shows cannot use that directory for dump. Also I cannot see the swap partition from failsafe mode.

Is there anything that can be done? I am only learning so kindly bear with me.

Abhilash

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 19, 2008 10:22 PM   in response to: abhi-1
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

if you have given the -x parameter correctly, then you are hitting a panic not related to cluster.

Send the exact command and error message you are getting while trying to configure the dump device in Fail Safe mode.

Thanks.
Tirthankar

http://blogs.sun.com/tirthankar

binujp

Posts: 51
From: Menlo Park

Registered: 6/1/08
Re: Error installing cluster
Posted: Jun 20, 2008 9:18 AM   in response to: abhi-1

  Click to reply to this thread Reply

Hi Abhilash,

Let's try going back to the first steps.

> I installed cluster expree on solaris 5.11 b_86. . After installing and configuring the
> cluster and when i rebooted I got the foll error

> panic[cpu0] FFFFFFFFFbc245a0 BADTRAP type=d dup...

Trap 'd' is general protection fault, too general to be of help.

I am assuming there was more on the screen after the "dup..."
To identify which module or what code caused the panic we
would need everything printed after the panic. Let's say that
is scrolling by too fast and you don't have a digi cam to take
a "screen shot".

Next step is to try booting with kmdb. You may be panicing
before a dumpdevice can be configured assuming it was setup.
With kmdb loaded, the panic will drop you to kmdb prompt
and we can do a little bit of analysis there.

This doc tells you how to boot into kmdb:
http://docs.sun.com/app/docs/doc/819-2379/fvzpl

Once you enter into kmdb run "::stack" and send us the output.
This will give the call stack for the thread that panic'd.

To get out of kmdb you can use "::cont" or "$q"

> I gave the -x parameter correctly but again it went back to panic state. I can get into failsafe mode.
> I went into fail safe mode and tried configuring the dump device but it shows cannot use that directory
> for dump. Also I cannot see the swap partition from failsafe mode.

You have a swap partition configured and I am assuming you
had configured a savecore directory. If a core dump was possible,
that is the boot had reached the crach dump configuration stage,
you would have seen a message about trying to take core dump
after the panic. Do you see any such message?

Send the output of dumpadm command by itself. Since you have
a swap partition core dump should be enabled by default. IIRC, in
failsafe mode you would have to manually mount swap.

cheers
Binu

On Thu, Jun 19, 2008 at 9:49 PM, Abhilash
<abhilash_05cse01 at students dot amrita dot ac dot in> wrote:
> Hi,
>
> I gave the -x parameter correctly but again it went back to panic state. I can get into failsafe mode. I went into fail safe mode and tried configuring the dump device but it shows cannot use that directory for dump. Also I cannot see the swap partition from failsafe mode.
>
> Is there anything that can be done? I am only learning so kindly bear with me.
>
> Abhilash
> --
>
> This message posted from opensolaris.org
>
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
>
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 20, 2008 11:32 PM   in response to: binujp
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

The exact string is

panic[cpu0]/thread=fffffffffbc245a0: BAD TRAP: type=d
(#gp General Protection) rp=fffffffffbc469d0

addr=fffffffffb900598
dump aborted


[b]I have installed the cluster on my Sony VAIO VGN-CR35G laptop. I tried to boot into the kernel debugger mode I got the Welcome message. But I am not able to type anything in.[/b]
The output is

Welcome to kmdb
Loaded Modules: [unix krtld genunix ]

But after this I am not able to type in anything. I tired different combinations to send a break, I tried the F1-A key combination but I cannot type in anything.



The output of dumpadmin is(in failsafe mode)

Dump Content: kernel pages
Dump Device: none(dumps disabled)
Save Core Directory: /var/crash/
Save Core enabled: yes

when i try to do,
dumpadm -d swap

It shows

dumpadm:no swap device are available

when I give

mount swap I get

mount: /tmp is already mounted or swap is busy

I am not able to attach a photo of the panic. I will mail it to you if you give me your mail id.

regards
Abhilash

binujp

Posts: 51
From: Menlo Park

Registered: 6/1/08
Re: Error installing cluster
Posted: Jun 21, 2008 10:25 PM   in response to: abhi-1

  Click to reply to this thread Reply

On Fri, Jun 20, 2008 at 11:32 PM, Abhilash
<abhilash_05cse01 at students dot amrita dot ac dot in> wrote:
> The exact string is
>
> panic[cpu0]/thread=fffffffffbc245a0: BAD TRAP: type=d
> (#gp General Protection) rp=fffffffffbc469d0
>
> addr=fffffffffb900598
> dump aborted

Not of much use, the panic didn't even try to dump core or
locate a dump device. We are quite early in the boot as
indicated by lack of attempt to sync file systems.

> [b]I have installed the cluster on my Sony VAIO VGN-CR35G laptop. I tried to boot into the kernel debugger mode I got the Welcome message. But I am not able to type anything in.[/b]

When you booted with kmdb in cluster mode the system still
panicked but instead of getting a kmdb prompt the system hung.
Did I get that right?

> The output of dumpadmin is
>
> Dump Content: kernel pages
> Dump Device: none(dumps disabled)
> Save Core Directory: /var/crash/
> Save Core enabled: yes
>
> when i try to do,
> dumpadm -d swap
>
> It shows
>
> dumpadm:no swap device are available
>
> when I give
>
> mount swap I get
>
> mount: /tmp is already mounted or swap is busy

This will most probably amount to nothing. Why don't you remove
the dedicated swap temporarily and add the swap device as a
dedicated dump device.

Comment out the "swap ..." line from /etc/vfstab and
use dumpadm to try and add that partition as a dedicated
dump device.

> Welcome to kmdb
> Loaded Modules: [unix krtld genunix ]
>
> But after this I am not able to type in anything. I tired different combinations to send a break, I tried the F1-A key combination but I cannot type in anything.

Before installing cluster everything was working. After installing
cluster you are not able to boot into non-cluster mode or even
single user mode. With kmdb, the panic happened, but everything
hung after entry into debugger, ie. the hang after "Welcome to
kmdb" message.

I am assuming some cluster module which is loaded even on
non-cluster boot is causing the problem. We can manually
remove suspect modules and confirm.

Abhilash, I will get back to you after checking what cluster
modules are loaded in non-cluster boot. If someone in the list
has any suggestions please let us know.

> I am not able to attach a photo of the panic. I will mail it to you if you give me your mail id.

That is fine, given what you already told this will not help.

cheers
Binu


> I have taken a photograph of the screen and attached it.
> --
>
> This message posted from opensolaris.org
>
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
>
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 22, 2008 2:17 AM   in response to: binujp
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

From the var/adm/messages

Jun 20 21:55:39 opensolaris genunix: [ID 540533 kern.notice] ^MSunOS Release 5.11 Version snv_86 64-bit
Jun 20 21:55:39 opensolaris genunix: [ID 172908 kern.notice] Copyright 1983-2008 Sun Microsystems, Inc. All rights reserved.
Jun 20 21:55:39 opensolaris Use is subject to license terms.
..
Jun 20 21:55:39 opensolaris cl_bootstrap: [ID 569010 kern.notice] NOTICE: Can't open /etc/cluster/nodeid
Jun 20 21:55:39 opensolaris cl_bootstrap: [ID 756010 kern.notice] NOTICE: BOOTING IN NON CLUSTER MODE

...
...
Jun 20 21:55:54 opensolaris genunix: [ID 454863 kern.info] dump on /dev/dsk/c0d0s1 size 8001 MB

...
Jun 20 22:02:32 opensolaris inetd[598]: [ID 702911 daemon.warning] inetd_offline method for instance svc:/network/rpc/scrcmd:default is unspecified. Taking default action: kill.

1. Can you boot into single user mode or safe mode and see if the file /etc/cluster/nodeid exists, if so send its contents ( cat /etc/cluster/nodeid )

2. Seems like some sort of dump did get saved on /dev/dsk/c0d0s1 size 8001 MB
How much RAM do you have. Give the o/p of dumpadm, wantto see what options you have configured for dumping core.


3. Is you NWAM disabled and your network/physical enabled ? (although I guess this has nothing to do with your problem)

svcadm disable nwam
svcadm enable /network/physical:default

Thanks,
Tirthankar

http://blogs.sun.com/tirthankar

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 22, 2008 5:09 AM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi,

> 1. Can you boot into single user mode or safe mode
> and see if the file /etc/cluster/nodeid exists, if so
> send its contents ( cat /etc/cluster/nodeid )

The output of cat /etc/cluster/nodeid is

1

This o/p i got from failsafe mode.

My file system was mounted on /a by default. So I got this o/p frm

/a/etc/cluster/nodeid

> 2. Seems like some sort of dump did get saved on
> /dev/dsk/c0d0s1 size 8001 MB
> How much RAM do you have. Give the o/p of dumpadm,
> wantto see what options you have configured for
> dumping core.

I have a 3 GB RAM Laptop.

The o/p of dumpadm is

Dump Content: kernel pages
Dump Device: none(dumps disabled)
Save Core Directory: /var/crash/
Save Core enabled: yes

> 3. Is you NWAM disabled and your network/physical
> enabled ? (although I guess this has nothing to do
> with your problem)
>
> svcadm disable nwam
> svcadm enable /network/physical:default
>
NWAM is disabled and /network/physical is enabled.

regards
Abhilash

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 22, 2008 5:47 AM   in response to: abhi-1
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi Abhilash,

Your nodeid looks fine. Your dumpdevice is still not setup

Do the following steps
1.Boot in single user mode. You need to pass -s to boot. Let me know if you do not know how to do this.

2. Then set the dump device.
Say the hostname of your machine "abc"
Create a directory named /var/crash/abc

3. From the var/adm/messages you have provided, /dev/dsk/c0d0s1 looks like your swap device. You can verify this by doing a cat /etc/vfstab

# cat /etc/vfstab | grep swap
/dev/dsk/c0t0d0s1 - - swap - no -

4. Execute the following command
# dumpadm -d /dev/dsk/c0d0s1 -c kernel -s /var/crash/abc

5. Now execute the command
# dumpadm

the o/p should look like
# dumpadm
Dump content: kernel pages
Dump device: /dev/dsk/c0t0d0s1 (swap)
Savecore directory: /var/crash/abc
Savecore enabled: yes

if you do not see an o/p like this means something is worng
Now reboot the machine and see if you can save the dump or not.

Note:
If you are unable to boot in single user mode (step 1), then boot in failsafe mode and open the file /a/etc/dumpadm.conf and set the parameters like below


# cat /etc/dumpadm.conf


#
# dumpadm.conf
#
# Configuration parameters for system crash dump.
# Do NOT edit this file by hand -- use dumpadm(1m) instead.
#
DUMPADM_DEVICE=/dev/dsk/c0t0d0s1
DUMPADM_SAVDIR=/var/crash/abc
DUMPADM_CONTENT=kernel
DUMPADM_ENABLE=yes
pocho5 @ / $


Let me know the outcome.

Thanks,
Tirthankar

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 22, 2008 5:49 AM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

dumpadm.conf should look like (last mail had a pocho5 in the last line, it should not be there)

# cat /etc/dumpadm.conf


#
# dumpadm.conf
#
# Configuration parameters for system crash dump.
# Do NOT edit this file by hand -- use dumpadm(1m) instead.
#
DUMPADM_DEVICE=/dev/dsk/c0t0d0s1
DUMPADM_SAVDIR=/var/crash/ abc
DUMPADM_CONTENT=kernel
DUMPADM_ENABLE=yes



Thanks,
Tirthankar

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 22, 2008 7:53 AM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Some more corrections
dumpadm.conf should look like (last mail had a pocho5 in the last line, it should not be there , also a space was present before abc)


# cat /etc/dumpadm.conf


#
# dumpadm.conf
#
# Configuration parameters for system crash dump.
# Do NOT edit this file by hand -- use dumpadm(1m) instead.
#
DUMPADM_DEVICE=/dev/dsk/c0t0d0s1
DUMPADM_SAVDIR=/var/crash/abc
DUMPADM_CONTENT=kernel
DUMPADM_ENABLE=yes


In your reply you say that the network/physical is disabled. you need to enable it
Run the following command
# svcadm enable /network/physical:default

Thanks,
Tirthankar

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 22, 2008 8:28 AM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi,

I could not boot in single user mode so I took the second option.

I edited the file dumpadm.conf and gave the contents as specified. It was already the same as given by you. Except for

DUMPADM_DEVICE=/dev/dsk/c0t0d0s1 //instead of c0t0d0s1 it was c0d0s1

But I gave it as instructed by you. then I gave the comand dumpadm

but it again showed dumps disabled. But I rebooted the machine this time normally

one thing I noticed was a new message along with the panic string there was a message like this

entering debugger(continue to reboot)


> Note:
> If you are unable to boot in single user mode (step
> 1), then boot in failsafe mode and open the file
> /a/etc/dumpadm.conf and set the parameters like
> below
>
>
> # cat /etc/dumpadm.conf
>
>
> #
> # dumpadm.conf
> #
> # Configuration parameters for system crash dump.
> # Do NOT edit this file by hand -- use dumpadm(1m)
> instead.
> #
> DUMPADM_DEVICE=/dev/dsk/c0t0d0s1
> DUMPADM_SAVDIR=/var/crash/opensolaris //I edited this
> DUMPADM_CONTENT=kernel
> DUMPADM_ENABLE=yes

>
>
> Let me know the outcome.


regards
Abhilash

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 22, 2008 8:40 AM   in response to: abhi-1
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

So now boot in failsafe mode and see if the dump got saved ?

Also, any particular reason why you are not able to boot in single user mode ?



Thanks,
Tirthankar

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 22, 2008 11:51 PM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi,


I edited the dumpadm.conf as said. But on reboot the dump is not getting saved. I dont know why I am not able to log into single user mode.

Before that the system panics.

regards
Abhilash

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 23, 2008 9:10 AM   in response to: tirth
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi,

This is just for an information

I saw one post having the same problem as mine. I deleted the file

/etc/cluster/nodeid and i could boot into non cluster mode....


regards

Abhilash

binujp

Posts: 51
From: Menlo Park

Registered: 6/1/08
Re: Error installing cluster
Posted: Jun 23, 2008 11:47 AM   in response to: abhi-1

  Click to reply to this thread Reply

On Mon, Jun 23, 2008 at 9:10 AM, Abhilash
<abhilash_05cse01 at students dot amrita dot ac dot in> wrote:
> Hi,
>
> This is just for an information
>
> I saw one post having the same problem as mine. I deleted the file

Could you post a link to the post?

> /etc/cluster/nodeid and i could boot into non cluster mode....

Now that makes sense.

cl_bootstrap is the first module to be loaded if a machine is part of cluster.
http://src.opensolaris.org/source/xref/onnv/onnv-gate/usr/src/uts/intel/ia32/ml/modstubs.s#1079

If nodeid file cannot be opened then cluster modules will not be loaded.
http://src.opensolaris.org/source/xref/ohac/ohac/usr/src/common/cl/cl_bootstrap/dc_boot.c
http://src.opensolaris.org/source/xref/ohac/ohac/usr/src/common/cl/cl_bootstrap/clconf_boot.cc#41

Whereas with a "-x" non-cluster mode boot, all of the modules to be
loaded prior
to mounting root will be loaded. It is only after system has entered
single user SMD
milestone that cl_boot_check does it's stuff.

http://src.opensolaris.org/source/xref/ohac/ohac/usr/src/cmd/initpkg/manifest/cl_boot_check.xml
and
http://src.opensolaris.org/source/xref/ohac/ohac/usr/src/cmd/initpkg/svcmethod/svc_boot_check

The core problem, panic on boot, is still unsolved. Do you intend to chase
that further?

cheers
Binu

> regards
>
> Abhilash
> --
>
> This message posted from opensolaris.org
>
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
>
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 23, 2008 9:13 PM   in response to: binujp
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Hi,

As I had told I saw a post similar to mine s, I just thought of trying that solution. And I got it!!

I had clusterexpress20080213-x86[0] with me and I was trying to install it on snv_86. Yesterday I downloaded clusterexpress20080522-x86[0] and I think I got it. I got the foll output on reboot

# /usr/cluster/bin/cluster status

=== Cluster Nodes ===

--- Node Status ---

Node Name Status
--------- ------
opensolaris Online


=== Cluster Transport Paths ===

Endpoint1 Endpoint2 Status
--------- --------- ------


=== Cluster Quorum ===

--- Quorum Votes Summary ---

Needed Present Possible
------ ------- --------
1 1 1


--- Quorum Votes by Node ---

Node Name Present Possible Status
--------- ------- -------- ------
opensolaris 1 1 Online


=== Cluster Device Groups ===

--- Device Group Status ---

Device Group Name Primary Secondary Status
----------------- ------- --------- ------


--- Spare, Inactive, and In Transition Nodes ---

Device Group Name Spare Nodes Inactive Nodes In Transistion Nodes
----------------- ----------- -------------- --------------------


--- Multi-owner Device Group Status ---

Device Group Name Node Name Status
----------------- --------- ------

=== Cluster Resource Groups ===

Group Name Node Name Suspended State
---------- --------- --------- -----

=== Cluster Resources ===

Resource Name Node Name State Status Message
------------- --------- ----- --------------

=== Cluster DID Devices ===

Device Instance Node Status
--------------- ---- ------
/dev/did/rdsk/d2 opensolaris Ok



Does this mean that my cluster has been setup? I suppose so!!



Thank you very for helping me out this much! In the process I could learn a lot on what to do if a panic occurs. Now I can help my friends if such a problem occurs :) to them

Thank you Binu Sir, and Thirthankar sir for their help.. :)

Again I will be bugging you guys as I want to learn further :)

Thank you
Regards
Abhilash

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 23, 2008 10:34 PM   in response to: abhi-1
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

Oh cool. Finally you are up and running :)

Cheers,
Tirthankar

tirth

Posts: 154
From: IN

Registered: 6/30/06
Re: Error installing cluster
Posted: Jun 23, 2008 10:33 PM   in response to: abhi-1
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

What version of Solaris and what version of cluster are you using ?

Thanks,
Tirthankar

abhi-1

Posts: 46
From:

Registered: 12/17/07
Re: Error installing cluster
Posted: Jun 22, 2008 5:13 AM   in response to: binujp
To: Communities » ha-clusters » discuss
  Click to reply to this thread Reply

hi,

> When you booted with kmdb in cluster mode the system
> still
> panicked but instead of getting a kmdb prompt the
> system hung.
> Did I get that right?

/***********/
Exactly
/***********/



> Comment out the "swap ..." line from /etc/vfstab and

> Before installing cluster everything was working.
> After installing
> cluster you are not able to boot into non-cluster
> mode or even
> single user mode. With kmdb, the panic happened, but
> everything
> hung after entry into debugger, ie. the hang after
> "Welcome to
> kmdb" message.

This is what is happening exactly


> I am assuming some cluster module which is loaded
> even on
> non-cluster boot is causing the problem. We can
> manually
> remove suspect modules and confirm.
>
> Abhilash, I will get back to you after checking what
> cluster
> modules are loaded in non-cluster boot. If someone in
> the list
> has any suggestions please let us know.
>


thank you


regards
Abhilash




Terms of Use | Privacy | Trademarks | Copyright Policy | Site Guidelines
Your use of this web site or any of its content or software indicates your agreement to be bound by these Terms of Use.
Copyright © 1995-2005 Sun Microsystems, Inc.