OpenSolaris

Discussions Communities Projects Download Source Browser

Home » OpenSolaris Forums » ha-clusters » discuss

Thread: [ha-clusters-discuss] Failover LDoms agent for Guest domain

Welcome, Guest Help
Login Login
Guest Settings Guest Settings
Reply to this Thread Reply to this Thread Search Forum Search Forum Back to Thread List Back to Thread List

Permlink Replies: 5 - Last Post: Dec 10, 2008 6:56 AM by: timread Threads: [ Previous | Next ]
hnamachi

Posts: 38
From:

Registered: 9/6/07
[ha-clusters-discuss] Failover LDoms agent for Guest domain
Posted: Dec 5, 2008 2:44 AM

  Click to reply to this thread Reply

Hi OHAC members,

I would like to introduce the "Failover LDoms agent for Guest domain"
and as an addendum for the HA-xVM agent developed in OHAC release. So we
are not talking about developing a new agent, rather focus on extending
the existing HA-xVM Agent.

Refer to HA-xVM agent at http://opensolaris.org/os/project/ha-xvm/

Please find the information below describing the project

1) Purpose

The purpose of this project is to enhance the (HA-xVM) to support
failover of LDom Guest domains between two or more OHAC nodes.

2) Project Team

- HA-xVM team

3) Project Description

Enhancing the HA-xVM agent would require to support the below
mentioned bullets for a LDom Guest domain on SPARC machine.

* Manage the start/stop and restart of LDoms Guest domains.
* Failover a Ldoms Guest domain between cluster nodes.
* Allow for ++ve and --ve affinities between LDoms Guest domains
across cluster.
* Allow for different failover techniques, i.e. Stop/Failover/Start
or Live Migration.

-Thanks and Regards
Hemachandran
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


mwalburn

Posts: 20
From: Minneapolis, MN

Registered: 9/22/06
Re: [ha-clusters-discuss] Failover LDoms agent for Guest domain
Posted: Dec 5, 2008 4:50 AM   in response to: hnamachi

  Click to reply to this thread Reply

This is excellent news! Since this would leverage the existing agent,
does that imply that one will be to leverage SRDF or TrueCopy for
campus/geo type failovers of LDOMs?

Do you plan to support secondary I/O domains?

Thanks for starting this, let me know if I can help in testing.

-M

--
matt walburn
http://mattwalburn.com

On Dec 5, 2008, at 4:44 AM, Hemachandran Namachivayam <Hemachandran dot Namachivayam at Sun dot COM
> wrote:

> Hi OHAC members,
>
> I would like to introduce the "Failover LDoms agent for Guest domain"
> and as an addendum for the HA-xVM agent developed in OHAC release.
> So we
> are not talking about developing a new agent, rather focus on
> extending
> the existing HA-xVM Agent.
>
> Refer to HA-xVM agent at http://opensolaris.org/os/project/ha-xvm/
>
> Please find the information below describing the project
>
> 1) Purpose
>
> The purpose of this project is to enhance the (HA-xVM) to support
> failover of LDom Guest domains between two or more OHAC nodes.
>
> 2) Project Team
>
> - HA-xVM team
>
> 3) Project Description
>
> Enhancing the HA-xVM agent would require to support the below
> mentioned bullets for a LDom Guest domain on SPARC machine.
>
> * Manage the start/stop and restart of LDoms Guest domains.
> * Failover a Ldoms Guest domain between cluster nodes.
> * Allow for ++ve and --ve affinities between LDoms Guest domains
> across cluster.
> * Allow for different failover techniques, i.e. Stop/Failover/
> Start
> or Live Migration.
>
> -Thanks and Regards
> Hemachandran
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


hs86490

Posts: 24
From: DE

Registered: 10/29/07
Re: [ha-clusters-discuss] Failover LDoms agent for Guest domain
Posted: Dec 8, 2008 1:49 AM   in response to: mwalburn

  Click to reply to this thread Reply

Hi Matt,

On 12/05/08 13:50, Matt Walburn wrote:
> This is excellent news! Since this would leverage the existing agent,
> does that imply that one will be to leverage SRDF or TrueCopy for
> campus/geo type failovers of LDOMs?
>
I am not sure whether you want to use SRDF/TC within a cluster or
between clusters. With SC Geo this should be straight forward also not
tested yet.
With campus cluster, once this agent is available it should be a matter
of testing.
> Do you plan to support secondary I/O domains?
>
It is my understanding that a secondary I/O domain is already supported.
At least one of the configuration diagrams in the suncluster wiki shows
a config with 2 I/O domains and volume mirroring and IPMP setup between
these IO domains.

Regards
Hartmut
> Thanks for starting this, let me know if I can help in testing.
>
> -M
>
> --
> matt walburn
> http://mattwalburn.com
>
> On Dec 5, 2008, at 4:44 AM, Hemachandran Namachivayam <Hemachandran dot Namachivayam at Sun dot COM
> > wrote:
>
>
>> Hi OHAC members,
>>
>> I would like to introduce the "Failover LDoms agent for Guest domain"
>> and as an addendum for the HA-xVM agent developed in OHAC release.
>> So we
>> are not talking about developing a new agent, rather focus on
>> extending
>> the existing HA-xVM Agent.
>>
>> Refer to HA-xVM agent at http://opensolaris.org/os/project/ha-xvm/
>>
>> Please find the information below describing the project
>>
>> 1) Purpose
>>
>> The purpose of this project is to enhance the (HA-xVM) to support
>> failover of LDom Guest domains between two or more OHAC nodes.
>>
>> 2) Project Team
>>
>> - HA-xVM team
>>
>> 3) Project Description
>>
>> Enhancing the HA-xVM agent would require to support the below
>> mentioned bullets for a LDom Guest domain on SPARC machine.
>>
>> * Manage the start/stop and restart of LDoms Guest domains.
>> * Failover a Ldoms Guest domain between cluster nodes.
>> * Allow for ++ve and --ve affinities between LDoms Guest domains
>> across cluster.
>> * Allow for different failover techniques, i.e. Stop/Failover/
>> Start
>> or Live Migration.
>>
>> -Thanks and Regards
>> Hemachandran
>> _______________________________________________
>> ha-clusters-discuss mailing list
>> ha-clusters-discuss at opensolaris dot org
>> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
>>
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
>

--
Sun Microsystems GmbH Hartmut Streppel
Sonnenallee 1 Systems Practice
D-85551 Kirchheim-Heimstetten Phone: +49 (0)89 46008 2563
Germany Mobile: +49 (0)172 8919711
http://www.sun.de FAX: +49 (0)89 46008 2572
mailto: hartmut dot streppel at sun dot com
Sun Microsystems GmbH, Sonnenallee 1, D-85551 Kirchheim-Heimstetten
Amtsgericht München: HRB 161028
Geschäftsführer: Thomas Schröder, Wolfgang Engels, Dr. Roland Bömer
Vorsitzender des Aufsichtsrates: Martin Häring



_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


mwalburn

Posts: 20
From: Minneapolis, MN

Registered: 9/22/06
Re: [ha-clusters-discuss] Failover LDoms agent for Guest domain
Posted: Dec 9, 2008 6:40 AM   in response to: hs86490

  Click to reply to this thread Reply

Basically, what we want to be able to do is:

1. Create a multi-node campus cluster (at least 8 sun4v nodes) where half of the nodes are in one site, with the other half in the second site. Assume that both sites are active at all times.
2. Within this cluster I would have Logical Domains that have their root filesystems provisioned onto replicated LUNs (EMC or HDS via SRDF or TC).
3. Additionally, realizing that this functionalty won't be there until LDOMs 1.1, I would want those replicated LUNs to have multiple SAN paths, with primary and secondary I/O domains each serving up at least one path to the LDOM, with the LDOMs' MPxIO aggregating those paths into one device.

My assumption is, that Sun Cluster would be running in the primary I/O-Control domain in order to facilitate the failover of the entire LDOM, managing the storage replication commands in the process. This would give us the ability to failover LDOMs between sites.

We may or may not then want to run SC _inside_  the LDOMs for failing over services between running LDOMs.

Hopefully this makes sense!

Thanks again,
Matthew
--
Matt Walburn
http://mattwalburn.com


On Mon, Dec 8, 2008 at 3:49 AM, Hartmut Streppel <Hartmut dot Streppel at sun dot com> wrote:
Hi Matt,


On 12/05/08 13:50, Matt Walburn wrote:
This is excellent news! Since this would leverage the existing agent,  does that imply that one will be to leverage SRDF or TrueCopy for  campus/geo type failovers of LDOMs?
 
I am not sure whether you want to use SRDF/TC within a cluster or between clusters. With SC Geo this should be straight forward also not tested yet.
With campus cluster, once this agent is available it should be a matter of testing.

Do you plan to support secondary I/O domains?
 
It is my understanding that a secondary I/O domain is already supported. At least one of the configuration diagrams in the suncluster wiki shows a config with 2 I/O domains and volume mirroring and IPMP setup between these IO domains.

Regards
  Hartmut

Thanks for starting this, let me know if I can help in testing.

-M

--
matt walburn
http://mattwalburn.com

On Dec 5, 2008, at 4:44 AM, Hemachandran Namachivayam <Hemachandran dot Namachivayam at Sun dot COM  > wrote:

 
Hi OHAC members,

I would like to introduce the "Failover LDoms agent for Guest domain"
and as an addendum for the HA-xVM agent developed in OHAC release.  So we
are not talking about developing a new agent, rather focus on  extending
the existing HA-xVM Agent.

Refer to HA-xVM agent at http://opensolaris.org/os/project/ha-xvm/

Please find the information below describing the project

1) Purpose

  The purpose of this project is to enhance the (HA-xVM) to support
failover of LDom Guest domains between two or more OHAC nodes.

2) Project Team

  - HA-xVM team

3) Project Description

  Enhancing the HA-xVM agent would require to support the below
mentioned bullets for a LDom Guest domain on SPARC machine.

   * Manage the start/stop and restart of LDoms Guest domains.
   * Failover a Ldoms Guest domain between cluster nodes.
   * Allow for ++ve and --ve affinities between LDoms Guest domains
across cluster.
   * Allow for different failover techniques, i.e. Stop/Failover/ Start
or Live Migration.

-Thanks and Regards
Hemachandran
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
   
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
 

--
Sun Microsystems GmbH           Hartmut Streppel                
Sonnenallee 1                   Systems Practice
D-85551 Kirchheim-Heimstetten   Phone:  +49 (0)89 46008 2563 Germany                         Mobile: +49 (0)172 8919711      
http://www.sun.de               FAX:    +49 (0)89 46008 2572
mailto: hartmut dot streppel at sun dot com
Sun Microsystems GmbH, Sonnenallee 1, D-85551 Kirchheim-Heimstetten
Amtsgericht München: HRB 161028
Geschäftsführer: Thomas Schröder, Wolfgang Engels, Dr. Roland Bömer
Vorsitzender des Aufsichtsrates: Martin Häring




_______________________________________________ ha-clusters-discuss mailing list ha-clusters-discuss at opensolaris dot org http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


mwalburn

Posts: 20
From: Minneapolis, MN

Registered: 9/22/06
Re: [ha-clusters-discuss] Failover LDoms agent for Guest domain
Posted: Dec 9, 2008 6:43 AM   in response to: mwalburn

  Click to reply to this thread Reply


--
Matt Walburn
http://mattwalburn.com


On Tue, Dec 9, 2008 at 8:40 AM, Matt Walburn <matt at railwave dot com> wrote:
Basically, what we want to be able to do is:

1. Create a multi-node campus cluster (at least 8 sun4v nodes) where half of the nodes are in one site, with the other half in the second site. Assume that both sites are active at all times.
2. Within this cluster I would have Logical Domains that have their root filesystems provisioned onto replicated LUNs (EMC or HDS via SRDF or TC).
3. Additionally, realizing that this functionalty won't be there until LDOMs 1.1, I would want those replicated LUNs to have multiple SAN paths, with primary and secondary I/O domains each serving up at least one path to the LDOM, with the LDOMs' MPxIO aggregating those paths into one device.

My assumption is, that Sun Cluster would be running in the primary I/O-Control domain in order to facilitate the failover of the entire LDOM, managing the storage replication commands in the process. This would give us the ability to failover LDOMs between sites.

We may or may not then want to run SC _inside_  the LDOMs for failing over services between running LDOMs.

Hopefully this makes sense!

Thanks again,
Matthew
--
Matt Walburn
http://mattwalburn.com



On Mon, Dec 8, 2008 at 3:49 AM, Hartmut Streppel <Hartmut dot Streppel at sun dot com> wrote:
Hi Matt,


On 12/05/08 13:50, Matt Walburn wrote:
This is excellent news! Since this would leverage the existing agent,  does that imply that one will be to leverage SRDF or TrueCopy for  campus/geo type failovers of LDOMs?
 
I am not sure whether you want to use SRDF/TC within a cluster or between clusters. With SC Geo this should be straight forward also not tested yet.
With campus cluster, once this agent is available it should be a matter of testing.

Do you plan to support secondary I/O domains?
 
It is my understanding that a secondary I/O domain is already supported. At least one of the configuration diagrams in the suncluster wiki shows a config with 2 I/O domains and volume mirroring and IPMP setup between these IO domains.

Regards
  Hartmut

Thanks for starting this, let me know if I can help in testing.

-M

--
matt walburn
http://mattwalburn.com

On Dec 5, 2008, at 4:44 AM, Hemachandran Namachivayam <Hemachandran dot Namachivayam at Sun dot COM  > wrote:

 
Hi OHAC members,

I would like to introduce the "Failover LDoms agent for Guest domain"
and as an addendum for the HA-xVM agent developed in OHAC release.  So we
are not talking about developing a new agent, rather focus on  extending
the existing HA-xVM Agent.

Refer to HA-xVM agent at http://opensolaris.org/os/project/ha-xvm/

Please find the information below describing the project

1) Purpose

  The purpose of this project is to enhance the (HA-xVM) to support
failover of LDom Guest domains between two or more OHAC nodes.

2) Project Team

  - HA-xVM team

3) Project Description

  Enhancing the HA-xVM agent would require to support the below
mentioned bullets for a LDom Guest domain on SPARC machine.

   * Manage the start/stop and restart of LDoms Guest domains.
   * Failover a Ldoms Guest domain between cluster nodes.
   * Allow for ++ve and --ve affinities between LDoms Guest domains
across cluster.
   * Allow for different failover techniques, i.e. Stop/Failover/ Start
or Live Migration.

-Thanks and Regards
Hemachandran
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
   
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
 

--
Sun Microsystems GmbH           Hartmut Streppel                
Sonnenallee 1                   Systems Practice
D-85551 Kirchheim-Heimstetten   Phone:  +49 (0)89 46008 2563 Germany                         Mobile: +49 (0)172 8919711      
http://www.sun.de               FAX:    +49 (0)89 46008 2572
mailto: hartmut dot streppel at sun dot com
Sun Microsystems GmbH, Sonnenallee 1, D-85551 Kirchheim-Heimstetten
Amtsgericht München: HRB 161028
Geschäftsführer: Thomas Schröder, Wolfgang Engels, Dr. Roland Bömer
Vorsitzender des Aufsichtsrates: Martin Häring





_______________________________________________ ha-clusters-discuss mailing list ha-clusters-discuss at opensolaris dot org http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss


timread

Posts: 83
From: GB

Registered: 6/19/06
Re: [ha-clusters-discuss] Failover LDoms agent for Guest domain
Posted: Dec 10, 2008 6:56 AM   in response to: mwalburn

  Click to reply to this thread Reply

Matt,

My first concern would be the separation of the nodes in this
environment. I/O performance is degraded mainly by the latency between
the sites although bandwidth may also pose an issue. (See my
"Architecting High Availability and Disaster Recovery" Blueprint on the
Sun Blueprint wiki.)

Then remember that I/O from within the domain will be degraded to some
degree by the intervening layer of LDom s/w. Thus you may find that
performance is unacceptable at a smaller distance than simply latency
might impose.

Running Solaris Cluster at both levels sounds weird and I'm not sure it
has ever been tried or whether it would be supported. I think we'd need
to understand more about the sort of applications you intended to run in
this configuration.

Regards,

Tim
---


On 12/09/08 14:40, Matt Walburn wrote:
> Basically, what we want to be able to do is:
>
> 1. Create a multi-node campus cluster (at least 8 sun4v nodes) where
> half of the nodes are in one site, with the other half in the second
> site. Assume that both sites are active at all times.
> 2. Within this cluster I would have Logical Domains that have their root
> filesystems provisioned onto replicated LUNs (EMC or HDS via SRDF or TC).
> 3. Additionally, realizing that this functionalty won't be there until
> LDOMs 1.1, I would want those replicated LUNs to have multiple SAN
> paths, with primary and secondary I/O domains each serving up at least
> one path to the LDOM, with the LDOMs' MPxIO aggregating those paths into
> one device.
>
> My assumption is, that Sun Cluster would be running in the primary
> I/O-Control domain in order to facilitate the failover of the entire
> LDOM, managing the storage replication commands in the process. This
> would give us the ability to failover LDOMs between sites.
>
> We may or may not then want to run SC _inside_ the LDOMs for failing
> over services between running LDOMs.
>
> Hopefully this makes sense!
>
> Thanks again,
> Matthew
> --
> Matt Walburn
> http://mattwalburn.com
>
>
> On Mon, Dec 8, 2008 at 3:49 AM, Hartmut Streppel
> <Hartmut dot Streppel at sun dot com <mailto:Hartmut dot Streppel at sun dot com>> wrote:
>
> Hi Matt,
>
>
> On 12/05/08 13:50, Matt Walburn wrote:
>
> This is excellent news! Since this would leverage the existing
> agent, does that imply that one will be to leverage SRDF or
> TrueCopy for campus/geo type failovers of LDOMs?
>
>
> I am not sure whether you want to use SRDF/TC within a cluster or
> between clusters. With SC Geo this should be straight forward also
> not tested yet.
> With campus cluster, once this agent is available it should be a
> matter of testing.
>
> Do you plan to support secondary I/O domains?
>
>
> It is my understanding that a secondary I/O domain is already
> supported. At least one of the configuration diagrams in the
> suncluster wiki shows a config with 2 I/O domains and volume
> mirroring and IPMP setup between these IO domains.
>
> Regards
> Hartmut
>
> Thanks for starting this, let me know if I can help in testing.
>
> -M
>
> --
> matt walburn
> http://mattwalburn.com
>
> On Dec 5, 2008, at 4:44 AM, Hemachandran Namachivayam
> <Hemachandran dot Namachivayam at Sun dot COM > wrote:
>
>
>
> Hi OHAC members,
>
> I would like to introduce the "Failover LDoms agent for
> Guest domain"
> and as an addendum for the HA-xVM agent developed in OHAC
> release. So we
> are not talking about developing a new agent, rather focus
> on extending
> the existing HA-xVM Agent.
>
> Refer to HA-xVM agent at
> http://opensolaris.org/os/project/ha-xvm/
>
> Please find the information below describing the project
>
> 1) Purpose
>
> The purpose of this project is to enhance the (HA-xVM) to
> support
> failover of LDom Guest domains between two or more OHAC nodes.
>
> 2) Project Team
>
> - HA-xVM team
>
> 3) Project Description
>
> Enhancing the HA-xVM agent would require to support the below
> mentioned bullets for a LDom Guest domain on SPARC machine.
>
> * Manage the start/stop and restart of LDoms Guest domains.
> * Failover a Ldoms Guest domain between cluster nodes.
> * Allow for ++ve and --ve affinities between LDoms Guest
> domains
> across cluster.
> * Allow for different failover techniques, i.e.
> Stop/Failover/ Start
> or Live Migration.
>
> -Thanks and Regards
> Hemachandran
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> <mailto:ha-clusters-discuss at opensolaris dot org>
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
>
>
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> <mailto:ha-clusters-discuss at opensolaris dot org>
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss
>
>
>
> --
> Sun Microsystems GmbH Hartmut Streppel
> Sonnenallee 1 Systems Practice
> D-85551 Kirchheim-Heimstetten Phone: +49 (0)89 46008 2563 Germany
> Mobile: +49 (0)172 8919711
> http://www.sun.de FAX: +49 (0)89 46008 2572
> mailto: hartmut dot streppel at sun dot com <mailto:hartmut dot streppel at sun dot com>
> Sun Microsystems GmbH, Sonnenallee 1, D-85551 Kirchheim-Heimstetten
> Amtsgericht München: HRB 161028
> Geschäftsführer: Thomas Schröder, Wolfgang Engels, Dr. Roland Bömer
> Vorsitzender des Aufsichtsrates: Martin Häring
>
>
>
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> ha-clusters-discuss mailing list
> ha-clusters-discuss at opensolaris dot org
> http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss

--

Tim Read
Staff Engineer
Solaris Availability Engineering
Sun Microsystems Ltd
Springfield
Linlithgow
EH49 7LR

Phone: +44 (0)1506 672 684
Mobile: +44 (0)7802 212 137

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

NOTICE: This email message is for the sole use of the intended
recipient(s) and may contain confidential and privileged information.
Any unauthorized review, use, disclosure or distribution is prohibited.
If you are not the intended recipient, please contact the sender by
reply email and destroy all copies of the original message.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
_______________________________________________
ha-clusters-discuss mailing list
ha-clusters-discuss at opensolaris dot org
http://mail.opensolaris.org/mailman/listinfo/ha-clusters-discuss





Terms of Use | Privacy | Trademarks | Copyright Policy | Site Guidelines
Your use of this web site or any of its content or software indicates your agreement to be bound by these Terms of Use.
Copyright © 1995-2005 Sun Microsystems, Inc.