OpenSolaris

Discussions Communities Projects Download Source Browser

Home » OpenSolaris Forums » fm » discuss

Thread: Sensor design doc now available

Welcome, Guest Help
Login Login
Guest Settings Guest Settings
Reply to this Thread Reply to this Thread Search Forum Search Forum Back to Thread List Back to Thread List

Permlink Replies: 9 - Last Post: Aug 8, 2008 2:56 PM by: eschrock
cynthia

Posts: 76
From:

Registered: 5/26/06
Sensor design doc now available
Posted: Jun 8, 2007 11:00 AM

  Click to reply to this thread Reply

Folks,

Phase I of the Sensor Abstraction Layer project has been posted to the
project page. See http://opensolaris.org/os/project/sensors for more
information.

Cindi
_______________________________________________
fm-discuss mailing list
fm-discuss at opensolaris dot org



eschrock

Posts: 804
From: Menlo Park, CA

Registered: 3/9/05
Re: Sensor design doc now available
Posted: Jun 8, 2007 10:48 PM   in response to: cynthia
To: Communities » fm » discuss
  Click to reply to this thread Reply

Cool stuff. I haven't finished going over it all, but I did notice one typo:

In section 2.4, in the bottom right table cell:

"indicates whether or not the alert cache should automatically to the resolved state..."

It also looks like section 4.1 could use some formatting ("Ambient air ...")?. I can't really follow this section.

- Eric

eschrock

Posts: 804
From: Menlo Park, CA

Registered: 3/9/05
Re: Sensor design doc now available
Posted: Jun 9, 2007 2:59 PM   in response to: cynthia
To: Communities » fm » discuss
  Click to reply to this thread Reply

Cindi -

I've had a chance to take a closer look at the design doc. It all looks
great, though I have a few high-level questions. These might belong as
"future phases", but it would help to have some explanation in the
design document to know that the issues are being considered.

How do indicators relate to this work? I see them mentioned when
disucssing the FMRI syntax, but nowhere else. Are indicators related to
this work beyond the FMRI syntax, or is it completely separate? Is
there a separate project to define the role of indicators in libtopo and
Solaris?

From an implementation perspective, it would help to outline how the
facility nodes would will in libtopo. How do I read sensor values from
the nodes? Do they have all the standard properties of a topo node,
minus the obvious restriction of not having any children?

Is there going to be any standardization on the properties exported by a
sensor node? I'm imagining what it would take to construct a generic
sensor browser. While most of this is outside the scope of phase 1, and
falls under objective #3 (ability to watch sensors), it seems like some
minimal design decisions need to made just to prototype phase 1 and
justify the architecture.

We want the ability to export arbitrary payloads via sensors, but it
seems like if we're not careful we can end up with a fragmented
namespace that makes monitoring difficult. Looking towards the future,
it would seem beneficial to define some common set of properties across
all sensors, even something as basic as "value", "units", and
(optionally) "threshold".

As a nit, does 'RTN' have to be capitalized? It seems like all the
events in the registry use lowercase names, and the all caps name looks
a little off. Although your fmdump example output uses lowercase, so
maybe this is just a case of inconsistency in the document.

Thanks,

- Eric

cynthia

Posts: 76
From:

Registered: 5/26/06
Re: Re: Sensor design doc now available
Posted: Jun 10, 2007 2:33 PM   in response to: eschrock

  Click to reply to this thread Reply



Eric Schrock wrote:
> Cindi -
>
> I've had a chance to take a closer look at the design doc. It all looks
> great, though I have a few high-level questions. These might belong as
> "future phases", but it would help to have some explanation in the
> design document to know that the issues are being considered.
>
> How do indicators relate to this work? I see them mentioned when
> disucssing the FMRI syntax, but nowhere else. Are indicators related to
> this work beyond the FMRI syntax, or is it completely separate? Is
> there a separate project to define the role of indicators in libtopo and
> Solaris?

Support for indicators are really orthogonal to this project but I have
included the definition as part of the facility definition as it makes
sense.

>
>>From an implementation perspective, it would help to outline how the
> facility nodes would will in libtopo. How do I read sensor values from
> the nodes? Do they have all the standard properties of a topo node,
> minus the obvious restriction of not having any children?

Implemtation-wise, facility (sensor and indicator) nodes are just leaf
topo nodes. We can attach properties and methods just like we do for
other types of topo nodes. I see the manipulation and access of
indicators happening via topo methods exported by (hopefully) standard
properties. In this way, the consumer of an indicator does not need to
know the implementation details of how we read or write an indicator.

This is similar to the sensor provider layer but can happen as soon as
we support facilities in libtopo rather than a future phase of this
project. Your point below applies here as well. We need to come up
with a standard name-type space for indicator props.

I have working prototype of topo facility nodes so we can start thinking
about this right away.

>
> Is there going to be any standardization on the properties exported by a
> sensor node? I'm imagining what it would take to construct a generic
> sensor browser. While most of this is outside the scope of phase 1, and
> falls under objective #3 (ability to watch sensors), it seems like some
> minimal design decisions need to made just to prototype phase 1 and
> justify the architecture.
>
> We want the ability to export arbitrary payloads via sensors, but it
> seems like if we're not careful we can end up with a fragmented
> namespace that makes monitoring difficult. Looking towards the future,
> it would seem beneficial to define some common set of properties across
> all sensors, even something as basic as "value", "units", and
> (optionally) "threshold".

Agreed and the sooner we start thinking about what we want to monitor
and how the sooner we can define the namespace. If you have any ideas,
I would like to hear them.

>
> As a nit, does 'RTN' have to be capitalized? It seems like all the
> events in the registry use lowercase names, and the all caps name looks
> a little off. Although your fmdump example output uses lowercase, so
> maybe this is just a case of inconsistency in the document.

We've gone back and forth on that. The fault manager tends to use caps
for acronyms (FRU, ASRU, etc...) so I tried to be consistent with that.
>
> Thanks,
>
> - Eric
> --
> This message posted from opensolaris.org
> _______________________________________________
> fm-discuss mailing list
> fm-discuss at opensolaris dot org
_______________________________________________
fm-discuss mailing list
fm-discuss at opensolaris dot org



gdamore

Posts: 1,364
From: US

Registered: 4/27/05
Re: Re: Sensor design doc now available
Posted: Jun 10, 2007 3:55 PM   in response to: cynthia

  Click to reply to this thread Reply

So we have sensors, and indicators. What about control points? E.g. I
want to be able to have a common framework for fans, etc. (That way I
can have a rule that says

if (component.temp() > component.temp.warn_threshold() &&
component.has_fan() && component.fan.get_speed() <
component.fan.maxspeed()) {
component.fan.increase_speed();
}

without knowing anything about the implementation of component, or fans.

Also, is this framework really just for system-health related items, or
should there be some overlap, e.g. with lid switches on laptops? (Which
are just another sensor, in implementation.)

-- Garrett

cindi wrote:
>
>
> Eric Schrock wrote:
>> Cindi -
>>
>> I've had a chance to take a closer look at the design doc. It all looks
>> great, though I have a few high-level questions. These might belong as
>> "future phases", but it would help to have some explanation in the
>> design document to know that the issues are being considered.
>>
>> How do indicators relate to this work? I see them mentioned when
>> disucssing the FMRI syntax, but nowhere else. Are indicators related to
>> this work beyond the FMRI syntax, or is it completely separate? Is
>> there a separate project to define the role of indicators in libtopo and
>> Solaris?
>
> Support for indicators are really orthogonal to this project but I
> have included the definition as part of the facility definition as it
> makes sense.
>
>>
>>> From an implementation perspective, it would help to outline how the
>> facility nodes would will in libtopo. How do I read sensor values from
>> the nodes? Do they have all the standard properties of a topo node,
>> minus the obvious restriction of not having any children?
>
> Implemtation-wise, facility (sensor and indicator) nodes are just leaf
> topo nodes. We can attach properties and methods just like we do for
> other types of topo nodes. I see the manipulation and access of
> indicators happening via topo methods exported by (hopefully) standard
> properties. In this way, the consumer of an indicator does not need
> to know the implementation details of how we read or write an indicator.
>
> This is similar to the sensor provider layer but can happen as soon as
> we support facilities in libtopo rather than a future phase of this
> project. Your point below applies here as well. We need to come up
> with a standard name-type space for indicator props.
>
> I have working prototype of topo facility nodes so we can start
> thinking about this right away.
>
>>
>> Is there going to be any standardization on the properties exported by a
>> sensor node? I'm imagining what it would take to construct a generic
>> sensor browser. While most of this is outside the scope of phase 1, and
>> falls under objective #3 (ability to watch sensors), it seems like some
>> minimal design decisions need to made just to prototype phase 1 and
>> justify the architecture.
>>
>> We want the ability to export arbitrary payloads via sensors, but it
>> seems like if we're not careful we can end up with a fragmented
>> namespace that makes monitoring difficult. Looking towards the future,
>> it would seem beneficial to define some common set of properties across
>> all sensors, even something as basic as "value", "units", and
>> (optionally) "threshold".
>
> Agreed and the sooner we start thinking about what we want to monitor
> and how the sooner we can define the namespace. If you have any
> ideas, I would like to hear them.
>
>>
>> As a nit, does 'RTN' have to be capitalized? It seems like all the
>> events in the registry use lowercase names, and the all caps name looks
>> a little off. Although your fmdump example output uses lowercase, so
>> maybe this is just a case of inconsistency in the document.
>
> We've gone back and forth on that. The fault manager tends to use
> caps for acronyms (FRU, ASRU, etc...) so I tried to be consistent with
> that.
>>
>> Thanks,
>>
>> - Eric
>> --
>> This message posted from opensolaris.org
>> _______________________________________________
>> fm-discuss mailing list
>> fm-discuss at opensolaris dot org
> _______________________________________________
> fm-discuss mailing list
> fm-discuss at opensolaris dot org

_______________________________________________
fm-discuss mailing list
fm-discuss at opensolaris dot org



cynthia

Posts: 76
From:

Registered: 5/26/06
Re: Re: Sensor design doc now available
Posted: Jun 10, 2007 5:51 PM   in response to: gdamore

  Click to reply to this thread Reply



Garrett D'Amore wrote:
> So we have sensors, and indicators. What about control points? E.g. I
> want to be able to have a common framework for fans, etc. (That way I
> can have a rule that says
>
> if (component.temp() > component.temp.warn_threshold() &&
> component.has_fan() && component.fan.get_speed() component.fan.maxspeed()) {
> component.fan.increase_speed();
> }
>
> without knowing anything about the implementation of component, or fans.

We could extend the facility type to include component associations like
fans. In that way, we can define:

hc://chassis=0/fan=0
hc://chassis=0/chip=1/cpu=0
hc://chassis=0/chip=1/cpu=0?control=hc://chassis=0/fan=0
hc://chassis=0/chip=1/cpu=0?sensor=temperature


With libtopo, you may export these FMRIs with properties and methods,
such that:

hc://chassis=0/fan=0
property list:
fan-maxspeed=100
fan-get-speed() (property method)
fan-set-speed()

hc://chassis=0/chip=1/cpu=0
property list:
get-temp()

hc://chassis=0/chip=1/cpu=0?sensor=temperature
property list:
get-temp()
warn-threshold()

Of course, we could probably do the same thing with properties attached
to the CPU temperature sensor node and not extend the FMRI namespace:

hc://chassis=0/chip=1/cpu=0?sensor=temperature
property list:
get-temp()
warn-threshold()
fan=hc://chassis=0/fan=0
fan-maxspeed=100
fan-get-speed()
fan-set-speed()

BTW, libtopo documentation is coming very soon along with an updated
version of the fmd programmer's reference manual.


> Also, is this framework really just for system-health related items, or
> should there be some overlap, e.g. with lid switches on laptops? (Which
> are just another sensor, in implementation.)

If you're asking is this a framework suitable for system events, I'd say
no and refer you to sysevent. If you're asking can sysevents be used in
telemetry analysis, I'd say yes.

Cindi

_______________________________________________
fm-discuss mailing list
fm-discuss at opensolaris dot org



bshaw666

Posts: 79
From: CA

Registered: 6/14/05
Re: Re: Sensor design doc now available
Posted: Jun 12, 2007 10:09 AM   in response to: gdamore
To: Communities » fm » discuss
Cc: OpenSolaris » discuss
  Click to reply to this thread Reply

> Agreed and the sooner we start thinking about what we want to monitor
> and how the sooner we can define the namespace. If you have any
> ideas, I would like to hear them.

For the "how" part, why not simply use SNMP. Sensor monitoring is included in net-snmp 5.4.1 (and other versions). It currently uses the LM-SENSORS-MIB, which is really old and clumsy and designed specifically for the LM75 chip.

I'd like to see us create a new, up-to-date MIB to specifically support Sun sensors, but given this is an open source project, perhaps we could approach the open source sensors community and work towards a common MIB.

pavel_b

Posts: 1
From: Prague

Registered: 2/7/08
Re: Sensor design doc now available
Posted: Jul 25, 2008 4:15 AM   in response to: cynthia
To: Communities » fm » discuss
  Click to reply to this thread Reply

Sorry for maybe offtopic question.
What is the current state of the project? Is it still alive? Where can I get some info?

Thanks,
Pavel.

eschrock

Posts: 804
From: Menlo Park, CA

Registered: 3/9/05
Re: Sensor design doc now available
Posted: Jul 25, 2008 10:17 AM   in response to: pavel_b

  Click to reply to this thread Reply

On Fri, Jul 25, 2008 at 04:15:34AM -0700, Pavel Bures wrote:
> Sorry for maybe offtopic question.
> What is the current state of the project? Is it still alive? Where can I get some info?

The first phase of this - libtopo facility nodes and support for
IPMI-based sensors - will be putback to the current build (96). This
will be followed shortly after by support for SES sensors and
indicators, hopefully also in build 96.

After that, the immediate plan is to add support for fan and PSU
diagnosis based on this sensor data, as well as generalizing the
disk-monitor module to manage indicators for disks in SES enclosures.

The long term plans described in the design doc (alerts, monitors, etc)
are less clear, but we'll working on those once the immediate objectives
are achieved.

Hope that helps,

- Eric

--
Eric Schrock, Fishworks http://blogs.sun.com/eschrock
_______________________________________________
fm-discuss mailing list
fm-discuss at opensolaris dot org


eschrock

Posts: 804
From: Menlo Park, CA

Registered: 3/9/05
Re: Sensor design doc now available
Posted: Aug 8, 2008 2:56 PM   in response to: eschrock

  Click to reply to this thread Reply

FYI -

The first phase of this was putback to snv_96 as planned. Blog entries
here:

http://blogs.sun.com/robj/entry/a_sensor_abstraction_layer_for
http://blogs.sun.com/eschrock/entry/ses_sensors

As for the next phases, I'm about halfway done with a fan/psu diagnosis
prototype so that one shouldn't take too long.

- Eric

On Fri, Jul 25, 2008 at 10:17:03AM -0700, Eric Schrock wrote:
> On Fri, Jul 25, 2008 at 04:15:34AM -0700, Pavel Bures wrote:
> > Sorry for maybe offtopic question.
> > What is the current state of the project? Is it still alive? Where can I get some info?
>
> The first phase of this - libtopo facility nodes and support for
> IPMI-based sensors - will be putback to the current build (96). This
> will be followed shortly after by support for SES sensors and
> indicators, hopefully also in build 96.
>
> After that, the immediate plan is to add support for fan and PSU
> diagnosis based on this sensor data, as well as generalizing the
> disk-monitor module to manage indicators for disks in SES enclosures.
>
> The long term plans described in the design doc (alerts, monitors, etc)
> are less clear, but we'll working on those once the immediate objectives
> are achieved.
>
> Hope that helps,
>
> - Eric
>
> --
> Eric Schrock, Fishworks http://blogs.sun.com/eschrock
> _______________________________________________
> fm-discuss mailing list
> fm-discuss at opensolaris dot org

--
Eric Schrock, Fishworks http://blogs.sun.com/eschrock
_______________________________________________
fm-discuss mailing list
fm-discuss at opensolaris dot org





Terms of Use | Privacy | Trademarks | Copyright Policy | Site Guidelines
Your use of this web site or any of its content or software indicates your agreement to be bound by these Terms of Use.
© 2010, Oracle Corporation and/or its affiliates

Oracle