<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Wisdom of Specificity in Monitoring and Alerting</title>
	<atom:link href="http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/feed/" rel="self" type="application/rss+xml" />
	<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/</link>
	<description>Just a nublet SysAdmin</description>
	<lastBuildDate>Sat, 28 Jan 2012 05:18:15 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Wesley.Nonapeptide</title>
		<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/comment-page-1/#comment-11</link>
		<dc:creator>Wesley.Nonapeptide</dc:creator>
		<pubDate>Sun, 02 May 2010 00:13:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.thenubbyadmin.com/?p=41#comment-11</guid>
		<description>@Jason - Good idea about focusing on the collection and less on the interpretation. Don&#039;t worry about pitching a product even if it&#039;s your own. I&#039;m all for people talking about what they believe in even if they also get paid from it.

I like to think of my blog as a free-for-all. If anyone wants to say anything they&#039;re welcome to (except if it involves R0l3X watches), but just be sure to back it up and be ready to get some tough questions from other commentors (or myself). =)</description>
		<content:encoded><![CDATA[<p>@Jason &#8211; Good idea about focusing on the collection and less on the interpretation. Don&#8217;t worry about pitching a product even if it&#8217;s your own. I&#8217;m all for people talking about what they believe in even if they also get paid from it.</p>
<p>I like to think of my blog as a free-for-all. If anyone wants to say anything they&#8217;re welcome to (except if it involves R0l3X watches), but just be sure to back it up and be ready to get some tough questions from other commentors (or myself). =)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason Dixon</title>
		<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/comment-page-1/#comment-10</link>
		<dc:creator>Jason Dixon</dc:creator>
		<pubDate>Sat, 01 May 2010 14:16:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.thenubbyadmin.com/?p=41#comment-10</guid>
		<description>We&#039;re attempting to address this chasm in thought with Circonus (http://circonus.com/).  The emphasis should be on metric collection, not some interpretation of what they mean (that is best left to the user).  Enable the user to correlate any disparate metrics for trending and root cause analysis... it&#039;s a very powerful method (and tool) that to date has been vastly underutilized.

P.S.  Sorry for the sales-pitch-esque answer, but we&#039;re passionate about this stuff.  :)</description>
		<content:encoded><![CDATA[<p>We&#8217;re attempting to address this chasm in thought with Circonus (<a href="http://circonus.com/" rel="nofollow">http://circonus.com/</a>).  The emphasis should be on metric collection, not some interpretation of what they mean (that is best left to the user).  Enable the user to correlate any disparate metrics for trending and root cause analysis&#8230; it&#8217;s a very powerful method (and tool) that to date has been vastly underutilized.</p>
<p>P.S.  Sorry for the sales-pitch-esque answer, but we&#8217;re passionate about this stuff.  :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wesley.Nonapeptide</title>
		<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/comment-page-1/#comment-8</link>
		<dc:creator>Wesley.Nonapeptide</dc:creator>
		<pubDate>Fri, 30 Apr 2010 21:33:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.thenubbyadmin.com/?p=41#comment-8</guid>
		<description>@Doug: &quot;the key when monitoring is to look at it from the perspective of the service offered to end user&quot; 

Quoted for truth. What tools do you typically use for that kind of service monitoring?

I think the key to Systems Administration / Engineering in general is to look at everything from the perspective of the service being offered to the user.</description>
		<content:encoded><![CDATA[<p>@Doug: &#8220;the key when monitoring is to look at it from the perspective of the service offered to end user&#8221; </p>
<p>Quoted for truth. What tools do you typically use for that kind of service monitoring?</p>
<p>I think the key to Systems Administration / Engineering in general is to look at everything from the perspective of the service being offered to the user.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doug Luxem</title>
		<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/comment-page-1/#comment-7</link>
		<dc:creator>Doug Luxem</dc:creator>
		<pubDate>Fri, 30 Apr 2010 21:22:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.thenubbyadmin.com/?p=41#comment-7</guid>
		<description>Reiterating what Jason said - the key when monitoring is to look at it from the perspective of the service offered to end user, not the individual components.  You could monitor the HTTP port, DNS queries, the Postgre back-end; or instead, have a monitoring platform that can enter a username and password on your login page (or whatever) and make sure it gets a valid response back.

The other benefit is that you can monitor and alert on response time of the total system whereas the ICMP round trip is not really useful.

Regarding switching web servers via DNS - that sounds like typical hosting extortion :)</description>
		<content:encoded><![CDATA[<p>Reiterating what Jason said &#8211; the key when monitoring is to look at it from the perspective of the service offered to end user, not the individual components.  You could monitor the HTTP port, DNS queries, the Postgre back-end; or instead, have a monitoring platform that can enter a username and password on your login page (or whatever) and make sure it gets a valid response back.</p>
<p>The other benefit is that you can monitor and alert on response time of the total system whereas the ICMP round trip is not really useful.</p>
<p>Regarding switching web servers via DNS &#8211; that sounds like typical hosting extortion :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wesley.Nonapeptide</title>
		<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/comment-page-1/#comment-6</link>
		<dc:creator>Wesley.Nonapeptide</dc:creator>
		<pubDate>Fri, 30 Apr 2010 20:42:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.thenubbyadmin.com/?p=41#comment-6</guid>
		<description>@Matt: I sympathize concerning the assumption that if I&#039;m not getting alerts I&#039;m okay. I have a backup program that is supposed to email me upon failure. It&#039;s poorly implemented and sometimes that email doesn&#039;t reach me. For those situations, I create recurring tasks in Outlook to check on various systems. It seems to defeat the purpose of automation if you have to manually check it... but such is life as long as the second law of thermodynamics is in effect.

@Jason: Sorry your comment got moderated. Wordpress captures comments that have more than one URL in it. I&#039;ll ease up on that.

Moving on, you&#039;re exactly right about focusing on purpose over availability as well as needing to have insight into the unexpected. It was totally unexpected that my web host would switch the resolution of my domain to one of their other servers upon account suspension. I would have assumed that all resolution for my domain would have been suspended. Doesn&#039;t that seem odd to anyone else?</description>
		<content:encoded><![CDATA[<p>@Matt: I sympathize concerning the assumption that if I&#8217;m not getting alerts I&#8217;m okay. I have a backup program that is supposed to email me upon failure. It&#8217;s poorly implemented and sometimes that email doesn&#8217;t reach me. For those situations, I create recurring tasks in Outlook to check on various systems. It seems to defeat the purpose of automation if you have to manually check it&#8230; but such is life as long as the second law of thermodynamics is in effect.</p>
<p>@Jason: Sorry your comment got moderated. WordPress captures comments that have more than one URL in it. I&#8217;ll ease up on that.</p>
<p>Moving on, you&#8217;re exactly right about focusing on purpose over availability as well as needing to have insight into the unexpected. It was totally unexpected that my web host would switch the resolution of my domain to one of their other servers upon account suspension. I would have assumed that all resolution for my domain would have been suspended. Doesn&#8217;t that seem odd to anyone else?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason Dixon</title>
		<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/comment-page-1/#comment-4</link>
		<dc:creator>Jason Dixon</dc:creator>
		<pubDate>Fri, 30 Apr 2010 17:50:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.thenubbyadmin.com/?p=41#comment-4</guid>
		<description>The problem with &quot;uptime monitoring&quot; and monitoring in general is that we&#039;ve been trained to focus on availability, rather than purpose.  It&#039;s not easy to make good monitors;  it takes knowledge of the application, insight into the unexpected and a commitment to perfection (excuse the cheese).

Anyhoo, I&#039;ve written on this issue a couple times already.

http://obfuscurity.com/2009/09/Business-Metrics
http://omniti.com/seeds/business-metrics-too</description>
		<content:encoded><![CDATA[<p>The problem with &#8220;uptime monitoring&#8221; and monitoring in general is that we&#8217;ve been trained to focus on availability, rather than purpose.  It&#8217;s not easy to make good monitors;  it takes knowledge of the application, insight into the unexpected and a commitment to perfection (excuse the cheese).</p>
<p>Anyhoo, I&#8217;ve written on this issue a couple times already.</p>
<p><a href="http://obfuscurity.com/2009/09/Business-Metrics" rel="nofollow">http://obfuscurity.com/2009/09/Business-Metrics</a><br />
<a href="http://omniti.com/seeds/business-metrics-too" rel="nofollow">http://omniti.com/seeds/business-metrics-too</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matt Simmons</title>
		<link>http://thenubbyadmin.com/2010/04/30/the-wisdom-of-specificity-in-monitoring-and-alerting/comment-page-1/#comment-2</link>
		<dc:creator>Matt Simmons</dc:creator>
		<pubDate>Fri, 30 Apr 2010 17:13:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.thenubbyadmin.com/?p=41#comment-2</guid>
		<description>Oh man....me and assumptions...we go way back :-) 

DNS is one of those really twitchy things, too, because one small error has untold ramifications. A large problem like the one you experienced wreaks havoc. 

My biggest assumption when monitoring is always that if I don&#039;t get alerts, then things are fine. Before I had two Nagios servers watching each other&#039;s backs, if Nagios goes down, how do you know? For that matter, even if you&#039;re monitoring it, if your only network connection goes down, how do you find out? 

Yeah, I&#039;ve got a loooong history of bad assumptions. I&#039;d assume that I&#039;m getting better, but that&#039;s just part of the problem ;-)</description>
		<content:encoded><![CDATA[<p>Oh man&#8230;.me and assumptions&#8230;we go way back :-) </p>
<p>DNS is one of those really twitchy things, too, because one small error has untold ramifications. A large problem like the one you experienced wreaks havoc. </p>
<p>My biggest assumption when monitoring is always that if I don&#8217;t get alerts, then things are fine. Before I had two Nagios servers watching each other&#8217;s backs, if Nagios goes down, how do you know? For that matter, even if you&#8217;re monitoring it, if your only network connection goes down, how do you find out? </p>
<p>Yeah, I&#8217;ve got a loooong history of bad assumptions. I&#8217;d assume that I&#8217;m getting better, but that&#8217;s just part of the problem ;-)</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Served from: thenubbyadmin.com @ 2012-02-04 21:42:11 by W3 Total Cache -->
