<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The AstroStat Slog &#187; likelihoodist</title>
	<atom:link href="http://groundtruth.info/AstroStat/slog/tag/likelihoodist/feed/" rel="self" type="application/rss+xml" />
	<link>http://groundtruth.info/AstroStat/slog</link>
	<description>Weaving together Astronomy+Statistics+Computer Science+Engineering+Intrumentation, far beyond the growing borders</description>
	<lastBuildDate>Fri, 09 Sep 2011 17:05:33 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Is 3sigma the same as 3*1sigma?</title>
		<link>http://groundtruth.info/AstroStat/slog/2007/3-times-sigma/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2007/3-times-sigma/#comments</comments>
		<pubDate>Thu, 08 Feb 2007 20:29:22 +0000</pubDate>
		<dc:creator>vlk</dc:creator>
				<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Fitting]]></category>
		<category><![CDATA[Frequentist]]></category>
		<category><![CDATA[Jargon]]></category>
		<category><![CDATA[Stat]]></category>
		<category><![CDATA[Uncertainty]]></category>
		<category><![CDATA[central limit theorem]]></category>
		<category><![CDATA[confidence bounds]]></category>
		<category><![CDATA[error bar]]></category>
		<category><![CDATA[highest posterior density]]></category>
		<category><![CDATA[HPD]]></category>
		<category><![CDATA[likelihoodist]]></category>
		<category><![CDATA[sigma]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/2007/3-times-sigma/</guid>
		<description><![CDATA[Sometime early this year, Jeremy Drake asked this innocuous sounding question in the context of determining the bounds on the amplitude of an absorption line: Is the 3sigma error bar the same length as 3 times the 1sigma error bar?
In other words, if he measured the 99.7% confidence range for his model parameter, would it [...]]]></description>
			<content:encoded><![CDATA[<p>Sometime early this year, Jeremy Drake asked this innocuous sounding question in the context of determining the bounds on the amplitude of an absorption line: Is the 3sigma error bar the same length as 3 times the 1sigma error bar?</p>
<p>In other words, if he measured the 99.7% confidence range for his model parameter, would it always be thrice as large as the nominal <em>1sigma</em> confidence range?  The answer is complicated, and depends on who you ask: Frequentists will say &#8220;yes, of course!&#8221;, Likelihoodists will say &#8220;Maybe, yeah, er, depends&#8221;, and Bayesians will say &#8220;<em>sigma</em>?  what&#8217;s that?&#8221;  So I think it would be useful to expound a bit more on this to astronomers, whose mental processes are generally Bayesian but whose computational tools are mostly Frequentist.<br />
<span id="more-10"></span></p>
<p>In the Frequentist paradigm, one fits a parameterized model to numerous datasets, and computes the errors on the fitted parameters by looking at the ensemble of the parameter values.  That is, if you have one dataset <em>Y<sub>i</sub>(X<sub>i</sub>) {i=1..N<sub>bin</sub>}</em> and fit a model <em>f(<strong>A</strong>)</em> with parameters <em><strong>A</strong></em> (usually by minimizing the chisquare (χ<sup>2</sup>) deviations or something of that sort) you will get one set of best-fit parameters, <em><strong>Â</strong></em>.  It is important to note that in this paradigm, the concept of determining an error on <em><strong>Â</strong></em> <em>does not exist!</em>.  Errors are only determined by fitting <em>f(<strong>A</strong>)</em> to a series of datasets <em>Y<sub>ij</sub>(X<sub>i</sub>), {i=1..N<sub>bin</sub>, j=1..N<sub>obs</sub>}</em>, and deriving <em><strong>Â</strong><sub>j</sub></em> for each of them, and computing <em>sigma<sub><strong>A</strong></sub><sup>2</sup>=variance( {<strong>Â</strong><sub>j</sub>} )</em>.  By the central limit theorem, as <em>N<sub>obs</sub></em> increases, the distribution of <em>{ <strong>Â</strong><sub>j</sub></em> } will tend closer to a Gaussian.  So, in this case, <em>3*sigma<sub><strong>A</strong></sub></em> will asymptotically become equal to <em>3sigma<sub><strong>A</strong></sub></em>.</p>
<p>However, Astronomers are not afforded this luxury of having multiple observations.  With dreary regularity, we have to make do with <em>N<sub>obs</sub>=1</em>.  In that case, one takes recourse in the behavior of the χ<sup>2</sup>, and in particular, the likelihood of obtaining a certain χ<sup>2</sup> for the observed data given the model.  If we assume that the measurement errors are Normally distributed, and that we are looking in that part of the parameter space close to the best-fit solution, then it can be shown that the χ<sup>2</sup> varies as a parabolic function of the parameter values. The <em>1sigma</em> error on the parameter (not necessarily from a linear model; see <a href="http://adsabs.harvard.edu/abs/1976A%26A....52..307C" title="Cash 1979">Cash 1979</a>, for example) is computed by determining the parameter values at which the χ<sup>2</sup> function (computed by stepping through the values of one parameter and varying all the other parameters to minimize the χ<sup>2</sup> at that point) increases by 1.  Similarly, the <em>3sigma</em> error is computed by checking at what values the χ<sup>2</sup> increases by 9.  As long as the assumptions of local linearity and the Gaussian measurement error hold (i.e., the χ<sup>2</sup> function is parabolic), then we will have <em>3*1sigma=3sigma</em>, otherwise not.</p>
<p>Bayesians on the other hand compute the marginalized posterior probability density for each parameter and calculate the bounds on the parameter via credible regions that enclose a given percentage of the area under the curve.  There is no set recipe for specifying how the bounds are calculated: it could be one-sided, centered on the median, or include the mode and the highest posterior densities.  (The last leads to the smallest possible interval.)  The percentage levels for which the intervals are calculated are usually matched to the percentages that Gaussian standard deviations provide, e.g., <em>68% == 1sigma</em>, <em>99.7% == 3sigma</em>, etc.  Thus, when the 68% credible range is mentioned, it could also be referred to as the &#8220;Gaussian-equivalent <em>1sigma</em> region&#8221;.  However, because the intervals can be defined in many different ways, they do not necessarily match the <em>1sigma</em> error bar, and for that matter are often asymmetrical.  Given that, it is quite unreasonable to expect that <em>3*1sigma</em>, even on one side, would match the Gaussian-equivalent <em>3sigma</em>.  IF the pdf is Gaussian, AND we choose to use HPD (highest posterior density) intervals, THEN the 99.7% interval would be thrice as long as the 68% interval.</p>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2007/3-times-sigma/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

