<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The AstroStat Slog &#187; CHASC</title>
	<atom:link href="http://groundtruth.info/AstroStat/slog/category/chasc/feed/" rel="self" type="application/rss+xml" />
	<link>http://groundtruth.info/AstroStat/slog</link>
	<description>Weaving together Astronomy+Statistics+Computer Science+Engineering+Intrumentation, far beyond the growing borders</description>
	<lastBuildDate>Fri, 09 Sep 2011 17:05:33 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>[AAS-HEAD 2011] Time Series in High Energy Astrophysics</title>
		<link>http://groundtruth.info/AstroStat/slog/2011/head2011/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2011/head2011/#comments</comments>
		<pubDate>Fri, 09 Sep 2011 17:05:33 +0000</pubDate>
		<dc:creator>vlk</dc:creator>
				<category><![CDATA[Astro]]></category>
		<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[Methods]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Optical]]></category>
		<category><![CDATA[Stat]]></category>
		<category><![CDATA[Timing]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[AAS]]></category>
		<category><![CDATA[HEAD]]></category>
		<category><![CDATA[September]]></category>
		<category><![CDATA[special session]]></category>
		<category><![CDATA[Timing Analysis]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/?p=4279</guid>
		<description><![CDATA[We organized a Special Session on Time Series in High Energy Astrophysics: Techniques Applicable to Multi-Dimensional Analysis on Sep 7, 2011, at the AAS-HEAD conference at Newport, RI.  The talks presented at the session are archived at http://hea-www.harvard.edu/AstroStat/#head2011
A tremendous amount of information is contained within the temporal variations of various measurable quantities, such as [...]]]></description>
			<content:encoded><![CDATA[<p>We organized a Special Session on <em>Time Series in High Energy Astrophysics: Techniques Applicable to Multi-Dimensional Analysis</em> on Sep 7, 2011, at the AAS-HEAD conference at Newport, RI.  The talks presented at the session are archived at <a href="http://hea-www.harvard.edu/AstroStat/#head2011">http://hea-www.harvard.edu/AstroStat/#head2011</a></p>
<blockquote><p>A tremendous amount of information is contained within the temporal variations of various measurable quantities, such as the energy distributions of the incident photons, the overall intensity of the source, and the spatial coherence of the variations. While the detection and interpretation of periodic variations is well studied, the same cannot be said for non-periodic behavior in a multi-dimensional domain.  Methods to deal with such problems are still primitive, and any attempts at sophisticated analyses are carried out on a case-by-case basis. Some of the issues we seek to focus on are methods to deal with are:<br />
* Stochastic variability<br />
* Chaotic Quasi-periodic variability<br />
* Irregular data gaps/unevenly sampled data<br />
* Multi-dimensional analysis<br />
* Transient classification</p>
<p>Our goal is to present some basic questions that require sophisticated temporal analysis in order for progress to be made.   We plan to bring together astronomers and statisticians who are working in many different subfields so that an exchange of ideas can occur to motivate the development of sophisticated and generally applicable algorithms to astronomical time series data. We will review the problems and issues with current methodology from an algorithmic and statistical perspective and then look for improvements or for new methods and techniques.
</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2011/head2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>mini-Workshop on Computational AstroStatistics [announcement]</title>
		<link>http://groundtruth.info/AstroStat/slog/2010/workshop-aug2010/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2010/workshop-aug2010/#comments</comments>
		<pubDate>Mon, 21 Jun 2010 16:25:31 +0000</pubDate>
		<dc:creator>chasc</dc:creator>
				<category><![CDATA[Astro]]></category>
		<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Methods]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Optical]]></category>
		<category><![CDATA[Stat]]></category>
		<category><![CDATA[X-ray]]></category>
		<category><![CDATA[gamma-ray]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Announcement]]></category>
		<category><![CDATA[astronomical data]]></category>
		<category><![CDATA[astrostatistics]]></category>
		<category><![CDATA[Aug 2010]]></category>
		<category><![CDATA[August]]></category>
		<category><![CDATA[Computational]]></category>
		<category><![CDATA[workshop]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/?p=4244</guid>
		<description><![CDATA[mini-Workshop on Computational Astro-statistics: Challenges and Methods for Massive Astronomical Data
Aug 24-25, 2010
Phillips Auditorium, CfA,
60 Garden St., Cambridge, MA 02138
URL: http://hea-www.harvard.edu/AstroStat/CAS2010

The California-Boston-Smithsonian Astrostatistics Collaboration plans to host a mini-workshop on Computational Astro-statistics.  With the advent of new missions like the Solar Dynamic Observatory (SDO), Panoramic Survey and Rapid Response (Pan-STARRS) and Large Synoptic Survey [...]]]></description>
			<content:encoded><![CDATA[<p><strong>mini-Workshop on Computational Astro-statistics: Challenges and Methods for Massive Astronomical Data</strong><br />
<em>Aug 24-25, 2010</em><br />
Phillips Auditorium, CfA,<br />
60 Garden St., Cambridge, MA 02138</p>
<p>URL: <a href="http://hea-www.harvard.edu/AstroStat/CAS2010">http://hea-www.harvard.edu/AstroStat/CAS2010</a><br />
<span id="more-4244"></span></p>
<blockquote><p>The California-Boston-Smithsonian Astrostatistics Collaboration plans to host a mini-workshop on Computational Astro-statistics.  With the advent of new missions like the Solar Dynamic Observatory (SDO), Panoramic Survey and Rapid Response (Pan-STARRS) and Large Synoptic Survey (LSST), astronomical data collection is fast outpacing our capacity to analyze them.  Astrostatistical effort has generally focused on principled analysis of individual observations, on one or a few sources at a time.  But the new era of data intensive observational astronomy forces us to consider combining multiple datasets and infer parameters that are common to entire populations. Many astronomers really want to use every data point and even non-detections, but this becomes problematic for many statistical techniques.</p>
<p>The goal of the Workshop is to explore new problems in Astronomical data analysis that arise from data complexity.  Our focus is on problems that have generally been considered intractable due to insufficient computational power or inefficient algorithms, but are now becoming tractable.  Examples of such problems include: accounting for uncertainties in instrument calibration; classification, regression, and density estimations of massive data sets that may be truncated and contaminated with measurement errors and outliers; and designing statistical emulators to efficiently approximate the output from complex astrophysical computer models and simulations, thus making statistical inference on them tractable.  We aim to present some issues to the statisticians and clarify difficulties with the currently used methodologies, e.g. MCMC methods.  The Workshop will consist of review talks on current Statistical methods by Statisticians, descriptions of data analysis issues by astronomers, and open discussions between Astronomers and Statisticians.  We hope to define a path for development of new algorithms that target specific issues, designed to help with applications to SDO, Pan-STARRS, LSST, and other survey data.</p>
<p>We hope you will be able to attend the workshop and present a brief talk on the scope of the data analysis problem that you confront in your project.  The workshop will have presentations in the morning sessions, followed by a discussion session in the afternoons of both days.
</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2010/workshop-aug2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>[MADS] multiscale modeling</title>
		<link>http://groundtruth.info/AstroStat/slog/2008/mads-multiscale-modeling/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2008/mads-multiscale-modeling/#comments</comments>
		<pubDate>Thu, 11 Dec 2008 19:46:05 +0000</pubDate>
		<dc:creator>hlee</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Cross-Cultural]]></category>
		<category><![CDATA[Jargon]]></category>
		<category><![CDATA[MADS]]></category>
		<category><![CDATA[Markov random field]]></category>
		<category><![CDATA[model]]></category>
		<category><![CDATA[multiscale]]></category>
		<category><![CDATA[multiscale modeling]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/?p=1322</guid>
		<description><![CDATA[A few scientists in our group work on estimating the intensities of gamma ray observations from sky surveys. This work distinguishes from typical image processing which mostly concerns the point estimation of intensity at each pixel location and the size of overall white noise type error. Often times you will notice from image processing that [...]]]></description>
			<content:encoded><![CDATA[<p>A few scientists in our group work on estimating the intensities of gamma ray observations from sky surveys. This work distinguishes from typical image processing which mostly concerns the point estimation of intensity at each pixel location and the size of overall white noise type error. Often times you will notice from image processing that the orthogonality between errors and sources, and the white noise assumptions. These assumptions are typical features in image processing utilities and modules. On the other hand, CHASC scientists relate more general and broad statistical inference problems in estimating the intensity map, like intensity uncertainties at each point and the scientifically informative display of the intensity map with uncertainty according to the Poisson count model and constraints from physics and the instrument, where the field, <b>multiscale modeling</b> is associated. <span id="more-1322"></span></p>
<p>As the post title [MADS] indicates, no abstract has keywords <b>multiscale modeling.</b> It seems like that just the jargon is not listed in ADS since &#8220;multiscale modeling&#8221; is practiced in astronomy. One of examples is our group&#8217;s work. Those CHASC scientists take Bayesian modeling approaches, which makes them unique to my knowledge in the astronomical society. However, I expected constructing an intensity map through statistical inference (estimation) or &#8220;multiscale modeling&#8221; to be popular among astronomers in recent years. Well, none came along from my abstract keyword search.  </p>
<p>Wikipedia also shows a very brief description of <b>multiscale modeling</b> and emphasized that it is a fairly new interdisciplinary topic. <a href="http://en.wikipedia.org/wiki/Multiscale_modeling">wiki:multiscale_modeling.</a> TomLoredo kindly informed me some relevant references from <a href="http://ads.harvard.edu">ADS</a> after my post <a href="groundthruth.info/AstroStat/slog/2008/mads-hmm">[MADS] HMM.</a> He mentioned his search words were <b>Markov Random Fields</b> which can be found from <u><i>stochastic geometry</i><i></i></u> and <u><i>spatial statistics</i></u> in addition to many applications in computer science. Not only these publications but he gave me a nice comment on analyzing astronomical data, which I&#8217;d rather postpone for another discussion.</p>
<ul>
<li><a href="http://adsabs.harvard.edu/abs/2008HEAD...10.0301C">Quantifying Doubt and Confidence in Image &#8220;Deconvolution&#8221;</a> by Connors, Alanna; van Dyk, D.; Chiang, J.; CHASC</li>
<li><a href="http://adsabs.harvard.edu/abs/1998SPIE.3353.1006J">Blind Bayesian restoration of adaptive optics telescope images using generalized Gaussian Markov random field models</a>  by Jeffs, Brian D.; Christou, Julian C.</li>
<li><a href="http://adsabs.harvard.edu/abs/1997scma.conf..409T"> Segmenting Chromospheric Images with Markov Random Fields  (paper in SCMA II)</a> Turmon, Michael J.; Pap, Judit M.</li>
<li><a href="http://adsabs.harvard.edu/abs/1997daa..conf..289M"> Bayesian deconvolution methods in astronomy</a> by Molina, R.; Katsaggelos, A. K.; Mateos, J</li>
<li><a href="http://adsabs.harvard.edu/abs/1996VA.....40..539M">Compound Gauss-Markov random fields for astronomical image restoration</a> by Molina, R.; Katsaggelos, A. K.; Mateos, J.; Abad, J</li>
<li><a href="http://adsabs.harvard.edu/abs/1992daia.conf...39J">Markov random field applications in image analysis</a> by Jain, A. K.; Nadabar, S. G (I bet &#8220;Jain&#8221; is the author of many celebrated papers in image processing and machine learning. I often find that well known computer scientists involve in astronomical researches ).</li>
</ul>
<p>The reason I was not able to find these papers was that they are not published in the 4 major astronomical publications + Solar Physics. The reason for this limited search is that I was overwhelmed by the amount of unlimited search results including arxiv. (I wonder if there is a way to do exclusive searches in ADS by excluding arxiv:comp, arxiv:phys, arxiv:math, etc). Thank you, Tom, for providing me these references. </p>
<p>Please, check out <a href="http://hea-www.harvard.edu/AstroStat">CHASC website</a> for more study results related to &#8220;multiscale modeling&#8221; from our group. </p>
<p><b>[Added]</b> Nice tutorials related to Markov Random Fields (MRF) recommended by an expert in the field and a friend (all are pdfs).</p>
<ol>
<li><a href="https://engineering.purdue.edu/%7Ebouman/publications/tutorials/mrf_tutorial/view.pdf">Markov Random Fields and Stochastic Image Models (ICIP 1995 Invited Tutorial)</a>
</li>
<li><a href="https://engineering.purdue.edu/%7Ebouman/ece641/notes/pdf/ReadingList.pdf">Digital Image Processing II Reading List</a>
</li>
<li><a href="https://engineering.purdue.edu/%7Ebouman/ece641/notes/pdf/Book-header.pdf">MAP, EM, MRFs, and All That: A User&#8217;s Guide to the Tools of Model Based Image Processing (incomplete)<br />
</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2008/mads-multiscale-modeling/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>survey and design of experiments</title>
		<link>http://groundtruth.info/AstroStat/slog/2008/survey-and-design-of-experiments/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2008/survey-and-design-of-experiments/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 20:16:24 +0000</pubDate>
		<dc:creator>hlee</dc:creator>
				<category><![CDATA[Astro]]></category>
		<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Cross-Cultural]]></category>
		<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[Jargon]]></category>
		<category><![CDATA[Methods]]></category>
		<category><![CDATA[Misc]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Stat]]></category>
		<category><![CDATA[213]]></category>
		<category><![CDATA[AAS]]></category>
		<category><![CDATA[Alanna Connors]]></category>
		<category><![CDATA[catalog]]></category>
		<category><![CDATA[census]]></category>
		<category><![CDATA[detection]]></category>
		<category><![CDATA[experimental design]]></category>
		<category><![CDATA[Long Beach]]></category>
		<category><![CDATA[special session]]></category>
		<category><![CDATA[SPS]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/?p=894</guid>
		<description><![CDATA[People of experience would say very differently and wisely against what I&#8217;m going to discuss now. This post only combines two small cross sections of each branch of two trees, astronomy and statistics. 
When it comes to survey, the first thing comes in my mind is the census packet although I only saw it once [...]]]></description>
			<content:encoded><![CDATA[<p>People of experience would say very differently and wisely against what I&#8217;m going to discuss now. This post only combines two small cross sections of each branch of two trees, astronomy and statistics. <span id="more-894"></span></p>
<p>When it comes to <b>survey,</b> the first thing comes in my mind is the census packet although I only saw it once (an easy way to disguise my age but this is true) but the questionaire layouts are so carefully and extensively done so as to give me a strong impression. Such survey is designed prior to collecting data so that after collection, data can be analyzed according to statistical methodology suitable to the design of the survey. Strategies for response quantification is also included (yes/no for 0/1, responses in 0 to 10 scale, bracketing salaries, age groups, and such, handling missing data) for elaborated statistical analysis to avoid subjective data transformation and arbitrary outlier eliminations.</p>
<p>In contrast, survey in astronomy means designing a mesh, not questionaires, unable to be transcribed into statistical models. This mesh has multiple layers like telescope, detector, and source detection algorithm, and eventually produces a catalog. Designing statistical methodology is not a part of it that draws interpretable conclusion. Collecting what goes through that mesh is astronomical survey. Analyzing the catalog does not necessarily involve sophisticated statistics but often times adopts chi-sq fittings and cast aways of unpleasant/uninteresting data points.</p>
<p>As other conflicts in jargon, &#8211;a simplest example is H<sub>o</sub>: I used to know it as <b>Hubble constant</b> but now, it is recognized first as a notation for a <b>null hypothesis</b> &#8212; <b>survey</b> has been one of them and like the measurement error, some clarification about the term, <b>survey</b> is expected to be given by knowledgeable astrostatisticians to draw more statisticians involvement in grand survey projects soon to come. Luckily, the first opportunity will be given soon at the <a href="http://aas.org/meetings/aas213/event_special_ss05.php">Special Session: Meaning from Surveys and Population Studies: BYOQ</a> during <a href="http://aas.org/meetings/aas213">the 213 AAS meeting,</a> at Long Beach, California on Jan. 5th, 2009. </p>
<p><!--I have a bit vague memory using probably the daophot package under IRAF (more than 10 years ago, this big gap is likely to cause biased opinion), my knowing IRAF happened before I see census package) but remember how magnitudes of stars were retrieved. They were retrieved, not collected. That's the largest difference between survey considered in astronomy versus statistics. Gold mining...as much as you can get is astronomy;<br />
My point is that because different philosophy under one word, it is not recommended to apply traditional statistical survey methods or experimental designs into astronomical survey. Furthermore, it is not wise to apply methodology of 19th century for 21st century survey data. --></p>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2008/survey-and-design-of-experiments/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The LRT is worthless for &#8230;</title>
		<link>http://groundtruth.info/AstroStat/slog/2008/the-lrt-is-worthless-for/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2008/the-lrt-is-worthless-for/#comments</comments>
		<pubDate>Fri, 25 Apr 2008 05:48:06 +0000</pubDate>
		<dc:creator>hlee</dc:creator>
				<category><![CDATA[Bad AstroStat]]></category>
		<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Frequentist]]></category>
		<category><![CDATA[Stat]]></category>
		<category><![CDATA[arXiv]]></category>
		<category><![CDATA[LRT]]></category>
		<category><![CDATA[mixture]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/?p=214</guid>
		<description><![CDATA[We prove in this paper that the LRT is worthless from testing a distribution against a two components mixture when the set of parameters is large. ]]></description>
			<content:encoded><![CDATA[<p>One of the speakers from <a href="http://groundtruth.info/AstroStat/slog/2008/google-sky/">the google talk series</a> exemplified model based clustering and mentioned the likelihood ratio test (LRT) for defining the number of clusters. Since I&#8217;ve seen the examples of ill-mannerly practiced LRTs from astronomical journals, like testing two clusters vs three, or a higher number of components, I could not resist indicating that the LRT is improperly used from his illustration. As a reply, the citation regarding the LRT was different from his plot and the test was carried out to test one component vs. two, which closely observes the regularity conditions. I was relieved not to find another example of the ill-used LRT.<span id="more-214"></span></p>
<p>There are various tests applicable according to needs and conditions from data and source models but it seems no popular astronomical lexicons have these on demand tests except the LRT (Once I saw the score test since posting [ArXiv]s in the slog and a few non-parametric rank based tests over the years). I&#8217;m sure well knowledgeable  astronomers soon point out that I jumped into a conclusion too quickly and bring up counter-examples. Until then, be advised that your LRTs, &#967^2 tests, and F-tests ask for your statistical attention prior to their applications for any statistical inferences. These tests are not magic crystals, producing answers you are looking for. To bring such care and attentions, here&#8217;s a thought provoking titled paper that I found some years ago. </p>
<blockquote><p>
<strong> The LRT is worthless for testing a mixture when the set of parameters is large</strong><br />
JM Azaıs, E Gassiat, C Mercadier (<a href="http://www.math.u-psud.fr/~biblio/ppo/2003/ppo2003-57.html">click here</a> :I found it from internet but it seems the link was on and off and sometimes was not available.)
</p></blockquote>
<p>Here, quotes replace theorems and their proofs<sup>[1]</sup> :</p>
<ul>
<li>We prove in this paper that the LRT is worthless from testing a distribution against a two components mixture when the set of parameters is large.
</li>
<li>One knows that the traditional Chi-square theory of Wilks[16<sup>[2]</sup>] does not apply to derive the asymptotics of the LRT due to a lack of <u>identifiability</u> of the alternative under the null hypothesis.
</li>
<li>&#8230;for unbounded sets of parameters, the LRT statistic tends to infinity in probability, as Hartigan[7<sup>[3]</sup>] first noted for normal mixtures.
</li>
<li>&#8230;the LRT cannot distinguish the null hypothesis (single gaussian) from any contiguous alternative (gaussian mixtures). In other words, the LRT is worthless<sup>[4]</sup>.
</li>
</ul>
<p>For astronomers, the large set of parameters are of no concern due to theoretic constraints from physics. Experiences and theories bound the set of parameters small. Sometimes, however, the distinction between small and large sets can be vague. </p>
<p>The characteristics of the LRT is well established under the compactness set assumption (either compact or bounded) but troubles happen when the limit goes to the boundary. As cited before in the slog a few times, readers are recommend to read for more rigorously manifested ideas from astronomy about the LRT, <a href="http://adsabs.harvard.edu/abs/2002ApJ...571..545P"> Protassov, et.al. (2002) Statistics, Handle with Care: Detecting Multiple Model Components with the Likelihood Ratio Test, ApJ, 571, p. 545</a></p>
<ol class="footnotes"><li id="footnote_0_214" class="footnote">Readers might want to look for mathematical statements and proofs from the paper</li><li id="footnote_1_214" class="footnote">S.S.Wilks. The large sample distribution of the likelihood ratio for testing composite hypothesis, Ann. Math. Stat., 9:60-62, 1938</li><li id="footnote_2_214" class="footnote">J.A.Hartigan, A failure of likelihood asymptotics for normal mixtures. in Proc. Berkeley conf. Vol. II, pp.807-810</li><li id="footnote_3_214" class="footnote">Comment to Theorem 1. They proved the lack of worth in the LRT under more general settings, see Theoremm 2</li></ol>]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2008/the-lrt-is-worthless-for/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>[ArXiv] Astronomy Job Market in US</title>
		<link>http://groundtruth.info/AstroStat/slog/2007/arxiv-astronomy-job-market-in-us/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2007/arxiv-astronomy-job-market-in-us/#comments</comments>
		<pubDate>Fri, 21 Dec 2007 17:47:59 +0000</pubDate>
		<dc:creator>hlee</dc:creator>
				<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Cross-Cultural]]></category>
		<category><![CDATA[Misc]]></category>
		<category><![CDATA[arXiv]]></category>
		<category><![CDATA[astrostatistician]]></category>
		<category><![CDATA[correlation]]></category>
		<category><![CDATA[job market]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/2007/arxiv-astronomy-job-market-in-us/</guid>
		<description><![CDATA[It&#8217;s a report about the job market in US. 

[astro-ph:0712.2820]  The Production Rate and Employment of Ph.D. Astronomers  T.S. Metcalfe


Related Comments:

  Much more jobs than I expected. However, it cannot compete with jobs in Statistics. 
  Three jobs before having a stable one in astronomy. Do not know in statistics. 
 [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s a report about the job market in US. </p>
<blockquote><p>
<a href="http://arxiv.org/abs/0712.2820">[astro-ph:0712.2820]</a> <strong> The Production Rate and Employment of Ph.D. Astronomers </strong> T.S. Metcalfe
</p></blockquote>
<p><span id="more-205"></span></p>
<p>Related Comments:</p>
<ol>
<li>  Much more jobs than I expected. However, it cannot compete with jobs in Statistics. </li>
<li>  Three jobs before having a stable one in astronomy. Do not know in statistics. </li>
<li>  Astronomy Ph.D. students receive more cares, in a sense that the job market is controlled to guarantee a position for every student. In statistics, without care you can find something (not necessary a research position).
</ol>
<p>Unrelated Comment on Correlation:<br />
It&#8217;s a cultural difference. Maybe not. When I learned correlation years ago from a textbook, the procedure is, 1. compute the correlation and 2. do a t-test. In astronomical papers, 1. do regression and 2. plot the simple linear regression line with error bands and data points. The computing procedure is same but the way illustrates the results seems different. </p>
<p><i> I wonder what would it be like when we narrow the job market for astrostatisticians. </i></p>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2007/arxiv-astronomy-job-market-in-us/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Arrogant?</title>
		<link>http://groundtruth.info/AstroStat/slog/2007/arrogant/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2007/arrogant/#comments</comments>
		<pubDate>Fri, 07 Sep 2007 00:47:14 +0000</pubDate>
		<dc:creator>hlee</dc:creator>
				<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Cross-Cultural]]></category>
		<category><![CDATA[Misc]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/2007/arrogant/</guid>
		<description><![CDATA[I once talked about the relationship between astronomers and statisticians in the slog posting Data Doctors. To astronomers, statisticians are assistants. Statisticians are just helping astronomical data analysis with statistically limited eyes. Less frequently statistical improvements and modification occurred in the astronomical society through collaborations with statisticians compared to other fields.

Not all agree with it. [...]]]></description>
			<content:encoded><![CDATA[<p>I once talked about the relationship between astronomers and statisticians in the slog posting <a href="http://groundtruth.info/AstroStat/slog/2007/data-doctors/">Data Doctors.</a> To astronomers, statisticians are assistants. Statisticians are just helping astronomical data analysis with statistically limited eyes. Less frequently statistical improvements and modification occurred in the astronomical society through collaborations with statisticians compared to other fields.<br />
<span id="more-137"></span></p>
<p>Not all agree with it. I myself changed this biased opinion to the lesser degree. However, I still think astronomers appear to be  full of pride from the eyes of non astronomers.</p>
<p>Having educations from both fields, while learning statistics and being away from astronomy for some years, I&#8217;ve tried to talk astronomers if I have some chance. For years I couldn&#8217;t get rid of the impression that astronomers are arrogant in a sense that they believe they can do statistics without the help of statisticians. When I was looking for a job as statistician in astronomy, most of astronomers said they only need technicians who are good at coding algorithms including statistical ones. Their behind implication looked as only astronomers do statistics and statistical data analysis with astronomical data.</p>
<p>However, as mentioned above, my opinion has been changed by some degree after coming to <a href="http://www.cfa.harvard.edu">CfA</a> and working with <a href="http://hea-www.harvard.edu/AstroStat/index.html">CHASC.</a> The reasons for this change are:</p>
<ul>
<li> Assistants can serve arrogant chiefs (this is for a laugh).</li>
<li> When you work with astronomers, you don&#8217;t feel they are arrogant at all. Astronomers are very passionate about what they are doing and have very pure minds. The passion and objectivity of astronomers is different from that of statisticians, which gave a little barrier me talking to astronomers, particularly when I was looking for a job.</li>
<li> Most importantly, astronomers know better in their data, in general very expensive, and the data peculiarity, partially discussed in <a href="http://groundtruth.info/AstroStat/slog/2007/quote-of-the-week-aug-31-2007/">Quote of the week, Aug. 31, 2007</a> hinders quick collaborations between two communities (Instead of explaining instruments and physics to statisticians years and years, it could be quicker that astronomers do statistics).</li>
</ul>
<p>As we separate probability, theoretical statistics, biostatistics, spatial statistics, bioinformatics, applied statistics, data mining, machine learning, etc, I hope astrostatistics could share its position equivalently with other subfields in the statistics community.  I hope more astronomers explain astronomy to statisticians with patience. Eventually, the impression of proud astronomers will die and many statistics for improved estimation and inference will born.</p>
<p>[Addendum] A few senior statisticians, in a casual fashion, expressed that my interests in astrostatistics is reckless.</p>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2007/arrogant/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>[ArXiv] NGC 6397 Deep ACS Imaging, Aug. 29, 2007</title>
		<link>http://groundtruth.info/AstroStat/slog/2007/arxiv-ngc-6397-deep-acs-imaging/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2007/arxiv-ngc-6397-deep-acs-imaging/#comments</comments>
		<pubDate>Wed, 05 Sep 2007 06:26:20 +0000</pubDate>
		<dc:creator>hlee</dc:creator>
				<category><![CDATA[Astro]]></category>
		<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[Imaging]]></category>
		<category><![CDATA[Jargon]]></category>
		<category><![CDATA[Stars]]></category>
		<category><![CDATA[arXiv]]></category>
		<category><![CDATA[ACS]]></category>
		<category><![CDATA[chi-square]]></category>
		<category><![CDATA[CMD]]></category>
		<category><![CDATA[empirical method]]></category>
		<category><![CDATA[globular cluster]]></category>
		<category><![CDATA[LF]]></category>
		<category><![CDATA[MF]]></category>
		<category><![CDATA[NGC]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/2007/arxiv-ngc-6397-deep-acs-imaging/</guid>
		<description><![CDATA[From arxiv/astro-ph:0708.4030v1
 Deep ACS Imaging in the Globular Cluster NGC 6397: The Cluster Color Magnitude Diagram and Luminosity Function by H.B. Richer et.al
This paper presented an observational study of a globular cluster, named NGC 6397, enhanced and more informative compared to previous observations in a sense that 1) a truncation in the white dwarf cooling [...]]]></description>
			<content:encoded><![CDATA[<p>From <a href="http://www.arxiv.org/abs/0708.4030">arxiv/astro-ph:0708.4030v1</a><br />
<strong> Deep ACS Imaging in the Globular Cluster NGC 6397: The Cluster Color Magnitude Diagram and Luminosity Function</strong> by H.B. Richer et.al</p>
<p>This paper presented an observational study of a globular cluster, named <a href="http://x.astrogeek.org/observations/log.php?object_id=217">NGC 6397</a>, enhanced and more informative compared to previous observations in a sense that 1) a truncation in the white dwarf cooling sequence occurs at 28 magnitude, 2) the cluster main sequence seems to terminate approximately at the hydrogen-burning limit predicted by two independent stellar evolution models, and 3) luminosity functions (LFs) or mass functions (MFs) are well defined. Nothing statistical, but the idea of defining color magnitude diagrams (CMDs) and LFs described in the paper, will assist developing suitable statistics on CMD and LF fitting problems in addition to the improved measurements (ACS imaging) of stars in NGC 6397.<br />
<span id="more-133"></span></p>
<p>Instead of adding details of data properties and calibration process including the instrument characteristics, I like to add a few things for statisticians: First, ACS stands for Advance Camera of Surveys and its information can be found at <a href="http://www.stsci.edu/hst/acs/">this link</a>. Second, NGC is an abbreviation of New General Catalogue, one of astronomers&#8217; cataloging systems (<a href="http://en.wikipedia.org/wiki/New_General_Catalogue">click for its wiki</a>). Third,  CMDs and LFs are results of data processing, described in the paper, but can be considered as scatter plots and kernel density plots (histograms) to be analyzed for inferencing physical parameters. This data processing, or calibration requires multi-level transformations, which cause error propagation. Finally, the chi-square method is incorporated to fit LFs and MFs. Among numerous fitting methods, in astronomy, only the chi-square is ubiquitously used (<a href="http://groundtruth.info/AstroStat/slog/2007/what-is-so-special-about-chi-square-in-astronomy/">link to a discussion on the chi-square</a>). <strong>Could we develop more robust statistics for fitting astronomical (empirical) functions?</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2007/arxiv-ngc-6397-deep-acs-imaging/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Quote of the Week, Aug 31, 2007</title>
		<link>http://groundtruth.info/AstroStat/slog/2007/quote-of-the-week-aug-31-2007/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2007/quote-of-the-week-aug-31-2007/#comments</comments>
		<pubDate>Sat, 01 Sep 2007 03:47:06 +0000</pubDate>
		<dc:creator>aconnors</dc:creator>
				<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Cross-Cultural]]></category>
		<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[Imaging]]></category>
		<category><![CDATA[MC]]></category>
		<category><![CDATA[Quotes]]></category>
		<category><![CDATA[Stat]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/2007/quote-of-the-week-aug-31-2007/</guid>
		<description><![CDATA[<a href="http://www.ics.uci.edu/~dvd/images/meinlondon-sml.jpg"><img src="http://www.ics.uci.edu/~dvd/images/meinlondon-sml.jpg" height="48" /></a><a href="http://www.ics.uci.edu/~dvd/">David van Dyk</a> (representing statistics culture):<blockquote>My assertion is that I find replicated results more convincing than extreme p-values. And the controversial part: Astronomers should aim for replication rather than worry about 5-sigma.</blockquote>]]></description>
			<content:encoded><![CDATA[<p>Once again, the middle of a recent (Aug 30-31, 2007) argument within <a href="http://www.ics.uci.edu/~dvd/astrostat.html">CHASC</a>, on why physicists and astronomers view &#8220;3 sigma&#8221; results with suspicion and expect (roughly) &gt; 5 sigma; while statisticians and biologists typically assume 95% is OK:</p>
<p><a href="http://www.ics.uci.edu/~dvd/images/meinlondon-sml.jpg"><img src="http://www.ics.uci.edu/~dvd/images/meinlondon-sml.jpg" height="48" /></a><a href="http://www.ics.uci.edu/~dvd/">David van Dyk</a> (representing statistics culture):</p>
<blockquote><p>Can&#8217;t you look at it again? Collect more data?</p></blockquote>
<p><a href="http://www.hcs.harvard.edu/~badmintn/members/profiles/vkashyap.jpg"><img src="http://www.hcs.harvard.edu/~badmintn/members/profiles/vkashyap.jpg" height="48" /></a> <a href="http://hea-www.harvard.edu/~kashyap/">Vinay Kashyap</a> (representing astronomy and physics culture):</p>
<blockquote><p>&#8230;I can confidently answer this question: no, alas, we usually cannot look at it again!!</p></blockquote>
<blockquote><p>Ah. Hmm. To rephrase [the question]: if you have a &#8220;7.5 sigma&#8221; feature, with a day-long [imaging Markov Chain Monte Carlo] run you can only show that it is &#8220;&gt;3sigma&#8221;, but is it possible, even with that day-long run, to tell that the feature is <bold>really</bold> at 7.5sigma &#8212; is that the question? Well that would be nice, but I don&#8217;t understand how observing again will help?</p></blockquote>
<p><a href="http://www.ics.uci.edu/~dvd/images/meinlondon-sml.jpg"><img src="http://www.ics.uci.edu/~dvd/images/meinlondon-sml.jpg" height="48" /></a><a href="http://www.ics.uci.edu/~dvd/">David van Dyk</a> :</p>
<blockquote><p>No one believes any realistic test is properly calibrated that far into the tail. Using 5-sigma is really just a high bar, but the precise calibration will never be done. (This is a reason not to sweet the computation TOO much.)</p>
<p>Most other scientific areas set the bar lower (2 or 3 sigma) BUT don&#8217;t really believe the results unless they are replicated.</p>
<p>My assertion is that I find replicated results more convincing than extreme p-values. And the controversial part: Astronomers should aim for replication rather than worry about 5-sigma.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2007/quote-of-the-week-aug-31-2007/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>[ArXiv] Numerical CMD analysis, Aug. 28th, 2007</title>
		<link>http://groundtruth.info/AstroStat/slog/2007/arxiv-numerical-cmd-analysis/</link>
		<comments>http://groundtruth.info/AstroStat/slog/2007/arxiv-numerical-cmd-analysis/#comments</comments>
		<pubDate>Fri, 31 Aug 2007 01:36:38 +0000</pubDate>
		<dc:creator>hlee</dc:creator>
				<category><![CDATA[Astro]]></category>
		<category><![CDATA[CHASC]]></category>
		<category><![CDATA[Fitting]]></category>
		<category><![CDATA[Quotes]]></category>
		<category><![CDATA[Stars]]></category>
		<category><![CDATA[arXiv]]></category>
		<category><![CDATA[CMD]]></category>
		<category><![CDATA[globular cluster]]></category>
		<category><![CDATA[isochrone]]></category>
		<category><![CDATA[Padova]]></category>
		<category><![CDATA[satellite]]></category>
		<category><![CDATA[SDSS]]></category>
		<category><![CDATA[synthetic]]></category>

		<guid isPermaLink="false">http://groundtruth.info/AstroStat/slog/2007/arxiv-numerical-cmd-analysis/</guid>
		<description><![CDATA[From arxiv/astro-ph:0708.3758v1
Numerical Color-Magnitude Diagram Analysis of SDSS Data and Application to the New Milky Way Satellites by  J. T. A. de Jong  et. al.
The authors applied MATCH (Dolphin, 2002[1] -note that the year is corrected) to M13, M15, M92, NGC2419, NGC6229, and Pal14 (well known globular clusters), and BooI, BooII, CvnI, CVnII, Com, [...]]]></description>
			<content:encoded><![CDATA[<p>From <a href="http://arxiv.org/abs/0708.3758">arxiv/astro-ph:0708.3758v1</a><br />
<strong>Numerical Color-Magnitude Diagram Analysis of SDSS Data and Application to the New Milky Way Satellites</strong> by  J. T. A. de Jong  et. al.</p>
<p>The authors applied <a href="http://adsabs.harvard.edu/abs/2002MNRAS.332...91D">MATCH</a> (Dolphin, 2002<sup>[1]</sup> -note that the year is corrected) to M13, M15, M92, NGC2419, NGC6229, and Pal14 (well known globular clusters), and BooI, BooII, CvnI, CVnII, Com, Her, LeoIV, LeoT, Segu1, UMaI, UMaII and Wil1 (newly discovered Milky Way satellites) from <a href="http://www.sdss.org">Sloan Digital Sky Survey (SDSS)</a> to fit Color Magnitude diagrams (CMDs) of these stellar clusters and find the properties of these satellites.<br />
<span id="more-126"></span></p>
<p>A traditional CMD fitting begins with building synthetic CMDs: Completeness of SDSS Data Release 5, Hess diagram (a bivariate histogram from a CMD), and features in MATCH for CMD synthesis were taken into account. The synthetic CMDs of these well known globular clusters   were utilized with the observations from SDSS and compared to previous discoveries to validate their modified MATCH for the SDSS data sets. Afterwards, their method was applied to the newly discovered Milky Way satellites and discussion on their findings of these satellites was presented.</p>
<p>The paper provides plots that enhance the understanding of age, metalicity, and other physical parameter distributions of stellar clusters after they were fit with synthetic CMDs. The paper also describes steps and tricks (to a statistician, the process of simulating stars looks very technical without a mathematical/probabilistic justification) to acquire proper synthetic CMDs that match observations. The paper adopted <a href="http://pleiadi.pd.astro.it/">Padova database of stellar evolutionary tracks and isochrones</a> (there are other databases beyond Padova).</p>
<p>At last, I&#8217;d like to add a sentence from their paper, which supports my idea that a priori knowledge in choosing a proper isochrone database is necessary.</p>
<blockquote><p>In the case of M15, this is due to the blue horizontal branch (BHB) stars that are not properly reproduced by the theoretical isochrones, causing the code to fit them as a younger turn-off.</p></blockquote>
<ol class="footnotes"><li id="footnote_0_126" class="footnote"><a href="http://adsabs.harvard.edu/abs/2002MNRAS.332...91">Numerical methods of star formation history measurement and applications to seven dwarf spheroidals,</a>Dolphin (2002), MNRAS, 332, p. 91</li></ol>]]></content:encoded>
			<wfw:commentRss>http://groundtruth.info/AstroStat/slog/2007/arxiv-numerical-cmd-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

