Aug 27th, 2008| 02:35 pm | Posted by hlee
I didn’t realize this post was sitting for a month during which I almost neglected the slog. As if great books about probability and information theory for statisticians and engineers exist, I believe there are great statistical physics books for physicists. On the other hand, relatively less exist that introduce one subject to the other kind audience. In this regard, I thought the lecture note can be useful.
[arxiv:physics.data-an:0808.0012]
Lectures on Probability, Entropy, and Statistical Physics by Ariel Caticha
Abstract:
Continue reading ‘A lecture note of great utility’ »
Tags:
Bayes Theorem,
Boltzmann,
Carnot,
Entropy,
Gibbs paradox,
Information,
laws of thermodynamics,
lecture note,
maximum likelihood,
probability,
Shannon,
statistical physics,
Tchebyshev inequality,
thermodynamics Category:
Bayesian,
Cross-Cultural,
Data Processing,
Fitting,
Physics,
Stat,
arXiv |
Comment
Jul 9th, 2008| 01:00 pm | Posted by vlk
The Kaplan-Meier (K-M) estimator is the non-parametric maximum likelihood estimator of the survival probability of items in a sample. “Survival” here is a historical holdover because this method was first developed to estimate patient survival chances in medicine, but in general it can be thought of as a form of cumulative probability. It is of great importance in astronomy because so much of our data are limited and this estimator provides an excellent way to estimate the fraction of objects that may be below (or above) certain flux levels. The application of K-M to astronomy was explored in depth in the mid-80’s by Jurgen Schmitt (1985, ApJ, 293, 178), Feigelson & Nelson (1985, ApJ 293, 192), and Isobe, Feigelson, & Nelson (1986, ApJ 306, 490). [See also Hyunsook's primer.] It has been coded up and is available for use as part of the ASURV package. Continue reading ‘Kaplan-Meier Estimator (Equation of the Week)’ »
Tags:
censored,
EotW,
Equation,
Equation of the Week,
Feigelson,
Isobe,
Kaplan-Meier,
maximum likelihood,
Nelson,
Schmitt,
survival analysis,
upper limit Category:
Frequentist,
Jargon,
Methods,
Stat |
13 Comments
Jun 8th, 2008| 09:45 pm | Posted by hlee
Despite no statistic related discussion, a paper comparing XSPEC and ISIS, spectral analysis open source applications might bring high energy astrophysicists’ interests this week. Continue reading ‘[ArXiv] 1st week, June 2008’ »
Tags:
black box,
catalog,
CMB,
confidence interval,
EGRET,
ICA,
ISIS,
maximum likelihood,
radio,
sample size,
student t,
XSPEC Category:
Data Processing,
High-Energy,
Methods,
Stat,
arXiv,
gamma-ray |
Comment
May 26th, 2008| 02:59 pm | Posted by hlee
Tags:
clustering,
high dimension,
LF,
maximum likelihood,
multivariate,
Poisson,
Schechter,
zero count Category:
Bayesian,
Fitting,
MCMC,
Methods,
Stat,
arXiv |
Comment
May 19th, 2008| 10:42 am | Posted by hlee
There’s no particular opening remark this week. Only I have profound curiosity about jackknife tests in [astro-ph:0805.1994]. Including this paper, a few deserve separate discussions from a statistical point of view that shall be posted. Continue reading ‘[ArXiv] 2nd week, May 2008’ »
Tags:
bimodality,
bootstrap,
calibration uncertainty,
CF,
Classification,
CMB,
dip,
exoplanet,
Fisher matrix,
flare,
GL,
jackknife,
KS test,
marked point,
maximum likelihood,
MLE,
poisson point process,
spatial data,
XLF Category:
Frequentist,
Uncertainty,
X-ray,
arXiv |
Comment
Apr 27th, 2008| 11:29 am | Posted by hlee
The last paper in the list discusses MCMC for time series analysis, applied to sunspot data. There are six additional papers about statistics and data analysis from the week. Continue reading ‘[ArXiv] 4th week, Apr. 2008’ »
Tags:
clusters,
CMB,
GALEX,
gravitaional waves,
lensing,
LF,
LMC,
machine learning,
maximum likelihood,
priors,
probability,
SDSS,
stellar populations,
sunspot,
time series Category:
MCMC,
arXiv |
Comment
Apr 20th, 2008| 09:05 pm | Posted by hlee
The dichotomy of outliers; detecting outliers to be discarded or to be investigated; statistics that is robust enough not to be influenced by outliers or sensitive enough to alert the anomaly in the data distribution. Although not related, one paper about outliers made me to dwell on what outliers are. This week topics are diverse. Continue reading ‘[ArXiv] 3rd week, Apr. 2008’ »
Tags:
background,
bootstrap,
calibration errors,
Cash statistics,
clusters,
CMB,
corona,
edge detection,
FFT,
gravitational lens,
maximum likelihood,
multiscale,
neural network,
outlier,
SDSS,
sunspot,
systematic errors,
topology,
WMAP,
XMM-Newton Category:
High-Energy,
MCMC,
arXiv |
Comment
Apr 11th, 2008| 02:21 am | Posted by hlee
Markov chain Monte Carlo became the most frequent and stable statistical application in astronomy. It will be useful collecting tutorials from both professions. Continue reading ‘[ArXiv] 2nd week, Apr. 2008’ »
Tags:
Classification,
GRB,
Hubble constant,
K-S test,
kurtosis,
mask,
maximum likelihood,
SDSS,
skewness,
Solar Oscillation,
Vicent Martinez Category:
Bayesian,
MCMC,
Methods,
Stat,
arXiv |
3 Comments
Feb 19th, 2008| 10:15 pm | Posted by hlee
I was reading [1]. I must say that I do not know Bayesian methods to cope with model misspecification, tests with an unknown true model, or tests for non-nested hypotheses except Bayes factor (concerns a lot how to choose priors). Nonetheless, the zeal among economists to test non-nested models might assist astronomers to move forward beyond testing nested hypotheses with F statistic. Continue reading ‘Non-nested hypothesis tests’ »
Nov 2nd, 2007| 05:59 pm | Posted by hlee
To be exact, the title of this posting should contain 5th week, Oct, which seems to be the week of EGRET. In addition to astro-ph papers, although they are not directly related to astrostatistics, I include a few statistics papers which may be profitable for astronomical data analysis. Continue reading ‘[ArXiv] 1st week, Nov. 2007’ »
Tags:
bootstrap,
EGRET,
Fisher information,
Laplace transform,
maximum likelihood,
PCA,
PDF,
Poisson,
Ratio,
Uncertainty,
variance Category:
arXiv |
1 Comment
Oct 30th, 2007| 03:37 am | Posted by hlee
From arxiv/astro-ph:0705.4199v1
In search of an unbiased temperature estimator for statistically poor X-ray spectra
A. Leccardi and S. Molendi
There was a delay of writing about this paper, which by accident was lying under the pile of papers irrelevant to astrostatistics. (It has been quite overwhelming to track papers with various statistical applications and papers with rooms left for statistical improvements from arxiv:astro-ph). Although there is a posting about this paper (see Vinay’s posting), I’d like to give a shot. I was very excited because I haven’t seen any astronomical papers discussing unbiased estimators solely.
Continue reading ‘[ArXiv] An unbiased estimator, May 29, 2007’ »
Tags:
chi-square,
maximum likelihood,
mixing distribution,
mixture,
nonparametric,
robust,
subsampling,
transformation,
unbiased,
Uncertainty Category:
Frequentist,
Stat,
arXiv |
Comment
Aug 19th, 2007| 11:35 pm | Posted by hlee
One of the most frequently cited papers in model selection would be An Asymptotic Equivalence of Choice of Model by Cross-Validation and Akaike’s Criterion by M. Stone, Journal of the Royal Statistical Society. Series B (Methodological), Vol. 39, No. 1 (1977), pp. 44-47.
(Akaike’s 1974 paper, introducing Akaike Information Criterion (AIC), is the most often cited paper in the subject of model selection).
Continue reading ‘Cross-validation for model selection’ »
Tags:
AIC,
Cash statistics,
cross-validation,
exponential family,
Fisher information,
maximum likelihood,
Model Selection,
resampling,
score,
TIC Category:
Algorithms,
Frequentist,
Methods,
Stat,
arXiv |
5 Comments
Aug 17th, 2007| 06:15 pm | Posted by hlee
From arxiv/math.st:0708.2153v1
Estimating the number of classes by Mao and Lindsay
This study could be linked to identifying the number of lines from Poisson nature x-ray count data, one of the key interests for astronomers. However, as pointed by the authors, estimating the numbers of classes is a difficult statistical problem. I.J.Good[] said that
I don’t believe it is usually possible to estimate the number of species, but only an appropriate lower bound to that number. This is because there is nearly always a good chance that there are a very large number of extremely rare species.
Continue reading ‘[ArXiv] Poisson Mixture, Aug. 16, 2007’ »
Jun 19th, 2007| 05:00 pm | Posted by hlee
From arxiv/astro-ph, arXiv:0706.2590v1 Extreme Value Theory and the Solar Cycle by Ramos, A. This paper might drag a large attention from CHASC members.
Continue reading ‘[ArXiv] Solar Cycle, June 18, 2007’ »