Archive for the ‘Methods’ Category.
Jun 3rd, 2008| 02:53 am | Posted by vlk
It is somewhat surprising that astronomers haven’t cottoned on to Lowess curves yet. That’s probably a good thing because I think people already indulge in smoothing far too much for their own good, and Lowess makes for a very powerful hammer. But the fact that it is semi-parametric and is based on polynomial least-squares fitting does make it rather attractive.
And, of course, sometimes it is unavoidable, or so I told Brad W. When one has too many points for a regular polynomial fit, and they are too scattered for a spline, and too few to try a wavelet “denoising”, and no real theoretical expectation of any particular model function, and all one wants is “a smooth curve, damnit”, then Lowess is just the ticket.
Well, almost.
There is one major problem — how does one figure what the error bounds are on the “best-fit” Lowess curve? Clearly, each fit at each point can produce an estimate of the error, but simply collecting the separate errors is not the right thing to do because they would all be correlated. I know how to propagate Gaussian errors in boxcar smoothing a histogram, but this is a whole new level of complexity. Does anyone know if there is software that can calculate reliable error bands on the smooth curve? We will take any kind of error model — Gaussian, Poisson, even the (local) variances in the data themselves.
Tags:
Brad Wargelin,
error bands,
error bars,
Fitting,
least-squares,
Loess,
Lowess,
polynomial,
question for statisticians,
smoothing Category:
Algorithms,
Fitting,
Methods,
Stat,
Uncertainty |
11 Comments
May 26th, 2008| 02:59 pm | Posted by hlee
Tags:
clustering,
high dimension,
LF,
maximum likelihood,
multivariate,
Poisson,
Schechter,
zero count Category:
Bayesian,
Fitting,
MCMC,
Methods,
Stat,
arXiv |
Comment
Apr 29th, 2008| 02:24 am | Posted by hlee
Scheming arXiv:astro-ph abstracts almost an year never offered me an occasion that the fit of the Poisson distribution is tested in different ways, instead it is taken for granted by plugging data and (source) model into a (modified) χ2 function. If any doubts on the Poisson distribution occur, the following paper might be useful: Continue reading ‘tests of fit for the Poisson distribution’ »
Apr 21st, 2008| 11:56 pm | Posted by hlee
Because of the extensive works by Prof. Peebles and many (observational) cosmologists (almost always I find Prof. Peeble’s book in cosmology literature), the 2 (or 3) point correlation function is much more dominant than any other mathematical and statistical methods to understand the structure of the universe. Unusually, this week finds an astro-ph paper written by a statistics professor addressing the K-function to explore the mystery of the universe.
[astro-ph:0804.3044] J.M. Loh
Estimating Third-Order Moments for an Absorber Catalog
Continue reading ‘[ArXiv] Ripley’s K-function’ »
Apr 11th, 2008| 02:21 am | Posted by hlee
Markov chain Monte Carlo became the most frequent and stable statistical application in astronomy. It will be useful collecting tutorials from both professions. Continue reading ‘[ArXiv] 2nd week, Apr. 2008’ »
Tags:
Classification,
GRB,
Hubble constant,
K-S test,
kurtosis,
mask,
maximum likelihood,
SDSS,
skewness,
Solar Oscillation,
Vicent Martinez Category:
Bayesian,
MCMC,
Methods,
Stat,
arXiv |
3 Comments
Mar 12th, 2008| 03:32 pm | Posted by hlee
Astrometry.net, a cool website I heard from Harvard Astronomy Professor Doug Finkbeiner’s class (Principles of Astronomical Measurements), does a complex job of matching your images of unknown locations or coordinates to sources in catalogs. By providing your images in various formats, they provide astrometric calibration meta-data and lists of known objects falling inside the field of view. Continue reading ‘Astrometry.net’ »
Mar 5th, 2008| 04:46 pm | Posted by hlee
This is a quite long paper that I separated from [Arvix] 4th week, Feb. 2008:
[astro-ph:0802.3916] P. Carvalho, G. Rocha, & M.P.Hobso
A fast Bayesian approach to discrete object detection in astronomical datasets - PowellSnakes I
As the title suggests, it describes Bayesian source detection and provides me a chance to learn the foundation of source detection in astronomy. Continue reading ‘[ArXiv] A fast Bayesian object detection’ »
Tags:
Bayesian evidence,
coloured background,
CRLB,
decision theory,
filter,
Fisher informatoin,
likelihood,
PowellSnake,
prior,
simulated annealing,
SNR,
source detection,
state space,
Sunyaev-Zel'dovich effect,
symmetric loss,
templates Category:
Algorithms,
Bayesian,
Cross-Cultural,
Data Processing,
Fitting,
Frequentist,
MCMC,
Methods,
Objects,
arXiv |
Comment
Feb 19th, 2008| 10:15 pm | Posted by hlee
I was reading [1]. I must say that I do not know Bayesian methods to cope with model misspecification, tests with an unknown true model, or tests for non-nested hypotheses except Bayes factor (concerns a lot how to choose priors). Nonetheless, the zeal among economists to test non-nested models might assist astronomers to move forward beyond testing nested hypotheses with F statistic. Continue reading ‘Non-nested hypothesis tests’ »
Jan 30th, 2008| 02:33 am | Posted by hlee
Astronomers have developed their ways of processing signals almost independent to but sometimes collaboratively with engineers, although the fundamental of signal processing is same: extracting information. Doubtlessly, these two parallel roads of astronomers’ and engineers’ have been pointing opposite directions: one toward the sky and the other to the earth. Nevertheless, without an intensive argument, we could say that somewhat statistics has played the medium of signal processing for both scientists and engineers. This particular issue of IEEE signal processing magazine may shed lights for astronomers interested in signal processing and statistics outside the astronomical society.
IEEE Signal Processing Magazine Jul. 2007 Vol 24 Issue 4: Bootstrap methods in signal processing
This link will show the table of contents and provide links to articles; however, the access to papers requires IEEE Xplore subscription via libraries or individual IEEE memberships). Here, I’d like to attempt to introduce some articles and tutorials.
Continue reading ‘Signal Processing and Bootstrap’ »
Tags:
bootstrap,
compressive sensing,
confidence interval,
GLM,
IEEE,
jacknife,
machine learning,
multitaper estimate,
particle filter,
signal processing,
statistical inference,
Tutorial,
wavelet Category:
Algorithms,
Bayesian,
Cross-Cultural,
Fitting,
Frequentist,
MC,
MCMC,
Methods,
Misc,
Spectral,
Stat,
Uncertainty,
arXiv |
Comment
Sep 12th, 2007| 04:31 pm | Posted by hlee
From arxiv/astro-ph:0709.1359,
A robust morphological classification of high-redshift galaxies using support vector machines on seeing limited images. I Method description by M. Huertas-Company et al.
Machine learning and statistical learning become more and more popular in astronomy. Artificial Neural Network (ANN) and Support Vector Machine (SVM) are hardly missed when classifying on massive survey data is the objective. The authors provide a gentle tutorial on SVM for galactic morphological classification. Their source code GALSVM is linked for the interested readers.
Continue reading ‘[ArXiv] SVM and galaxy morphological classification, Sept. 10, 2007’ »
Sep 11th, 2007| 03:38 am | Posted by hlee
In the past month, I’ve noticed relatively frequent paper appearance in arxiv/astro-ph whose title includes Bayesian or Markov Chain Monte Carlo (MCMC). Those papers are:
- [astro-ph:0709.1058v1] Joint Bayesian Component Separation and CMB Power Spectrum Estimation by H.K.Eriksen et. al.
- [astro-ph:0709.1104v1] Monolithic or hierarchical star formation? A new statistical analysis by M. Kampakoglou, R. Trotta, and J. Silk
- [astro-ph:0411573v2] A Bayesian analysis of the primordial power spectrum by M.Bridges, A.N.Lasenby, M.P.Hobson
- [astro-ph:0709.0596v1] Bayesian inversion of Stokes profiles by A. A. Ramos, M.J.M. Gonzales, and J.A. Rubino-Martin
- [astro-ph:0709.0711v1] Bayesian posterior classification of planetary nebulae according to the Peimbert types by C. Quireza, H.J.Rocha-Pinto, and W.J. Maciel
- [astro-ph:0708.2340v1]
Bayesian Galaxy Shape Measurement for Weak Lensing Surveys -I. Methodology and a Fast Fitting Algorithm by L. Miller et. al.
- [astro-ph:0708.1871v1] Dark energy and cosmic curvature: Monte-Carlo Markov Chain approach by Y. Gong et. al.
Continue reading ‘[ArXiv] Recent bayesian studies from astro-ph’ »
Sep 11th, 2007| 01:12 am | Posted by hlee
From arxiv/astro-ph:0708.1208v1:
The measurement errors in the Swift-UVOT and XMM-OM by N.P.M. Kuin and S.R. Rosen
The probability distribution of photon counts from the Optical Monitor on XMM Newton satellite (XMM-OM) and the UVOT on the Swift satellite follows a binomial distribution due to detector characteristics. Incident count rate was derived as a function of the measured count rate, which was shown to follow a binomial distribution.
Continue reading ‘[ArXiv] Swift and XMM measurement errors, Sep. 8, 2007’ »
Tags:
binomial,
measurement error,
photon count,
Swift,
UVOT,
XMM Category:
Astro,
Data Processing,
Fitting,
Methods,
Uncertainty,
arXiv |
1 Comment
Aug 23rd, 2007| 11:08 pm | Posted by aconnors
These are from two lively CHASC discussions on classification, or cluster analysis. The first was on Feb 7, 2006; the continuation on Dec 12, 2006, at the Harvard Statistics Department, as part of Stat 310 .
David van Dyk:
Don’t demand too much of the classes. You’re not going to say that all events can be well-classified…. It’s more descriptive. It gives you places to look. Then you look at your classes.
Xiao Li Meng:
Then you’re saying the cluster analysis is more like -
David van Dyk:
It’s really like you have a propsal for classes. You then investigate the physical processes more thoroughly. You may have classes that divide it [up]
……
David van Dyk:
But it can make a difference, where you see the clusters, depending on your [parameter] transformation.You can squish the white spaces, and stretch out the crowded spaces; so it can change where you think the clusters are.
Aneta Siemignowska:
But that is interesting.
Andreas Zezas:
Yes, that is very interesting.
These are particularly in honor of Hyunsook Lee’s recent posting of Chattopadhyay et. al.’s new work about possible intrinsic classes of gamma-ray bursts. Are they really physical classes — or do they only appear to be distinct clusters because we view them through the “squished” lens (parameter spaces) of our imperfect instruments?
Aug 19th, 2007| 11:35 pm | Posted by hlee
One of the most frequently cited papers in model selection would be An Asymptotic Equivalence of Choice of Model by Cross-Validation and Akaike’s Criterion by M. Stone, Journal of the Royal Statistical Society. Series B (Methodological), Vol. 39, No. 1 (1977), pp. 44-47.
(Akaike’s 1974 paper, introducing Akaike Information Criterion (AIC), is the most often cited paper in the subject of model selection).
Continue reading ‘Cross-validation for model selection’ »
Tags:
AIC,
Cash statistics,
cross-validation,
exponential family,
Fisher information,
maximum likelihood,
Model Selection,
resampling,
score,
TIC Category:
Algorithms,
Frequentist,
Methods,
Stat,
arXiv |
5 Comments
Aug 19th, 2007| 12:31 am | Posted by vlk
I think of Markov-Chain Monte Carlo (MCMC) as a kind of directed staggering about, a random walk with a goal. (Sort of like driving in Boston.) It is conceptually simple to grasp as a way to explore the posterior probability distribution of the parameters of interest by sampling only where it is worth sampling from. Thus, a major savings from brute force Monte Carlo, and far more robust than downhill fitting programs. It also gives you the error bar on the parameter for free. What could be better? Continue reading ‘An alternative to MCMC?’ »