370 likes | 446 Vues
Dive into the world of data analysis and master time series analysis techniques, including Fourier and Wavelet methods. Learn how to interpret data value fluctuations, identify periods and frequencies, pre-whiten data, and choose the right analysis tools. Discover the significance of null hypothesis, avoid aliasing issues, and enhance your analytical skills to unlock valuable insights from your data.
E N D
Data Analysis Do it yourself!
What to do with your data? • Report it to professionals (e.g., AAVSO) • Excellent! A real service to science; don’t neglect this • Publish observations (e.g., JAAVSO) • Analyze it – yourself!
But … • I’m not a mathematician • Let the computer do the math • I’m not a programmer • Get programs from the net (often free) • I don’t know how to use or interpret them • Neither do the pros! • Practice, practice, practice …
Time Series Analysis • A time series is a set of data pairs • t is the time, x is the data value • Usually, times are assumed error-free • Data = Signal + Error • x can be anthing, e.g. brightness of variables star, time of eclipse, eggs/day from a laying hen
Actual Mean = = expected value Standard deviation = expected rms difference from mean Estimated Average = estimated Sample standard deviation = estimated Basic properties of data x
Average and sample standard deviation • Average • Sample standard deviation
Method #1: world’s best • Eye + Brain: Look at the data! • Plot x as a function of t: Explore! • Scientific name: Visual Inspection • World’s best – but not infallible • Programs: • TS http://www.aavso.org • MAGPLOT http://www.aavso.org
Method #2: Fourier Analysis • Period analysis and curve-fitting • Powerful, well-understood, popular • Programs • TS http://www.aavso.org • PerAnSo http://www.peranso.com
Method #3: Wavelet Analysis • Time-frequency analysis • Old versions bad, new version good • Programs: • WWZ http://www.aavso.org • WinWWZ http://www.aavso.org
Fourier analysis for period search • Match the data to sine/cosine waves • = frequency • Period = • Amplitude = A = size of fluctuation • Obvious choice is period; mathematically sound choice is frequency
Null Hypothesis (important!) • Null hypothesis: no time variation at all • So = constant • So, • Quite important! Often neglected. Even the pros often forget this.
Is it real? • Fit produces a test statistic under the null hypothesis • Is usually “ /degree of freedom” (d) • Linear: is significant (not just by accident) at 95% confidence • 95% confidence means 5% false-alarm probability
Meaning of significance • Significance does not mean the signal is linear, sinusoidal, periodic, etc. • It only means the null hypothesis is incorrect, i.e., the signal is not constant • Important!!!
Pre-whitening • If you find a significant fit, then subtract the estimated signal, leaving residuals • Analyze the residuals for more structure • This process is called pre-whitening
How to choose frequency? • Test all reasonable values, get a “strength of fit” for each. Common is “chi-square per degree of freedom” (but there are many) • Plot frequency .vs. fit – the Fourier transform (aka periodogram, aka power spectrum)
Fourier decomposition Any periodic function of period P (frequency ) can be expressed as a Fourier series:
Fundamental + harmonics For a pure sinusoid, expect response at frequency For a general periodic signal at a given frequency, expect a fundamental component at , as well as harmonics at frequencies etc.
Lots of Fourier methods • FFT: fast Fourier transform • Not just fast: it’s wicked fast • Requires even time spacing • Requires N=integer power of 2 • Beware! • DFT: discrete Fourier transform • Applies to any time sampling, but incorrect results for highly uneven (as in astronomy!) • Beware!
Problems from uneventime sampling • Aliasing: false peaks, often from a periodic data density • Aliases at • Common in astronomy: data density have a period P = 1 yr = 365.2422 d, so • Solution: pre-whitening
Problems from uneventime sampling • Mis-calculation of frequency (slightly) and amplitude (greatly); sabotages prewhitening
Solution: better Fourier methods (for astronomy) • Lomb-Scargle modified periodogram • Improvement over FFT, DFT • CLEAN spectrum • Bigger improvement • DCDFT: date-compensated discrete Fourier transform (this is the one you want) • CLEANEST spectrum: DCDFT-like for multiple frequencies
DCDFT • Much better estimates of period, amplitude
Let’s take a look • Peranso (uses DCDFT and CLEANEST) • Available from CBA Belgium • http://www.peranso.com
Wavelets • Fit sine/cosine-like functions of brief duration • Shift them through time • Gives a time-frequency analysis
Problems • Same old same old: uneven time spacing, especially variable data density, invalidate the results • But: even worse than Fourier • Essentially useless for most astronomical data
Wavelet methods • DWT: discrete wavelet transform • Just not right for unevenly sampled data (astronomy!) • Solution: WWZ = weighted wavelet Z-transform
Data Analysis • Do it yourself • Use your eyes and brain • Healthy skepticism • tamino_9@hotmail.com • Enjoy!