Astronomical research is being transformed by the proliferation of publicly available online datasets from all types of telescopes, and a large international effort is already underway to federate these diverse datasets for ready use by astronomers. A particularly important class of data arises from multi-epoch wide-field surveys, which are essentially 'movies' of the sky. These advances in time domain astronomy are crucial for research topics as diverse as exoplanet discovery, supernovae and other transients, variable stars, and accretion phenomena. However, most astronomers use only a narrow range of classical statistical methods to interpret these large datasets. This limitation can now be alleviated with the R statistical computing environment and its rapidly growing collection of CRAN add-on packages. This project will bring R's software capabilities to the astronomical research community and introduce specialized astrostatistical methodology into R.
The research comprises two complementary projects. First, CRAN packages will be developed for the analysis of time domain data with irregularly spaced observation times. This difficulty is rarely encountered in other fields but is common in multi-epoch astronomical studies, owing to diurnal cycles, satellite orbits, survey cadence patterns, and telescope allocation limitations. Astronomers have developed a wide range of treatments for such problems, but most have not been evaluated statistically or incorporated into widely used software packages, so part of this study will be a statistical evaluation of competing methods. Second, the prototype VOStat Web service will be developed into a major tool and integrated into the growing Virtual Astronomical Observatory (VAO) software environment. VOStat will provide dozens of functionalities in many areas of applied statistics: data manipulation and visualization, nonparametric statistics and density estimation, probability density functions, regression and inference, multivariate analysis, clustering and classification, censoring and truncation, time series analysis, spatial point processes, and image processing. These developments will raise the statistical sophistication of work within the VAO and of thousands of other astronomical studies.
These software developments will improve the statistical analysis of a large number of astronomical research studies every year. Coding within R has the additional advantage of inheriting R's large infrastructure of methodology and graphics, which is itself of enormous value to the entire astronomical community. While the production of CRAN packages allows wide dissemination of the code directly, integrating the code into the VAO software environment through VOStat will make it conveniently accessible to all astronomers. A strong pedagogical component will further encourage less experienced astronomers to learn and use more advanced statistical methods. In addition, the CRAN packages on astrostatistical methods for irregular time series may be of value to statisticians, physicists, and economists who may also encounter datasets of this type.
Effective start/end date: 9/15/10 → 8/31/14
- National Science Foundation: $450,000.00