As data is collected from a growing number of sources and across more areas of life, the applications of big data to public health are expanding. This class will focus on the use of large data sets, specifically claims data and national survey sources, and provide an overview of structuring data, selecting optimal analytic approaches, and displaying results in compelling data visualization formats.
The class will be taught in SAS, though other statistical software may be used. Tableau will also be used (free trials are available).
After completing this course, participants will be able to:
- Identify large data set sources
- Describe methods for cleaning and structuring data for use
- Outline methods for identifying data analysis opportunities
- Review data visualization strategies to maximize reach and impact
Prerequisite knowledge: Prior experience with SAS, SPSS, Stata, R, or other statistical programming software is required for this course.