EPBI 6344 Data Management for Biostatistics

This course emphasizes data management and software applications using the SAS (Statistical Analysis System) software package. It will introduce the student to SAS codes for inputting and outputting data; creating temporary and permanent data sets; creating formatted and labeled SAS data sets; merging and connecting SAS data sets; creating output using the TABULATE and REPORT procedures; debugging an SAS program that includes the TABULATE, REPORT and SQL procedures; using characteristic functions in SAS; and using a random number generator, probability distributions, arrays and date and time functions. Students will also write a simple and complex query using the SQL procedure; create, populate and modify a set of tables/views using the SQL procedure; and create an SAS program which includes one or more macros. This course will cover basic relational database design and descriptive statistics in SAS. Particular focus is on applications pertaining to public health and biomedical research.

Credits

3

Prerequisite

PBHL 5317