Princeton University Data and Statistical 
Services Princeton University Library

Search DSS





Finding Data Using Data About Us

DSS lab
(Monday-Friday)
Sep 1-
Oct 2
By appt. only
Oct 3-
Dec 16
2-5 pm*, other times by appt.
Dec17-
Feb 4
By appt. only
Feb 6-
May15
1-5 pm*, other times by appt.
May16-
Aug31
By appt. only
Email data@princeton.edu for an appt or questions.
*No appts. necessary during walk-in hrs.

Follow DssData on 
Twitter
See DSS on Facebook

Home Online Help Statistical Packages Stata

STATA

Stata is an interactive data analysis program which runs on a variety of platforms. Stata is installed on the Windows machines and Macs in OIT's public clusters and on the Windows machines in the DSS Data Lab, as well as on the Tombstone Unix server.

Introduction/data manipulation

Statistical Analysis

  • Exploring your data (brief notes). Introduction to examining/exploring data, getting summary/descriptive statistics
  • Linear regresssion. Tutorial on interpreting the outcome of linear regression, interactions and diagnostics: heteroskedasticity, functional form, predicted values, omitted-variable test, multicollinearity, outliers, normality, coefficients table (estto/esttab). Include how to present the regression output using outreg2 (in Word and Excel)
  • Interpreting Stata Regression Output. Brief review of regression, R squared, significance.
  • Dummy variables. Using/creating dummy variables
  • Time Series (brief notes). Brief review of basic time series commands and date functions.
  • Event studies with Stata. Cleaning the data, calculating the event window, estimating normal performance, calculating the abnormal and cumulative abnormal returns, testins for significance.
  • Panel data analysis (brief notes). Brief review of fixed/between/random effects, hausman test
  • Descriptive statistics. Descriptive statistics using Stata, Excel and R.
  • Fixed/random effects (panel data). Stata tutorial on panel data analysis showing fixed effects, random effects, hausman tests, test for time fixed effects, Breusch-Pagan Lagrange multiplier, contemporaneous correlation, cross-sectional dependence, testing for heteroskedasticity, serial correlation, unit roots
  • Time series. Tutorial on setting the data as time series, use of lag operators, subsetting, interpreting correlograms, unit roots, cointegration, QLR or sup-Wald test, Granger causality, Chow test, test for serial correlation
  • Logit/ordered logit regression. Tutorial on interpreting logit regression output and ordered logit regression output, odds ratios and estimation of probabilities.
  • Probit regression. Brief introduction to probit regression
  • Factor analysis. Tutorial on factor analysis, predicting and interpreting output
  • Multilevel analysis. Tutorial on multilevel analysis: varying intercept, varying coefficient model, varying slope model and postestimation
  • Brief notes on statistical analysis. Overall review of basic statistics.

Additional Resources

  • DSS Wiki
  • DSS Libguides (Stata)
  • Stata Learning Modules at the UCLA Statistical Computing portal
    Graphing tutorials, data entry, collapsing and merging, and some regression diagnostics.
  • Stata Programming: Data Management at the Carolina Population Center
    Tutorials on topics including basic summary statistics, entering data, merging data files, and some more advanced programming concepts.
  • Regression with Stata online text from UCLA
    Model specification, regression commands, data diagnostics, interpreting Stata output.
  • Stata.com FAQs
    Official answers to questions on how to perform various specific tasks with Stata.
This page was last updated on Jan 6, 2010