Quasi-variance
Quasi-variance (qv) estimates are a statistical device for communicating the effects of a categorical explanatory variable within a statistical model. In standard statistical models the effects of a categorical explanatory variable are assessed by setting one category (or level) as a benchmark against which all other categories are compared. The benchmark category is usually referred to as the 'reference' or 'base' category. To make these comparisons possible, the parameter estimate for the reference category is fixed at zero; the choice of reference category is arbitrary. Statistical data analysis software usually reports formal comparisons of whether each other level of the categorical variable differs from the reference category. These comparisons generate the well-known 'significance values' of parameter estimates (i.e., coefficients). Whilst it is straightforward to compare any one category with the reference category, it is more difficult to formally compare two categories (or levels) of an explanatory variable with each other when neither is the reference category, because the standard error of their difference depends on the covariance between the two estimates, which is not usually reported. This is known as the reference category problem.
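The reference category problem can be illustrated with a small numerical sketch. The covariance matrix below is hypothetical (it does not come from any fitted model), and the function name `contrast_se` is introduced here only for illustration. The sketch shows that comparing a level with the reference needs only that level's own variance, whereas comparing two non-reference levels also needs their covariance.

```python
import math

# Hypothetical covariance matrix for the estimates of a three-level factor.
# Level A (index 0) is the reference category: its estimate is fixed at zero,
# so its row and column are all zeros.
COV = [
    [0.00, 0.00, 0.00],  # A (reference)
    [0.00, 0.04, 0.01],  # B
    [0.00, 0.01, 0.09],  # C
]


def contrast_se(cov, i, j):
    """Standard error of (estimate i - estimate j) from the full covariance matrix."""
    variance = cov[i][i] + cov[j][j] - 2.0 * cov[i][j]
    return math.sqrt(variance)


# B versus the reference A: only var(B) is needed, as in standard output.
se_b_vs_a = contrast_se(COV, 1, 0)

# B versus C: the covariance between the two estimates is needed as well.
se_b_vs_c = contrast_se(COV, 1, 2)

# Using the two reported variances alone, and ignoring the covariance,
# gives a different answer (too large in this example).
se_naive = math.sqrt(COV[1][1] + COV[2][2])

print(f"SE(B - A)                    = {se_b_vs_a:.4f}")
print(f"SE(B - C), with covariance   = {se_b_vs_c:.4f}")
print(f"SE(B - C), covariance ignored = {se_naive:.4f}")
```

A reader who sees only the coefficient table has the diagonal entries of this matrix but not the off-diagonal ones, so the B versus C comparison cannot be reproduced from standard output alone.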
Quasi-variances are statistics associated with the parameter estimates (coefficients) of the different levels of a categorical explanatory variable within a statistical model. There is one quasi-variance for each level, including the reference level, and they are chosen so that the variance of the difference between any two parameter estimates is approximated by the sum of the two corresponding quasi-variances. Quasi-variances can be presented alongside parameter estimates to enable readers to assess the difference between any pair of parameter estimates for a categorical explanatory variable. The approach is beneficial because such comparisons are not usually possible without access to the full variance-covariance matrix of the estimates.
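How a reader would use quasi-variances can be sketched as follows. The example assumes a factor with exactly three levels, a special case in which the three pairwise equations can be solved exactly; with more levels, quasi-variances generally only approximate the contrast variances and are obtained by a fitting procedure that is not shown here. The contrast variances and the function names are hypothetical and serve only as illustration.

```python
import math


def quasi_variances_three_levels(v12, v13, v23):
    """Solve q1 + q2 = v12, q1 + q3 = v13, q2 + q3 = v23 for (q1, q2, q3).

    v12, v13, v23 are the variances of the pairwise differences between the
    three parameter estimates. The closed form holds for three levels only.
    """
    q1 = (v12 + v13 - v23) / 2.0
    q2 = (v12 + v23 - v13) / 2.0
    q3 = (v13 + v23 - v12) / 2.0
    return [q1, q2, q3]


def quasi_se_of_difference(qv, i, j):
    """Standard error of (estimate i - estimate j) using quasi-variances only."""
    return math.sqrt(qv[i] + qv[j])


# Hypothetical contrast variances var(b_i - b_j) for levels A, B and C,
# where A is the reference category.
V_AB, V_AC, V_BC = 0.04, 0.09, 0.11

qv = quasi_variances_three_levels(V_AB, V_AC, V_BC)

# Any pair of levels can now be compared from the published quasi-variances,
# without the covariance matrix.
for (i, j), name in [((0, 1), "A vs B"), ((0, 2), "A vs C"), ((1, 2), "B vs C")]:
    print(f"{name}: SE of difference = {quasi_se_of_difference(qv, i, j):.4f}")
```

Note that the reference level receives a non-zero quasi-variance even though its conventional standard error is zero; this is what allows every pair of levels to be treated in the same way.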
Using quasi-variance estimates addresses the reference category problem. The underlying idea was first proposed by Ridout,[1] and the technique was set out by David Firth and Renee Menezes.[2][3] The suitability of this technique for social science data analysis has been demonstrated.[4] An online tool for calculating quasi-variance estimates is available, together with a short technical description of the methodology.
Quasi-variances can be calculated in Stata using the QV module[5] and can also be calculated in R using the package qvcalc.
References
- Ridout, M.S. (1989). Summarizing the Results of Fitting Generalized Linear Models to Data from Designed Experiments. New York: Springer-Verlag. pp. 262–9.
- Firth, David (2003). "Overcoming the Reference Category Problem in the Presentation of Statistical Models". Sociological Methodology. 33 (1): 1–18. doi:10.1111/j.0081-1750.2003.t01-1-00125.x.
- Firth, David; Menezes, RX (2004). "Quasi-variances" (PDF). Biometrika. 91 (1): 65–80. doi:10.1093/biomet/91.1.65. Retrieved 2017-03-17.
- Gayle, Vernon; Lambert, Paul S. (2007-12-01). "Using Quasi-variance to Communicate Sociological Results from Statistical Models". Sociology. 41 (6): 1191–1208. CiteSeerX 10.1.1.611.3153. doi:10.1177/0038038507084830. ISSN 0038-0385.
- Chen, Aspen (2014-07-21), QV: Stata module to compute quasi-variances, retrieved 2017-03-15