by
Paul Barnett, PhD
Seminar date: 3/6/2019
Description: Statistical analysis of health care cost is made difficult by two data problems. Some patients incur disproportionate costs, a statistical property called skewness. Other patients incur no cost at all; the distribution is truncated. As a result of these problems, it is rarely a good idea to analyze cost using the classic linear statistical model, ordinary least squares (OLS). Transforming cost by the taking its log results in a variable that is more normally distributed, allowing use of an OLS regression. The parameters from this regression have a natural interpretation as the proportionate effect of a unit change in the independent variable on cost. Care must be used when predicting costs from a model based on the log of costs. Log models have other limitations. The most important of these is that they should not be used when there are many zero cost observations in the data.
DOWNLOAD: PDF handout | Audio only (mp3) | transcript