Another approach would be to consider whether a rescaling or transformation of the data might create a more normal-looking distribution, allowing us to use the very friendly normal distribution to estimate probabilities. In this case, we need a transformation that will downsize the large values more than the small values. Log transformations are often very helpful in this regard.
|
Definition: A data transformation applies a mathematical function to each value to re-express the data on an alternative scale. For example, a one-unit increase on the Richter scale conveys the magnitude of an earthquake is 10 times worse (the amplitude of seismic waves is 10 times greater).
|
Create a new variable which is log(responsetime). (You can use either natural log or log base 10, but so we all do the same thing, let’s use natural log here, which is the default in most software when you say “log.”)
• In R: > lnresponsetime = log(honking$responsetime)
• In JMP: Create a new column (e.g., double click on next column over) and then open the formula editor for that column (e.g., Cols > Formula). Type or use your mouse to select Transcendental > Log to create Log(responsetime). Press OK.
Create a histogram of this new variable and a normal probability plot. This time, I do want you to upload your output.