The z-score is a statistical measurement that describes a value's relationship to the mean of a group of values. It indicates how many standard deviations an element is from the mean. The formula to calculate the z-score is:
\[
z = \frac{(X - \mu)}{\sigma}
\]
Where:
- \( z \) = z-score
- \( X \) = the value you want to standardize
- \( \mu \) = the mean (average) of the dataset
- \( \sigma \) = the standard deviation of the dataset
### Breaking Down the Formula
1. **Mean (\( \mu \))**: This is the average of all the values in your dataset. You calculate it by summing all the values and dividing by the number of values.
\[
\mu = \frac{\sum X}{N}
\]
Where \( N \) is the number of values in the dataset.
2. **Standard Deviation (\( \sigma \))**: This measures the amount of variation or dispersion in a set of values. A low standard deviation means the values tend to be close to the mean, while a high standard deviation indicates that the values are spread out over a wider range.
The formula for standard deviation is:
\[
\sigma = \sqrt{\frac{\sum (X - \mu)^2}{N}}
\]
3. **Calculating the z-score**: Once you have the mean and standard deviation, you can substitute those values into the z-score formula.
### Example
Let's say you have a dataset of test scores: 70, 75, 80, 85, and 90.
1. **Calculate the mean (\( \mu \))**:
\[
\mu = \frac{70 + 75 + 80 + 85 + 90}{5} = \frac{400}{5} = 80
\]
2. **Calculate the standard deviation (\( \sigma \))**:
\[
\sigma = \sqrt{\frac{(70-80)^2 + (75-80)^2 + (80-80)^2 + (85-80)^2 + (90-80)^2}{5}} = \sqrt{\frac{100 + 25 + 0 + 25 + 100}{5}} = \sqrt{\frac{250}{5}} = \sqrt{50} \approx 7.07
\]
3. **Calculate the z-score for a specific score, say \( X = 85 \)**:
\[
z = \frac{(85 - 80)}{7.07} \approx \frac{5}{7.07} \approx 0.71
\]
### Interpretation
A z-score of 0.71 means that the score of 85 is 0.71 standard deviations above the mean score of 80.
### Summary
The z-score helps you understand how far a particular value lies from the mean in terms of standard deviations, allowing for comparison across different datasets or distributions.