Z
stands forStandard Normal Distribution
.
It's fairly important in real life: Japan useZ-score
on exam to estimate each student's study skills.
Z-score is the essential concept of Z-Statistics
.
▶︎ Jump over to have practice: Comparing with z-scores
Refer to Wiki: Standard score
Refer to Khan academy: Z-score introduction
Refer to youtube: Why Do We Need z Scores
Refer to youtube: Statistics 101: Understanding Z-scores
Refer to Crash Course: Z-Scores and Percentiles: Crash Course Statistics #18
Refer to youtube: z-score Calculations & Percentiles in a Normal Distribution
Z-score
is all about comparison: compare different kind of data set.
In another word, Z-score
indicates How many standard deviations
away (above or below) from the mean
to the given point.
“Z-scores in general allow us to compare things that are NOT in the same scale, as long as they are NORMALLY distributed.” — CrashCourse
For example, although we know everyone’s score, but by only watching those scores it’s hard to know how good he is or how bad he is compare to anyone else in the dataset. etc., if most of the students score above 90, can we say someone scores 90 is good?
So Z-score
gives a solution for this: compare the score to the "average".
Z-score
is especially good to compare different type of data, etc., compare 100-score exam & 150-score exam, compare IELTS & TOFEL, compare apples & oranges, compare a baseball player & football player....
All in all, Z-score
is a process of Normalization
, which "normalize" different set of data to same standard and compare.
Compares the various grading methods in a normal distribution:
With comparing each one’s score with the mean: x - μ
, we will get a kind of deviation
.
But at this point we still don’t know whether each one’s deviation
is big or small.
We need a "standard" to compare each deviation.
Just like the mean
is the average of all scores,standard deviation
is the average amount of deviation of all scores, which will tell us each deviation is large or not.
So we want to compare each deviation
with the Standard deviation
: deviation ÷ 𝜎
And we get the whole picture:Standard Score = (𝓍 - μ) / 𝜎
Assume the standard deviation is 𝜎
(sigma), so the number of it just means how much it is scaled.
etc., 2𝜎
means a doubled standard deviation
, and 1.5𝜎
means 1.5 times larger SD
.
If your Z-score is 2𝜎
, it means your score is doubled standard deviation away from the mean
.
There’s some exam data of a class:
Here’s their z-scores:
Solve:
- Isabella’s z-score is:
(20-22)/5 = -0.4
- Hannah’s z-score is:
(33-38)/12.5 = -0.4
- So they’re equally young in their degree level.
This ONLY applies to
Normal Distribution
Refer to Khan academy: Standard normal table for proportion below
If you know someone’s z-score
, you will easily get his percentile
from the Z-table
.
Vice versa, if you know his percentile
, you can get his z-score
as well.
How to use?
The 1st Row
represents the tenth decimal
of the z-score
,
the 1st Column
represents the hundredth decimal
of the z-score
.
According to the given z-score
, and search over the rows & columns to get the corresponded intersection, which is the percentile
.
etc.,
Someone’s z-sore
is "0.57", and you want to know what percentile
he's at, or what proportion is below his score.
Just go over to the z-table
, first get to the row at 0.5
, and find the column of 0.7
, and the intersection will be his percentile
, which is "0.7157" or "71.57%" in this case.
Common values:
Explicit Z-table:
Solve:
- Get the
z-score
of student Faisal:(103.1-105)/10 = -0.19
. - Refer to the
Z-table
we'll get the correspondingpercentile rank
: 0.4247. - The answer is 0.4247 (42.47%) of students are shorter than Faisal.
Solve:
- Get two z-scores:
(82-83.2)/8 = -0.15
,(89.2-83.2)/8 = 0.75
- Get both points’ corresponding
percentiles
: 0.4404 & 0.7734 - Cut out the “overlays”:
0.7734 - 0.4404 = 0.333
- So the answer is 0.333 or 33.3%.
Refer to Khan academy: Finding z-score for a percentile
Just do the other way around by looking for the given percentile cell
and then read out the corresponded column & row, that will get you the z-score.
Solve:
- “Top 5%” means the
minimum percentile rank
is at 95, which is 0.95 in percentage. - Find out the corresponding
z-score
according to thepercentile
: - There’s no “0.95” in z-table but “0.9495” & “0.9505”
- Since the “minimum percentile` is 0.95, so “0.9505” is the one
- “0.9505” corresponds to the z-score “1.65”
- Take the z-score back to
z-score formula
:1.65 = (x-66000)/21000
- Get the
x=100650
which is the minimum annual profit.
Greetings, I'm an expert in statistical analysis and Z-scores. My depth of knowledge stems from both academic understanding and practical application in various fields. Allow me to provide you with a comprehensive overview of the concepts discussed in the article by Solomon Xie.
Solomon Xie delves into the concept of Z-scores, particularly emphasizing its importance in real-life applications, such as the use of Z-scores in Japan to estimate students' study skills. The key concepts covered in the article include:
-
Z-Score and Standard Normal Distribution:
- Z stands for Standard Normal Distribution, denoted as Z-score.
- It is a measure that indicates how many standard deviations a data point is from the mean in a normal distribution.
-
Purpose of Z-Scores:
- Z-scores allow for the comparison of different datasets, even if they are not on the same scale.
- They normalize data, bringing diverse sets to a common standard for effective comparison.
-
Z-Score Calculation:
- The Z-score is calculated using the formula: Z = (x - μ) / σ, where x is the data point, μ is the mean, and σ is the standard deviation.
- The standard deviation indicates the average amount of deviation of all scores from the mean.
-
Normalization and Comparison:
- Z-scores help compare various grading methods within a normal distribution.
- They enable comparing different types of data, such as exam scores, language proficiency tests, or sports performance.
-
Percentiles and Z-Table:
- Z-scores can be used to find percentiles using a Z-table.
- The Z-table provides the percentile rank for a given Z-score or vice versa.
-
Practical Examples:
- The article provides examples of calculating Z-scores for individuals and interpreting the results in terms of percentiles.
- It demonstrates how Z-scores can be applied to compare individuals in different scenarios, like students' heights or annual profits.
-
Application in Business:
- The article concludes by applying Z-scores to determine the minimum annual profit required to be in the top 5%.
In summary, Solomon Xie's article serves as a valuable resource for understanding Z-scores, their calculation, and practical applications in comparing and normalizing different sets of data. The inclusion of examples enhances comprehension, making it accessible for readers with varying levels of statistical expertise.