Statistical Measures: Percentiles and Quartiles Explained
Master Statistical Rankings and Data Position Analysis
Percentiles and quartiles help us understand where individual values stand relative to the entire dataset, not just their absolute scores.
Key Statistical Concepts
Percentiles
Shows what percentage of data falls below a particular value. A 90th percentile score means you performed better than 90% of all participants.
Quartiles
Divides data into four equal parts (25% each). Useful for identifying data distribution patterns and outliers in your dataset.
Sample Dataset Overview
PERCENTILE.INC vs PERCENTILE.EXC Functions
| Feature | PERCENTILE.INC | PERCENTILE.EXC |
|---|---|---|
| Zero Values | Includes 0% | Excludes 0% |
| Maximum Values | Includes 100% | Excludes 100% |
| Best Use Case | Complete range needed | Avoid extreme values |
Calculating Percentiles in Excel
Select Function Type
Choose PERCENTILE.EXC to exclude extreme values or PERCENTILE.INC to include the full range from 0% to 100%.
Lock Your Array
Select your data range and press F4 to lock it with absolute references before autofilling formulas down.
Set Percentile Value
Enter your desired percentile (like 10% for bottom performers) as the K value in the function.
Apply Conditional Formatting
Use color coding to visually identify high performers (green), average (neutral), and low performers (red).
Quartile Breakdown Structure
When creating quartile calculations, lock rows (not columns) when dragging down, and lock columns (not rows) when dragging across to maintain proper cell references.
QUARTILE.INC vs QUARTILE.EXC Functions
| Feature | QUARTILE.INC | QUARTILE.EXC |
|---|---|---|
| Quartile Range | Q1 to Q4 (full range) | Q1 to Q3 only |
| Zero Inclusion | Includes zero values | Excludes zero values |
| Recommended Use | Complete quartile analysis | Core performance ranges |
Box and Whisker Chart Analysis
Box Plot Component Guide
Shaded Box
Represents the interquartile range (Q1 to Q3). Shows where 50% of your data falls, indicating the core performance range.
Whisker Lines
Upper and lower lines show the complete data range from minimum to maximum values. Hover to see exact numbers.
X Mark and Center Line
X represents the mean (average) while the line inside the box shows the median (middle value). Differences indicate data skew.
You want to take a look at risk versus reward. If you're looking for a safe range of values, one that is more consistent, then the third quarter would be ideal.
Statistical Analysis Best Practices
High potential rewards often come with proportionally high risks
Stable, predictable results often outweigh volatile high performers
Large differences indicate data skew or outliers affecting your analysis
Color coding helps identify patterns and outliers at a glance
This lesson is a preview from our Data Analytics Certificate (includes software). Enroll in this course for detailed lessons, live instructor support, and project-based training.
Key Takeaways