O

Services

Case studies

Contact

15.12.21

Stop using boxplots

featured image thumbnail for post Stop using boxplots

Only statisticians understand box plots.

Boxplots have been used for decades as a way of visualising the distribution of a continuous random variable. I've used them extensively.

But they are hard to explain and can be easily misinterpreted. To interpret them, you need to understand what a quartile is without thinking hard about it. A recent article from Nick Desbarats does a good job of persuading you to stop using them.

Better alternatives are available. Try using strip plots instead. geom_jitter in ggplot2, stripplot in seaborn . Each dot is a value, you can see them all on the plot. Its easy to see patterns. If you've got lots of dots, jitter them a bit. Possible additions involve adding a violin plot. Maybe geom_sina?

←Previous: How not to sort by average rating

Next: There's no such thing as raw data→


Keep up with the latest developments in data science. One email per month.

ortom logoortom logoortom logoortom logo

©2025

LINKEDIN

CLUTCH.CO

TERMS & PRIVACY