Skip to content
Khan Academy · STATS-PROB · Introductory · April 7, 2026

Khan Academy: Statistics & Probability — Summary & Key Concepts

Instructor: Sal Khan

Mathematics

Khan Academy: Statistics & Probability — Summary & Key Concepts

Instructor: Sal Khan Platform: Khan Academy Difficulty: Introductory Department: Mathematics Original course: View on Khan Academy

Course Overview

Khan Academy's Statistics & Probability course provides a comprehensive, beginner-friendly introduction to the tools and reasoning used to analyze data and quantify uncertainty. Sal Khan guides students from basic data visualization through probability theory, sampling distributions, confidence intervals, and hypothesis testing — the core methods used in every data-driven field. The course is distinctive for its emphasis on intuition before formulas: students first understand why a statistical method exists and what question it answers before learning how to compute it. With extensive practice exercises, interactive simulations, and real-world data examples, the course builds both computational competence and the statistical thinking required to evaluate claims, design studies, and draw valid conclusions from data. Whether you are preparing for an AP exam, a college statistics course, or a career in data analysis, this course provides the foundation.

Key Concepts

  1. Descriptive statistics and data visualization — Before analyzing data, you must summarize and visualize it. The course covers measures of center (mean, median, mode), measures of spread (range, interquartile range, standard deviation), and visualization tools (histograms, box plots, scatter plots, dot plots). Students learn to choose the right summary statistic for different data shapes and to identify outliers, skewness, and clusters.

  2. Probability as the language of uncertainty — Probability quantifies how likely events are to occur. The course builds from basic counting (sample spaces) through conditional probability, independence, Bayes' theorem, and the law of total probability. Students develop the ability to model uncertain situations mathematically — a skill that underpins insurance, medicine, finance, and every scientific experiment.

  3. Random variables and probability distributions — A random variable assigns numerical values to outcomes of a random process. The course covers discrete distributions (binomial, geometric) and continuous distributions (normal), teaching students to compute expected values, variances, and probabilities using distribution formulas and tables. The normal distribution is treated as the central character of statistics — students learn why it appears everywhere and how to use z-scores for standardization.

  4. Sampling distributions and the Central Limit Theorem — The Central Limit Theorem (CLT) is the most important result in applied statistics: regardless of the population's shape, the distribution of sample means approaches a normal distribution as the sample size grows. This theorem justifies the use of normal-based inference methods (confidence intervals, hypothesis tests) in virtually every real-world application.

  5. Inference: confidence intervals and hypothesis testing — Statistical inference lets you draw conclusions about a population from a sample. Confidence intervals estimate a parameter with a margin of error. Hypothesis tests evaluate whether observed data provides sufficient evidence to reject a null hypothesis. The course covers one-sample and two-sample tests for means and proportions, chi-square tests, and the interpretation of p-values — emphasizing what these tools can and cannot tell you.

Module/Lecture Breakdown

ModuleTopicKey Concepts
1Analyzing Categorical DataFrequency tables, bar charts, two-way tables, marginal and conditional distributions
2Displaying and Describing Quantitative DataHistograms, dot plots, stem-and-leaf plots, shape, center, spread, outliers
3Summarizing Quantitative DataMean, median, IQR, standard deviation, box plots, comparing distributions
4Modeling Data DistributionsPercentiles, z-scores, normal distributions, empirical rule, normal probability calculations
5Exploring Bivariate Numerical DataScatter plots, correlation, least-squares regression, residuals, R-squared
6ProbabilitySample spaces, basic probability rules, conditional probability, independence, Bayes' theorem
7Random Variables and DistributionsDiscrete and continuous random variables, binomial distribution, geometric distribution, expected value
8The Normal Distribution and CLTNormal distribution properties, Central Limit Theorem, sampling distributions of means and proportions
9Confidence IntervalsPoint estimates, margin of error, confidence intervals for means and proportions, interpretation
10Hypothesis TestingNull and alternative hypotheses, test statistics, p-values, Type I and Type II errors, significance levels
11Chi-Square Tests and Advanced InferenceChi-square goodness-of-fit, chi-square test for independence, conditions and assumptions

Notable Insights

"Statistics is the science of learning from data. In a world drowning in information, the ability to extract meaning from data is one of the most valuable skills you can have." — Sal Khan, on the importance of statistics

"The Central Limit Theorem is almost magical. No matter how strange the population distribution looks, the sample means will form a nice, predictable normal curve. That's why statistics works." — Sal Khan, on the CLT

"A p-value is not the probability that the null hypothesis is true. It's the probability of seeing data this extreme if the null hypothesis were true. That distinction matters enormously." — Sal Khan, on interpreting p-values

"Correlation does not imply causation. But correlation does imply something — it tells you that two variables move together. Understanding why requires additional investigation, not more statistics." — Sal Khan, on correlation vs. causation

Who Should Take This Course

  • High school students preparing for AP Statistics or introductory college statistics who want clear, patient explanations of every concept
  • College students in social sciences, business, or health fields who need to understand statistics for research methods courses
  • Self-learners entering data science or analytics who need to build statistical foundations before working with tools like Python or R
  • Professionals who encounter data in their work (marketing, operations, product management) and want to make better decisions based on evidence
  • Anyone who reads news articles citing statistics and wants to understand what confidence intervals, p-values, and margins of error actually mean

Build Your Course Knowledge Vault

Taking an online course and want to retain what you learn? DistillNote helps you capture, organize, and review course material — automatically generating structured notes from video lectures and readings.

Paste a lecture URL → get structured summaries, key concepts, and searchable notes in 60 seconds. Build a vault of everything you study.

Try DistillNote free — no credit card required


More Khan Academy course summaries: View all courses Related: AI Course Note Taker · Best Study Tools 2026

Get AI-powered summaries of any course

Paste a course or lecture URL and get structured notes in 60 seconds.

More from Khan Academy

Wir verwenden Cookies zur Analyse, Verbesserung und Bewerbung unserer Website. (We use cookies for analytics, site improvement, and marketing.) Mehr erfahren / Learn more