Running a Basic Statistical Test in Python on a Real World Example

The problem: Is scoring really higher in the NFL this year?

Obligatory let Russ cook
  1. My null hypothesis was that average scoring was no different from last year to this year. My alternative hypothesis was that it had increased. The Null hypothesis is the status quo, while your alternative hypothesis is based on something being different from the null hypothesis based on your observations
  2. I chose a standard significance of 0.05. This is your chance of returning a false positive or rejecting the null hypothesis when it is true. A lower alpha level reduces the risk of this but what I was doing was not especially risky. No one is going to lose money or be affected medically if I’m wrong. So going with the commonly used level of 0.05 seemed like a fine choice.
  3. To calculate the test statistic I will need some data.
A basic dist plot of the two years scoring averages, 2020 in orange, 2019 in blue

--

--

--

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

WHY DATA ANALYTICS IS CRUCIAL TO NETFLIX’S BUSINESS SUCCESS

Starting with NYC, new initiative weighs pros and cons of data re-use for COVID-19 and future…

5 Essential Pandas Tricks You Didn’t Know About

Big Data Analytics — What Companies Can’t Do Without

Feature Engineering

When to use pandas series, numpy ndarrays or simply python dictionaries?

NLP 101 3/3 — Neural Architectures for NLP

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Aidan Coco

Aidan Coco

More from Medium

Unintended Lessons in How Intro Stats is Taught

How PMM Algorithm Works

Try to relax while sleeping with soft music or classic music which makes you sleep better with…

Resource for Human Resources: Part-2