Chapter 2 Lab 2: Descriptive Statistics
Describing comic sensibility is near impossible. It’s sort of an abstract silliness, that sometimes the joke isn’t the star. —Dana Carvey
The purpose of this lab is to show you how to compute basic descriptive statistics, including measures of central tendency (mean, mode, median) and variation (range, variance, standard deviation).
2.1 General Goals
- Compute measures of central tendency using software
- Compute measures of variation using software
- Ask some questions of a data set using descriptive statistics
2.1.1 Important info
We will be using data from the gapminder project. You can download a small snippet of the data in .csv format from this link (note this dataset was copied from the gapminder library for R) gapminder.csv
2.2 JAMOVI
In this lab, we will use jamovi to calculate a variety of descriptive statistics. jamovi allows us to specify which statistics we would like calculated and produce them all in one output table. Here, we will learn to:
- Calculate descriptive statistics
- Graph data using a histogram
Let’s begin with a short data set {x= 1, 1, 4, 1, 2, 5, 7}
Suppose we want to calculate the measures of central tendency (mean, median, and mode) as well as variability (range, standard deviation, and variance). First, we will have to enter our data into the jamovi spreadsheet. There are 7 measurements, so we will need 7 rows of dat, see below on how to enter data into jamovi.
2.2.1 Descriptive Statistics
From here, go to the top menu and choose ‘Exploration’, and a new window will ask you to specify for which variables you want descriptives statistics calculated. Check the ones you want to see, and you will find them displayed on the right.
2.2.2 Descriptive Statistics and Histograms
Now let’s use a real dataset to calculate the same measures of central tendency and variability as in the last example, but with the addition of a histogram to visualize a distribution to relate back to the descriptive statistics.
Here is a link to the life expectancy dataset we used for our graphing tutorial. href=“https://raw.githubusercontent.com/CrumpLab/statisticsLab/master/data/gapminder.csv” download>gapminder.csv
Suppose we wanted to know about life expectancy (around the world) in the year 1952. This will include calculating descriptive statistics, as well as graphing a histogram to examine the distribution of our data. First, we have to filter the data to only include the rows relevant to the year 1952. As with our previous example, we will then click the ‘Exploration’ button next, and choose our measures of central tendency (mean, median, mode) as well as variability (range, standard deviation, variance). We will also check the box for histogram in the ‘Plots’ section. Then, jamovi will produce a table of the requested descriptive statistics and a histogram showing the distribution of life expectancy (in years). See the steps below to get the descriptives and histogram:
Something to think about: What do the mean, median, and mode indicate about the shape of the distribution? Is it confirmed when we look at the histogram? How does the shape of this distribution compare to the symmetrical normal distribution which is superimposed over it?
2.2.3 Practice Problems
Using the life expectancy data set, produce a table of output showing the descriptive statistics (measures of central tendency and variability) for the year 2007.
Advanced: Plot histograms of life expectancy for the year 1952 and 2007 on the same scale. How are these distributions different? .