Stratified random sampling

Stratified random sampling is a type of probability sampling technique [see our article Probability sampling if you do not know what probability sampling is]. Unlike the simple random sample and the systematic random sample, sometimes we are interested in particular strata (meaning groups) within the population (e.g., males vs. females; houses vs. apartments, etc.) [see our article, Sampling: The basics, if you are unsure about the terms unit, sample, strata and population]. With the stratified random sample, there is an equal chance (probability) of selecting each unit from within a particular stratum (group) of the population when creating the sample. This article explains (a) what stratified random sampling is, (b) how to create a stratified random sample, and (c) the advantages and disadvantages (limitations) of stratified random sampling.

Stratified random sampling explained

Imagine that a researcher wants to understand more about the career goals of students at the University of Bath. Let's say that the university has roughly 10,000 students. These 10,000 students are our population (N). In order to select a sample (n) of students from this population of 10,000 students, we could choose to use a simple random sample or a systematic random sample. However, sometimes we are interested in particular strata (groups) within the population. Therefore, the stratified random sample involves dividing the population into two or more strata (groups). These strata are expressed as H.

For example, imagine we were interested in comparing the differences in career goals between male and female students at the University of Bath. If this was the case, we would want to ensure that the sample we selected had a proportional number of male and female students. This is known as proportionate stratification (as opposed to disproportionate stratification, where the sample size of each of the stratum is not proportionate to the population size of the same stratum). With stratified random sampling, there would an equal chance (probability) that each female or male student could be selected for inclusion in each stratum of our sample. However, in line with proportionate stratification, the total number of male and female students included in our sampling frame would only be equal if 5,000 students from the university were male and the other 5,000 students were female. Since this is unlikely to be the case, the number of units that should be selected for each stratum (i.e., the number of male and female students selected) will vary. We explain how this is achieved in the next section: Creating a stratified random sample.

Creating a stratified random sample

To create a stratified random sample, there are seven steps: (a) defining the population; (b) choosing the relevant stratification; (c) listing the population; (d) listing the population according to the chosen stratification; (e) choosing your sample size; (f) calculating a proportionate stratification; and (g) using a simple random or systematic sample to select your sample.

Define the population

In our example, the population is the 10,000 students at the University of Bath. The population is expressed as N. Since we are interested in all of these university students, we can say that our sampling frame is all 10,000 students. If we were only interested in female university students, for example, we would exclude all males in creating our sampling frame, which would be much less than 10,000.

Choose the relevant stratification

If we wanted to look at the differences in male and female students, this would mean choosing gender as the stratification, but it could similarly involve choosing students from different subjects (e.g., social sciences, medicine, engineering, education, etc.), year groups, or some other variable(s). For the purposes of this example, we will use gender (male/female) as our strata.

List the population

We need to identify all 10,000 students at the University of Bath. If you were actually carrying out this research, you would most likely have had to receive permission from Student Records (or another department in the university) to view a list of all students studying at the university. You can read about this later in the article under Disadvantages (limitations) of stratified random sampling.

List the population according to the chosen stratification

As with the simple random sampling and systematic random sampling techniques, we need to assign a consecutive number from 1 to NK to each of the students in each stratum. As a result, we would end up with two lists, one detailing all male students and one detailing all female students.

Choose your sample size

Let's imagine that we choose a sample size of 100 students. The sample is expressed as n. This number was chosen because it reflects the limit of our budget and the time we have to distribute our questionnaire to students. However, we could have also determined the sample size we needed using a sample size calculation, which is a particularly useful statistical tool. This may have suggested that we needed a larger sample size; perhaps as many as 400 students.

Calculate a proportionate stratification

Imagine that of the 10,000 students, 60% of these are female and 40% male. We need to ensure that the number of units selected for the sample from each stratum is proportionate to the number of males and females in the population. To achieve this, we first multiply the desired sample size (n) by the proportion of units in each stratum. Therefore, to calculate the number of female students required in our sample, we multiply 100 by 0.60 (i.e., 0.60 = 60% of the population of students at the university), which gives us a total of 60 female students. If we do the same for male students, we get 40 students (i.e., 40% of students are male, where 100 x 0.40 = 40). This means that we need to select 60 female students and 40 male students for our sample of 100 students.

Use a simple random or systematic sample to select your sample

Now that we have chosen to sample 40 male and 60 female students, we still need to select these students from our two lists of male and female students (see STEP FOUR above). We do this using either simple random sampling or systematic random sampling [click on the links to see what to do next].

Advantages and disadvantages (limitations) of stratified random sampling

The advantages and disadvantages (limitations) of stratified random sampling are explained below. Many of these are similar to other types of probability sampling technique, but with some exceptions. Whilst stratified random sampling is one of the 'gold standards' of sampling techniques, it presents many challenges for students conducting dissertation research at the undergraduate and master's level.