Sampling (statistics)

From Wikiversity

(Redirected from Sampling)
Jump to: navigation, search

Contents

This page provides an introductory overview of statistical sampling. For more detailed information, see Sampling (statistics) on Wikipedia.

[edit] Definition of sampling

A term used in statistics. Sampling is the process of choosing a representative sample from a population and collecting data from that sample in order to understand something about the population as a whole.

Here is a simple illustration. You have before you a freshly baked homemade cherry pie (the greater whole) and you are pondering the question, “Does it taste delicious?” It looks delicious, it smells delicious, but is it delicious? You take a bite (sample) of the pie (greater whole of many bites), let your taste buds study it, and then make a generalization, otherwise known as an inference, about the whole pie….mmm this pie is delicious!

The term used in statistical sampling to describe the greater whole is population. A population isn’t just people; it could be any group of objects you are studying. A population could be comprised of things such as rocks containing gold, dog biscuits made by Purina, or all left handed one toothed people in the world. Unfortunately, studying populations other than one pie can be complicated, expensive, or time consuming; so researchers have developed several different ways to sample whatever it is they are studying. Some of the different methods that researchers use are: Random sampling, systematic sampling, stratified sampling, cluster sampling and convenience sampling.

Let's say after the cherry pie experiment you decide to become a researcher. Before you, the researcher, can determine which method to use you must first decide what will be your target population, such as left handed one toothed people. From that you would develop a sample frame, or list, of all the left handed one toothed people. This can be a difficult task at times. However once you have the list of the entire target population you would probably want to apply one of the following methods for determining what, or who, will be in the sample.

[edit] Random sampling

This doesn’t mean haphazard. It means every left handed one toothed person in the sampling frame (list) has an equal and unbiased chance of being sampled. Usually you will number the names or items on the list and then randomly pick numbers to make up the sample. This method is great for statistical accuracy, but very difficult to do sometimes in practice. What if you have to travel to Guam, Brazil, Canada and Germany to sample your left handed one toothed people?..

[edit] Systematic random sampling

Here random sampling is given a little structure. You decide you want to sample 100 of your 1000 left handed one toothed people in your sampling frame. First, a random starting point is picked and then the rest of the sample is selected at equal intervals from that starting point. So for our example, you would pick a number from 1-100, say the#8, and then pick every 100th person from that number. You start with person # 8, which means 8,108, 208, 308 etc… would make up your sample. This method ensures better coverage of the population, as long as nothing quirky comes up as an underlying pattern, such as every 100th person lost their teeth by eating taffy.

[edit] Stratified sampling

In this method your list would be divided up in non-overlapping groups and then samples from each of those groups would be picked. For example, we could group our left handed one tooth people according to geographic region or age group. This could give us more valuable precise data if the way they are grouped is relevant to what is being studied. So, we might learn not only how left handed people lose their teeth but that how they lost them differs between Europe, South America, Asia and North America.

[edit] Clustering sampling

First the sampling frame is broken up into groups. Then a sample of groups is randomly picked out of all the groups. Finally, the people in those groups are randomly sampled. This cuts down on travel, time and expense. For example, you would end up only sampling people from Paris, Buenos Aires, Beijing, and Chicago instead of traveling to 40 different towns and cities in ten different countries. The one drawback is that the groups need to be as dissimilar as possible or you could have a large sampling error. For instance, if all of your groups end up being in Europe you would lose valuable information and it wouldn’t be representative of your population.

[edit] Convenience sampling

This is where you pick your sample according to what is available. This is why college students are studied so much…no, truly, it isn’t because they are so strange! Another example is when you see someone on a street corner randomly stopping people to do a survey. Convenience sampling is great because…here it comes…it’s convenient, but it is often difficult to make inferences to the population at large.

That's it! If you need to know more see Sampling (statistics) on Wikipedia.

[edit] See also

[edit] External links

  • Sampling (Research Methods Knowledge Base)