Lesson 1: Statistical Methods

1 Introduction to Statistical Concepts and Terminology

Lesson 1: Statistical Methods
< Lessons |

Statistics is essentially the process of learning from data. The goal of statistics is to make correct statements or inferences about a population based on a sample.

Almost all natural resource professionals deal with data, for example a wildlife biologist may look at the number of owl packs on a given forest or a fuels specialist may look at the surface fuel loadings present in a prescribed burning block. In either case statistics is used to help us design how the data will be collected collected and to summarize the data after it is collected. Not only do we want to understand how to collect and summarize data we also want to be able to understand data that is published.

Understanding published data is just as important to us as natural resource professionals as collecting and summarizing our own data. Since we are not always able to collect the data we would like to have we often rely on published data to provide critical information. For example, it may not be possible to collect fire history data for completion of a restoration project, so we can use published data to provide that information. 

However, the results of that study are inferences based on a sampling design, and therefore may be biased or even completely invalid. For this reason it is important for us to not only be able to collect and summarize data our selves but we must interpret published data.

LESSON 1
< Lessons |

2 Populations and Samples

Lesson 1: Statistical Methods
The act of conducting a study begins with the collection of some measurement or number commonly called a random variable.

Random variables are any measurable characteristic or trait, in other words a measurement!

Example

We could measure the diameter of a tree or the spread rate of a forest fire. The object from which we collected the data is called an experimental unit. So for the last two examples the experimental unit would be a tree and the forest fire.

Population

A population is the set of all values of a single measurement (random variable) collected under certain conditions called the environmental conditions.

example 1 EXAMPLE 2
The population is equal to the set of all diameters of all trees on the Boise National Forest in 2006. The population is equal to the fire spread rate for all forest fires on the Boise National Forest during 2006.
 

In both cases our environmental conditions limit our population to the Boise National Forest in 2006.

Sample

A sample is a subset of the population. For example, we may only measure the diameter of 20,000 trees or collect the spread rate for 10 of forest fires on the Boise National Forest in 2006.

Although we are usually interested in the population we can often not obtain measurements on the entire population. So a sample provides us with some information about the population.

Remember that both populations and samples are sets of numbers, they are never sets of physical units or experimental units.

Example

Population is equal to the set of all trees on the Boise national Forest in 2006. This statement is wrong; it should read the population is equal to the set of diameters for all trees on the Boise National Forest in 2006.

LESSON 1

3 Parameters and Statistics

Lesson 1: Statistical Methods
Parameters and statistics are both numbers which are calculated. The difference between these two terms comes from where you get the numbers from.
bulletA parameter is any number calculated from a population.
bulletA statistic is any number calculated from a sample.

It is important to remember that we are often concerned about the population and the calculated parameters associated with that population. However, we usually do not know or can not obtain these values, so we rely on sampling and statistics to provide us with inferences about the population.

For any given population such as the fire rate of spread for all forest fires in Idaho during 2006 the parameters are fixed numbers, while statistics will vary depending upon the sample within the population.

It is important to understand that the use of statistics is simply a means to an end. We only use statistics to estimate the corresponding parameter in the population.

LESSON 1

4 Descriptive Statistics

Lesson 1: Statistical Methods
The objective of descriptive statistics is to produce numbers which describe attributes of the sample. In short, descriptive statistics allows us to summarize our data in clear and meaningful way.
Example

Let’s assume we collected fuel loading data from 50 stands on a National Forest. So we can now summarize this data in one of two ways. First we can summarize the data numerically by computing statistics such as the mean and standard deviation; to show the average amount of fuel loading and the degree to which fuel loading differs between stands. WE WILL GO OVER CALCULATING THESE STATISTICS IN LESSON 3.

The second way we could summarize this data is graphically by creating a box plot or a histogram. This method would provide information on the distribution of fuel loadings. WE WILL GO OVER HOW TO CREATE THESE TYPES OF GRAPHICAL SUMMARIES AS WELL AS OTHERS IN LESSON 3.

You should remember that graphical representation of data is best used to show patterns within the data, where as numerical summarization is more precise and objective. However, since both types of summarization are complementary it is best to use both.

LESSON 1

5 Inferential Statistics

Lesson 1: Statistical Methods
Inferential statistics are used to draw inferences about a population from a sample.
Example

Consider an experiment where tree growth rates were increased by 25 percent following a forest thinning operation on 10 sites compared to tree growth rates on 10 sites which were not thinned. Inferential statistics allows us to decide if the increased growth rates are due to chance or are real.

There are primarily two ways to use inferential statistics:

  1. The first is to estimate a parameter about a population.
  2. The second is to test a hypothesis.

For the example above, the hypothesis would be forest thinning has no effect on tree growth rates. This type of hypothesis is often called the null hypothesis. WE WILL EXPLAIN MORE ABOUT INFERENTIAL STATISTICS IN LESSON 4!

LESSON 1

6 Review Questions

Lesson 1: Statistical Methods
| Lesson 2 >
  1. Which of the following are characteristics of a statistic?
a. Is a number calculated from a sample
b. Vary from sample to sample
c. Are not important, that is to say they are not the numbers we would ideally have
d. All of the above

Response:

  1. True or False. A population is the set of all values of a single measurement from a random variable collected under certain environmental conditions?
a. True
b. False

Response:

  1. Which of the following is an experimental unit?
a. A ponderosa pine tree
b. A mountain chickadee
c. A wildland fire
d. A particular chain saw
e. All of the above

Response:

  1. True or False. A sample is a subset of a population but it is often imposable to collect.
a. True
b. False

Response:

  1. Which of the following are characteristics of a parameter?
a. A parameter is any number calculated from a population
b. We use parameters to calculate their corresponding statistic
b. Parameters are usually known, or they are not hard to obtain
d. None of the above

Response:

LESSON 1
| Lesson 2 >