BIO 412 Principles of Ecology Phil Ganter 302 Harned Hall 963-5782 |
|
Australian Sealions basking on shore |
Population Estimation
Back to:
Course
Page |
Tennessee
State Home page
|
Bio 412
Page |
Ganter
home page |
Introduction:
A fundamental question ecologists must often answer is: "How many are there?" Plants, because they do not move, can be counted by mapping an area or the number can be estimated by quadrat or transect methods (described in the lab on spatial patterns). When estimating population size (or other population parameters, like mortality or migration) for animals with biological characteristics that differ from plants, other techniques must be used. For instance, it may be difficult to distinguish individuals when surveying the population of a soil-dwelling fungus. In such a case, counting individuals may not be as useful as determining the biomass (amount of living tissue) per cubic meter of soil and ignoring the question if it represents many individuals or a single, large individual. Animals present their own difficulties. Some animals are sessile (nonmoving) and colonial, so that individuals are once again difficult to detect even if they don't move. However, most animals are easy to distinguish as individuals but do have the ability to move, which can make counting them difficult to do. Sometimes, counts can be taken so fast that it is impossible for one animal to be counted twice (for instance, flying over a grassland and taking photos in which animals can be counted). In this case, the sample is a snapshot in which no individual is moving and one treats such data like quadrat or transect data on trees in a forest. However, often such an approach is impossible and one must recognize the ability of individuals to move and be counted more than once. Often, ecologists take advantage of this by marking individuals in one sample and taking a second sample in which there is a chance that the same (now marked) individual can be recaptured. There are many sampling designs that are variations on this basic idea and all are collectively called Mark-Recapture Population Estimation Techniques. In this lab, we will simulate an animal population and use several mark-recapture techniques to estimate population number. We will do this with peanuts or beans, as they will allow us to concentrate on the technique without being subject to field error. Any estimation technique will make assumptions about the animals being counted, and we will look at some means of testing their validity.
Just a short note on the methods of capture and marking. Capture methods are as varied as are animals, and some are listed below. Marking is often a very difficult problem, as you want to mark an individual, but you do not want to alter the individual beyond your ability to detect the mark. Specifically, you do not want the capturing and marking to affect individual mortality or fecundity, either through the trauma of marking or by making the animal more visible to predators (if prey) or to prey (if a predator). In addition, you do not want to alter the marked individuals behavior so that it is either more or less easy to capture a second time. The mark should be permanent, or at least last as long as the study. Some capture and marking techniques are listed below with the animals upon which the technique is used.
Animal
- Capture Technique
- Marking Technique
Birds
- netting, nest invasion
- leg banding, recording color patterns, feather clipping
Mammals
- live trapping, sedation
- hair clipping, collars, dyeing hair, recording color pattern, clipping nails, leg banding, subdermal radio transponders
Amphibians
- Pit traps, hand collection
- toe clipping, paint injection, recording color pattern
Insects
- Sweep Netting, Pit trapping, Aspirating, Pheromone Trapping, Light Trapping
- Fluorescent dust, Tags (glued on), Recording Wing Damage Patterns, Radioactive Tracers
Mollusks
- Hand collection
- Paint, Glue-on Tags
Mark-Recapture Models:
The mark-recapture method is perhaps the most common way of estimating population number. The basic idea is to take some animals from the population and mark them for future identification. Then they are returned to the population, where they then mix into the population (so sufficient time must elapse between mark and recapture to allow complete mixing). The simplest mark-recapture techinques assume that the remixing of the population is uniform so that, when you capture a second group from the population, there is an equal chance of capturing marked and unmarked individuals. The population size is estimated from the fraction of captures that are marked individuals.
Single-Recapture Methods:
Closed Populations: the Lincoln/Petersen Index
In a closed population, no mortality, natality, or migration takes place, so that the assumption is that the population does not change between mark and recapture times. The Lincolc-Petersen method is not really an index, rather it is a method of estimating population size, which is usually designated as N. The procedure is close to that described above and you must capture twice. The individuals taken in the first capture are all marked and released into the population (M1). A second capture (C) is then done and the fraction of the second capture marked (M2) is noted. The rationale for estimating population size is that the ratio of marked individuals to unmarked in the second capture should be the same ratio of total number of individuals captured the first time to total population size. For example, if you capture 1 % of the population and mark all of them, then you should expect that a second capture to contain 1% marked individuals.
This proposition is easily made into an equation. If M1 is the number of individuals marked (the number taken in the first capture, M2 is the number of marked individuals, C is the total second catch (including previously marked individuals), and N is the population size (the only unknown that you are estimating) then:
or
.
In an analysis of this method, Bailey (1952) found that the above formulation did not give as accurate an estimate as when he added one to the number of recaptured marked individuals and the total caught in the second capture:
.
The hat over the N means that this is an estimator of N and is read "N-hat". Now that we have the estimator, we need to know just how good the estimate is. We need a confidence interval. The idea of a confidence interval is always linked to an arbitrarily chosen probability, like 95%. We will use this number, and so we will construct a 95% confidence interval for our estimate of population size. This confidence interval should be interpreted thus: "given the data and our calculated estimate of N, in 95 out of 100 cases, the true population size is within the boundaries of the confidence interval." There is more that one way to calculate this interval, but we will do it as follows.
Calculate the standard error of N (the expected spread of multiple estimates done from this population) for our estimate as:
.
Then the 95% confidence interval is:
.
This method assumes that:
Multiple-Recapture Methods:
Multiple capture methods can not always be done, but they improve the accuracy of the population estimate when they can be done.
Closed Populations: the Schnabel Method
This works well in closed populations such as the fish in a lake, the spiders in an old field (where the surrounding woods are hostile territory for the spiders), etc. Beans in a bucket work well too! To do this method, capture on multiple occasions, mark, and make a chart with the columns below:
Capture |
# Captured |
Recaptures |
Free Marks |
||
Date |
Mt |
Rt |
Ft |
MtFt^2 |
Rt*Ft |
4-Mar |
10 |
0 |
0 |
0 |
0 |
6-Mar |
27 |
0 |
10 |
2700 |
0 |
8-Mar |
17 |
0 |
37 |
23273 |
0 |
10-Mar |
7 |
0 |
54 |
20412 |
0 |
12-Mar |
1 |
0 |
61 |
3721 |
0 |
14-Mar |
5 |
0 |
62 |
19220 |
0 |
16-Mar |
6 |
2 |
67 |
26934 |
134 |
18-Mar |
15 |
1 |
71 |
75615 |
71 |
20-Mar |
9 |
5 |
85 |
65025 |
425 |
22-Mar |
18 |
5 |
89 |
142578 |
445 |
24-Mar |
16 |
4 |
102 |
166464 |
408 |
26-Mar |
5 |
2 |
114 |
64980 |
228 |
28-Mar |
7 |
2 |
117 |
95823 |
234 |
30-Mar |
19 |
3 |
122 |
282796 |
366 |
sum 1 |
sum 2 |
||||
989541 |
2311 |
||||
N= |
sum1/sum2= |
428 |
In the table, the # marked is just that, the number marked at each sampling. The recapture column is also easy to understand. It is simply the number recaptured. The marks here are the same for all individuals and for all samplings. This complicates the third column, the number of marked individuals in the population. For each sampling, it should be the sum of all marked individuals from the beginning. However, if you catch 8 and two already have marks, you are really only increasing the number of marked individuals by six. This is why the Ft entry for March 18 is 71. If you simply add 6 to the previous total number of marked individuals (67 for March 16), you would be counting the two recaptures on March 18 twice (once when they were first marked and once when they were recaptured). So you should add 4 (= 6 caught - 2 already marked) for a total of 71. The next two columns are for calculating the estimate of N. The MtFt2 column is the product of Mt and the square of Ft. The last column is the product of Rt and Ft. The estimate of N is:
The standard deviation of this value is difficult to calculate and so we will merely mention that it exists and that one can calculate a confidence interval for the Schnabel estimate as well.
This method assumes that the marked individuals have had time to mix into the population so that each marked individual is "equally catchable" as any unmarked individual (we will examine this assumption later). The population size must remain the same during the course of the sampling. Also, it is assumed that the number of marked individuals does not change between the captures (no loss of marking, no mortality, no migration). Actually, there can be mortality, but the free marked individuals must be reduce by the number that died.
Open Populations:
the Jolly Stochastic Method:
This method allows individuals to die or migrate, so the population size can fluctuate. This method requires that more than two captures are done and that the marks made on one date are different from those made at another time. Note that no organism gets more than one mark and that that mark identifies not just that it has been captured before but also identifies when it was first captured.
We shall use an example taken from the inventor of the method (Jolly, 1965). In the table below, the diagonal table represents the actual number of marked insects collected over 13 days of marking. If you look at the fourth column you can see that that night 209 insects were trapped, that 56 were carrying marks (153 had no marks and had never been captured before) and that 202 of the 209 captured were subsequently marked and released. Presumably the 7 that were not released died either during capture or marking. Of the 56 that were captured with pre-existing marks, 5 were last captured on the first night, 18 on the second night, and 33 on the third night (notice that 56 = 5 + 18 + 33
Also, in our case, si and ni will usually be the same. They differ when an organism is captured but is not released (maybe marking or capturing it harmed it or killed it). This will not happen to you unless you lose a bean in the process of counting. However, the data below reflects lots of mortality due to this. The first capture, none were killed and all were released, so ni = si. The second time, three died, so ni (146) is three greater than si (143. The number killed is entered in the row above si.
Time of capture | |||||||||||||
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
11 |
12 |
13 |
|
Total Caught ni | 54 |
146 |
169 |
209 |
220 |
209 |
250 |
176 |
172 |
127 |
123 |
120 |
142 |
Total Marked mi | 0 |
10 |
37 |
56 |
53 |
77 |
112 |
86 |
110 |
114 |
77 |
72 |
95 |
Total Unmarked | 54 |
136 |
132 |
153 |
167 |
132 |
138 |
90 |
62 |
13 |
46 |
48 |
47 |
Number that died | 0 |
3 | 5 | 7 | 6 | 2 | 7 | 1 | 3 | 1 | 3 | 0 | 0 |
Total Released si | 54 |
143 |
164 |
202 |
214 |
207 |
243 |
175 |
169 |
126 |
120 |
120 |
142 |
Time of | |||||||||||||
Last Capture | |||||||||||||
1 |
10 |
3 |
5 |
2 |
2 |
1 |
0 |
0 |
0 |
1 |
0 |
0 |
|
2 |
34 |
18 |
8 |
4 |
6 |
4 |
2 |
0 |
2 |
1 |
1 |
||
3 |
33 |
13 |
8 |
5 |
0 |
4 |
1 |
3 |
3 |
0 |
|||
4 |
30 |
20 |
10 |
3 |
2 |
2 |
1 |
1 |
2 |
||||
5 |
43 |
34 |
14 |
11 |
33 |
0 |
1 |
3 |
|||||
6 |
56 |
19 |
12 |
5 |
4 |
2 |
3 |
||||||
7 |
46 |
28 |
17 |
8 |
7 |
2 |
|||||||
8 |
51 |
22 |
12 |
4 |
10 |
||||||||
9 |
34 |
16 |
11 |
9 |
|||||||||
10 |
30 |
16 |
12 |
||||||||||
11 |
26 |
18 |
|||||||||||
12 |
35 |
We need to define some parameters to calculate the size of the population from this sort of data. The first is the proportion of the capture that was marked:
.
For example, this is 56/209 (=0.268) for the fourth capture, 53/220 (0.241) for the fifth capture. Next, we calculate the size of the marked population, Mi (notice that it is a capital letter, and is not the same thing as mi, the total number of marked individuals in a capture):
.
In this expression, si is the total of released individuals (202 for the fourth capture, 214 for the fifth), zi is the sum of all individuals marked in earlier captures that are ever caught after the capture. Thus, zi is
z4 = 2+2+1+0+0+0+1+0+0+8+4+6+4+2+0+2+1+1+13+5+8+0+4+1+3+3+0 = 71 Note that this sum is from the table above. The terms "2+2+1+0+0+0+1+0+0" that begin it are from the first row (time of last capture 1) from the fourth column (time of capture) to the end of the row (the cells are in yellow). Thus, these are all of the recaptures of individuals marked in the first capture (past the fourth recapture). We also need the same data for those individuals marked in the second and third captures (remember we are calculating the population size for the fourth capture date) and if you look at the last 18 terms of the summation, you will see the next two rows of recaptures there.
z5 = 2+1+0+0+0+1+0+0+4+6+4+2+0+2+1+1+5+8+0+4+1+3+3+0+20+10
+3+2+2+1+1+2 = 89
Here, you can see that z5 starts out identical to the z4 case, but one cell to the left. This is so because we are now looking at all of the recaptures after time 5, not time 4. Also, since it is time 5, we look at the recaptures marked at time 4 in addition to those at times 1, 2, and 3 (as it was for z4).
Ri is the sum of times individuals marked at time i are ever caught. . BE AWARE THAT R4 AND z4 BOTH EQUAL 71, BUT THIS IS BY CHANCE, NOT BECAUSE THEY HAVE TO BE THE SAME.
R4 = 30+20+10+3+2+2+1+1+2 = 71 (the cells are in light green) R5 = 43+34+14+11+33+0+1+3 = 129
So, M4 = (202*71)/71 + 56 = 258 and M5 = (214*89)/139 + 53 = 190.02. Make sure you know where the numbers come from in the table and examples above! You will not be able to complete this lab if you do not.
With these parameters, we can estimate population size, N for each capture date (Ni):
.
N4 = 258/0.268 = 963 and N5 = 190/0.241 = 789.
We can also estimate the probability of surviving from time i to time i+1:
Notice that this value can be greater than 1 if the population grows between i and i+1. F 4 = 190/(258 - 56 + 202) = 0.47 and F 5 = 386/(190 - 53 + 214) = 1.10. Finally, we can estimate the number of animals added to the population between time i and time i+1:
With this method, one can get an accurate estimation about the dynamics of a population, not just a static estimate of its size at one time.
Testing assumptions of the multiple capture models:
Equal Catchability:
There are several methods for testing the assumption that all individuals are equally catchable. We will use a simple index that requires that the population is sampled at least three times (Cormack, 1966). For this test, we need:
We want to be 95% confident that all animals are equally catchable. (I chose 95% arbitrarily and it can be anything between 0 and 1.) We will calculate a statistic that, if less than 1.96, allows us to make the statement above about equal catchability. If the value is 1.96 or above, the we can not make the statement above because some animals were more difficult to catch than others. We have to say that it is likely that not all animals are equally catchable. The statistic is calculated as:
This is long, but all of the terms are simple and no complicated summations are needed. I will not go into the details of how this is derived, but I will say that it has to do with a probability distribution called the multinomial.
An example: Suppose that you capture 25 individuals, mark all of them an release them. The next time, you recapture 8. These 8 keep their old marks and have a new mark added to them. The third capture produces 9 recaptures, 7 of them with only the first mark on them and two with both marks. Then n1 is 25, m12 is 8, m13 is 7, m123 is 2. The last term, m10 is the difference between n1 (25) and all those ever caught. This total is m12 + m13 (exclude m123, as they were counted in the first 8 caught). So, z is:
And so, z equals ~180/147 or ~1.22. This z is less than the critical value and so these organisms were equally catchable and a key assumption of the Jolly (and other methods) has been tested and found to be acceptable.
You can use your jolly data if you record the number of beans captured and add marks to those beans you have caught before. Image that you have just used blue and yellow marks (in that order) The next recapture, you get 9 recaptures with blue marks. Three also have yellow marks. Thus, you know that the three with the blue and yellow marks are the m123 types and the other six (which have blue but not yellow marks) are m13 types.
Gathering the Data:
Closed Populations &endash; Lincoln/Petersen Index:
OOPS! The second time you capture, you must get at least one marked individual or this technique will not work, as the number of marked individuals is a divisor, and division by zero is undefined. If you get at least one marked individual, skip the four steps below. If you get no marked beans in the first capture:
Closed Populations - Schnabel method:
Open Populations - Jolley Method:
At this point, you will do either Jolley A or one of the four experiments from Jolley B. DO NOT DO BOTH. Talk to your instructor about the two variations. It is your choice, but be sure you understand the choices before you do anything.
Jolley A - population fluctuations
Jolley B - Births and deaths
Following the marking procedure outlined above, but manipulate the population in the following manner. Each group will do ONLY ONE of the four procedures outlined below. Each procedure is designed to explore different kinds of pupulation changes on the results of the jolly method. We will do this by using set sample sized and manipulating the population size.
The procedure should go like this
Experiment Number | Starting Population Size | Capture | Sample Size | Removals (Deaths) | Add Unmarked (Births) |
1 | 400 | 1 | 60 | 40 | 40 |
Stationary | 2 | 60 | 40 | 40 | |
Population | 3 | 60 | 40 | 40 | |
4 | 60 | 40 | 40 | ||
5 | 60 | 40 | 40 | ||
6 | 60 |
Experiment Number | Starting Population Size | Capture | Sample Size | Removals (Deaths) | Add Unmarked (Births) |
2 | 400 | 1 | 60 | 0 | 26 |
Growing | 2 | 60 | 0 | 36 | |
Population | 3 | 60 | 0 | 52 | |
4 | 60 | 0 | 64 | ||
5 | 60 | 0 | 80 | ||
6 | 60 |
Experiment Number | Starting Population Size | Capture | Sample Size | Removals (Deaths) | Add Unmarked (Births) |
3 | 400 | 1 | 60 | 80 | 0 |
Declining | 2 | 60 | 64 | 0 | |
Population | 3 | 60 | 52 | 0 | |
4 | 60 | 36 | 0 | ||
5 | 60 | 26 | 0 | ||
6 | 60 |
Experiment Number | Starting Population Size | Capture | Sample Size | Removals (Deaths) | Add Unmarked (Births) |
4 | 20 | 1 | 4 | 2 | 18 |
Growing Population | 2 | 8 | 4 | 32 | |
Varying Survival | 3 | 13 | 16 | 29 | |
4 | 25 | 12 | 52 | ||
5 | 45 | 20 | 94 | ||
6 | 59 |
Data Analysis, Presentation, and Interpretation:
First - report the number of beans that were in your population.
Closed Populations - Lincoln/Petersen Index:
Closed Populations - Schnabel method:
Open Populations - Jolley Method:
Jolley A
Jolley B
Equal Catchability:
Use your Jolly results. Test for equal catchability at each date (up to three before the last capture). Conclude about the catchability of the beans.
Thinking about the experiment:
References:
Bailey, N. J. T. 1951. On estimating the size of mobile populations from recapture data. Biometrika 38:293-306.
Cormack, R. M. 1966. A test for equal catchability. Biometrics 22:330-342.
Last updated September 6, 2006