Introduction
Continental and global land cover classifications
from satellite data have largely been derived from annual time series
of the Normalized Difference Vegetation Index (NDVI) as a measure of
phenological variability throughout the year. This approach raises
the question of whether there are remotely sensed variables in
addition to NDVI that improve the accuracies of land cover
clasifications.
Studies to address this issue conclude that multitemporal metrics
derived from reflectances in individual AVHRR bands in addition to NDVI
improve separability between cover types at a global scale. Other studies
demonstrate that NDVI is sensitive to short-term climatic characteristics
and that classifications based on a time series of the surface temperature
to NDVI ratio produce more stable results over an eight year period for
the African continent. Still other researchers suggest the use of metrics
that describe an annual NDVI temporal profile, such as length of growing
season and rate of greenup rather than the NDVI values directly and a
rule-based approach for determining cover type.
We test a number of metrics derived from the six bands of the PAL data
using MSS training data to determine the metrics to be
used as input variables to the classifier. Based on these results, we use
the metrics in a decision tree classifier to obtain a global land cover
classification product.
Classification Algorithm
Decision tree theory has been applied previously to
remotely sensed data. Decision trees predict class membership by
recursively partitioning a data set into more homogeneous subsets.
Different variables and splits are then used to split the subsets
into further subsets. The grown tree can be selectively pruned based
on decision rules to produce more stable predictions of class
membership. For example, lower level splits which contribute little
to the clasification accuracy can be merged into parent nodes.
The classification tree has a number of advantages over
traditional classification methods. First, it is not based on any
assumptions of normality within training area statistics as is a
maximum likelihood approach. Thus it is better suited to those
situations where a single cover type is represented by more than one
set of remote sensing characteristics. Second, the tree can reveal
non-linear and hierarchical relationships between the input variables
and use these to predict class membership. Third, from a practical
point of view, it is immediately apparent which variables contribute
to the discrimination between classes. This can be useful for
defining subsequent suitable inputs for landcover characterizations.
Methods and Results
The next issue concerns the input variables to be
used in the decision tree classifier. Several types of metrics have been
suggested to describe vegetation phenology, and are described in DeFries
et al. (at press).
To determine which suite of metrics to use for deriving global land
cover classification products, we tested five sets of metrics with the
decision tree classifier using MSS training data. These metrics were
derived such that they are independent of the timing of phenological
events and thereby applicable at a global scale. For example, a metric
characterizing the month when green up begins would not be globally
applicable because of phasing of seasons in different parts of the world,
nor would any individual monthly NDVI value. The sets of metrics are
given in Table 3. The basis for each set is as
follows:
- "surface temperature/NDVI metrics" are to characterize the temporal
profile of the surface temperature/NDVI ratio. Surface temperature is
calculated from channels 4 and 5.
- "seasonal NDVI-derived metrics" are calculated from 10-day NDVI
values.
- "simple metrics" are calculated from annual maximum, minimum, mean,
and amplitude (difference between maximum and minimum) for AVHRR bands
1,2,3,4, and 5 and NDVI.
- use of simple metrics in a "logic tree" where we derived a series of
hierarchical trees to subdivide the cover types. For example, the first
tree divides all the training pixels into woody and non-woody vegetation
by aggregating the labeled cover types into these two major categories.
Subsequently, we then grow a tree for all those pixels designated as
"woody" to further subdivide them into "needleleaf" or "broadleaf." This
procedure is carried out so that there is a series of trees for each cover
type. Input variables to the trees were the 24 "simple metrics."
- combined use of all metrics in 1), 2), and 3) to determine whether
some of these metrics in combination would better discriminate some cover
types than the use of any one of these suites of metrics alone.
A sixth test was carried out with equal numbers of pixels in each
cover type to avoid a bias in the tree toward higher accuracies for those
cover types with a larger number of pixels at the expense of cover types
with a fewer number of pixels. The number of pixels was equalized by
replicating training pixels until each cover type included the same number
as that cover type with the maximum number of training pixels. This test
was carried out using the "simple metrics," the suite of metrics which
produced the highest accuracies among the five tested (Figure 5).
Conclusions
On the basis of the results of these tests (Figure 5) we conclude that use of the "simple
metrics" alone (test 3) is to
be preferred. These metrics give substantially better results than
either the Ts/NDVI (test 1) or "seasonal metrics" (test 2) alone.
Moreover, applying the "logic tree" approach to the simple metrics gives
similar overall accuracy but rather lower mean class accuracy and involves
considerable greater computation. Combining the three sets of metrics
from tests 1, 2, and 3 (test 5) gives no better results than for the
"simple metrics" alone.
The test using an equal number of training pixels for each cover
type (test 6) yielded a slightly higher mean accuracy for all
classes, suggesting that the tree might be slightly biased toward
those cover types with a large number of training pixels. From these
results, we decided to use the "simple metrics" with an equal number
of training pixels in each cover type to generate the final global
land cover classification products.
To return to the homepage, click
here.