Using information theory to determine 
optimum pixel size and shape for ecological 
studies: Aggregating land surface 
characteristics in arctic ecosystems
Authors: Paul C. Stoy, M. Williams, L. Spadavecchia, 
R. A. Bell, A. Prieto-Blanco, J. G. Evans, and M. T. 
van Wijk
The final publication is available at Springer via https://dx.doi.org/10.1007/s10021-009-9243-7.
Stoy, P. C., M. Williams, L. Spadavecchia, R. A. Bell, A. Prieto-Blanco, J. G. Evans, and M. T. van 
Wijk. “Using Information Theory to Determine Optimum Pixel Size and Shape for Ecological 
Studies: Aggregating Land Surface Characteristics in Arctic Ecosystems.” Ecosystems 12, no. 4 
(March 10, 2009): 574–589. doi:10.1007/s10021-009-9243-7.
Made available through Montana State University’s ScholarWorks 
scholarworks.montana.edu 
Using Information Theory
to Determine Optimum Pixel Size
and Shape for Ecological Studies:
Aggregating Land Surface
Characteristics in Arctic Ecosystems
P. C. Stoy,1* M. Williams,1,2 L. Spadavecchia,1,2 R. A. Bell,1,3
A. Prieto-Blanco,4 J. G. Evans,5 and M. T. van Wijk6
1School of GeoSciences, University of Edinburgh, Edinburgh EH9 3JN, UK; 2NERC Centre for Terrestrial Carbon Dynamics, University
of Edinburgh, Edinburgh, UK; 3Centre for Ecology, Evolution and Conservation, University of East Anglia, Norwich, UK; 4Department
of Geography, University College London, 26 Bedford Way, WC1H 0AP London, UK; 5Centre for Ecology and Hydrology, Maclean
Building, Benson Lane, Crowmarsh Gifford, Wallingford, Oxfordshire OX10 8BB, UK; 6Plant Production Systems, Wageningen
University, P.O. Box 430, 6700 AK Wageningen, The Netherlands
ABSTRACT
Quantifying vegetation structure and function is
critical for modeling ecological processes, and an
emerging challenge is to apply models at multiple
spatial scales. Land surface heterogeneity is com-
monly characterized using rectangular pixels,
whose length scale reflects that of remote sensing
measurements or ecological models rather than the
spatial scales at which vegetation structure and
function varies. We investigated the ‘optimum’
pixel size and shape for averaging leaf area index
(LAI) measurements in relatively large (85 m2
estimates on a 600 9 600-m2 grid) and small
(0.04 m2 measurements on a 40 9 40-m2 grid)
patches of sub-Arctic tundra near Abisko, Sweden.
We define the optimum spatial averaging operator
as that which preserves the information content
(IC) of measured LAI, as quantified by the nor-
malized Shannon entropy (ES,n) and Kullback–
Leibler divergence (DKL), with the minimum
number of pixels. Based on our criterion, networks
of Voronoi polygons created from triangulated
irregular networks conditioned on hydrologic and
topographic indices are often superior to rectan-
gular shapes for averaging LAI at some, frequently
larger, spatial scales. In order to demonstrate the
importance of information preservation when up-
scaling, we apply a simple, validated ecosystem
carbon flux model at the landscape level before and
after spatial averaging of land surface characteris-
tics. Aggregation errors are minimal due to the
approximately linear relationship between flux and
LAI, but large errors of approximately 45% accrue
if the normalized difference vegetation index
(NDVI) is averaged without preserving IC before
conversion to LAI due to the nonlinear NDVI-LAI
transfer function.
Key words: information content; Kullback–Lie-
bler divergence; leaf area index; Shannon entropy;
spatial averaging; triangulated irregular network;
tundra; upscaling.
INTRODUCTION
Process-based models of ecological function are
usually parameterized using measurements at a
particular plot or suite of plots, which sample only
a small fraction of the landscape. This limited
sampling means that it is difficult to parameterize
ecosystem models across space, which hinders our
ability to accurately estimate multi-scale ecosystem
function. General approaches to this ‘upscaling’
problem remain elusive, but often focus on
describing mechanistic linkages among leaves,
plants, ecosystems, and landscapes (Jarvis and
McNaughton 1986; Leuning and others 1995;
Williams and others 2001), with careful consider-
ation of spatial aggregation and the errors that it
may incur (O’Neill and Rust 1979; Rastetter and
others 1992; Pelgrum 2000).
Although the global carbon (C) budget is well
constrained (Canadell and others 2007), and
intensively measured plots are understood in great
detail (Baldocchi 2008), the intermediate landscape
scale remains poorly quantified. Both ‘top-down’
(Gurney and others 2004) and ‘bottom-up’ (Potter
and others 2006) approaches have been used to
scale from global and local data, but neither have
been completely successful nor consistent.
Addressing the question of scale is central for
bridging the gap between bottom-up estimates of
ecosystem function and top-down estimates from
global earth system models (GCMs, DGVMs) that
are becoming increasingly explicit in their repre-
sentation of the land surface (Essery and others
2003), but require statistical downscaling for re-
gional extrapolation (Wilby and Wigley 1997).
Coping with the natural heterogeneity of landscape
structure and process remains a critical limitation to
scaling efforts.
A common approach to upscaling across space in
physiological and ecosystem ecology is to break the
land surface into pixels that are typically square, and
to represent ecological processes at this pixel scale.
Some studies have sought to implement pixels as the
central unit that describes ecological [for example
‘PROXEL,’ Tenhunen and others (1999)] and/or
hydrological processes (Mauser and others 2001),
and use these process-based pixelmodels as the basis
for scaling up. But pixel size is often selected arbi-
trarily or seemingly imposed by the resolution of
remote sensing platforms (for example, 250–1000 m
for the Moderate-Resolution Imaging Spectroradi-
ometer, MODIS) and the pixels may not have the
optimal grain size to adequately capture the domi-
nant topographic and vegetative features of the
landscape. For example, Rahman and others (2003)
performed a semivariogram analysis to determine
that pixel sizes of 6 m or less are preferred for
hyperspectral studies of California chaparral eco-
systems given spatial variability in vegetation, yet
most attempts at aggregation use a larger grain size.
Heinsch and others (2006) demonstrated errors in
productivity estimates when the spatial scale of plots
measured by eddy covariance is smaller than the
1 9 1 km2 grain of MODIS pixels. Few ecological
studies have investigated the importance of pixel size
and shape for upscaling localmeasurements to larger
spatial scales (but see Rahman and others 2003),
despite efforts from the DGVM and GCM commu-
nities to test the importance of both grid size (Mu¨ller
and Lucht 2007) and shape (Walko and Avissar
2006) in their simulations.
Similarly, few if any examples of upscaling in
ecology consider the loss or change in information
inherent in aggregation across spatial scales, its
consequences, or potential solutions, despite
the widespread use of information-theoretic
approaches in ecological modeling (Ulanowicz
2001; Burnham and Anderson 2002). Changes in
information content (IC) depend of course on how
information is quantified, but always involve the
distribution of the variable of interest as expressed
through the numerical or analytical probability
distribution (or density) function (Shannon 1948;
Kullback 1997). Techniques from information
theory have been applied to quantify how much
information is gained or lost upon increasing or
decreasing the spatial grain of observations
from remote sensing platforms (Chen and Blong
2002), but rarely for upscaling ecosystem function
(Brunsell and Young 2007; Brunsell and others
2008). We propose that metrics for quantifying IC
can be used to select preferential upscaling tech-
niques to minimize information loss or alteration
when changing spatial grain and to avoid aggre-
gation errors due to potential nonlinearities in the
upscaling procedure (Pelgrum 2000).
Here, we investigate approaches from informa-
tion theory for quantifying optimum pixel sizes and
shapes for process-based ecological studies, focus-
ing on the spatial averaging of normalized differ-
ence vegetation index (NDVI) measurements and
associated leaf area index (LAI) estimates in an
Arctic tundra ecosystem. By ‘optimum,’ we suggest
that a spatial averaging operator should: (1) pre-
serve the IC of a key measured ecosystem attribute,
in this case LAI, and (2) minimize the number of
pixels (maximize pixel size) for efficient computa-
tion across time and space, for example, in con-
junction with landscape-level data assimilation
techniques (Williams and others 2005; Quaife and
others 2008). Thus, our definition of optimum
takes a practical rather than strict mathematical
view. Specifically, we ask the question: how does
the spatial averaging operator alter the probability
distribution function (pdf) of LAI (hereafter p(LAI))
measured at finer scales in sub-Arctic tundra eco-
systems, and can we find an averaging operator
with a minimum number of pixels that retains the
fine-scale IC of LAI?
LAI averaging is investigated because of its gen-
eral importance in describing plant structure and
function (Monteith and Unsworth 1990), and be-
cause of its particular importance in the sub-Arctic
tundra ecosystems that we investigate here. LAI is
strongly related to nutrient distribution such as
bulk canopy nitrogen (N) in tundra ecosystems
(Williams and Rastetter 1999; van Wijk and others
2005), which are in turn coupled more strongly to
short-term productivity than C supply, air tem-
perature, or vegetation life-form (Shaver and oth-
ers 1986, 1992, 2007). Consequently, there is a
strong relationship between LAI and gross primary
productivity (GPP) in Arctic tundra, regardless of
species (Street and others 2007). LAI plays a central
role in creating patterns of C uptake and nutrient
distribution across space across Arctic ecosystems,
and its spatial distribution must be accurately
characterized in a simple way for spatial modeling.
LAI may have a linear relationship with some
Arctic ecosystem processes, such as GPP, over part
of its range (Street and others 2007). In this case,
only an accurate estimate of mean landscape-level
LAI is required to obtain an unbiased estimate of
spatially averaged GPP. However, nonlinear rela-
tionships exist between LAI and the surface radia-
tion balance (Kustas and Norman 2000), and thus
soil temperature and hydrology. Any alteration of
p(LAI) is likely to result in inaccurate or biased
upscaled averages or sums of these quantities (Ra-
stetter and others 1992). Similarly, spatial averag-
ing of meteorological, hydrological, edaphic, or
earth observation (EO)-based model inputs may
incur errors if the model transfer function is non-
linear (Pelgrum 2000). For example, the transfer
function between remotely sensed NDVI and LAI in
Arctic tundra is highly nonlinear (van Wijk and
Williams 2005; Williams and others 2008), and
thus prone to averaging errors which we demon-
strate in a C cycle modeling example. These issues
are referred to as the subgrid scaling problem
(Entekhabi and Eagleson 1989; Kustas and Norman
2000), and we seek to address this problem in part
using techniques from information theory.
The central premise of our approach is that the
fine-scale IC can and should be preserved to obtain
an ‘upscaled’ landscape that shares similar proba-
bilistic features of the land surface at finer scales
(Figure 1). Spatial averaging may incur some
alteration of IC due solely to the central limit the-
orem: if each pixel contains a random distribution
of LAI values, p(LAI) will approach a Gaussian
(normal) distribution upon averaging (Figure 1).
However, vegetation need not be distributed ran-
domly, particularly in tundra ecosystems where
plant distribution is often clumped (Bliss 1962;
Spadavecchia and others 2008). We can take
advantage of these non-random landscape features
in our spatial averaging approach if topographic
features that are related to ecosystem hydrology,
insolation, or exposure are statistically related to
LAI (Spadavecchia and others 2008), noting the
coupling between microtopography and C cycle
function in Arctic ecosystems (Sullivan and others
2008). Accounting for coupled LAI-microtopo-
graphical variation may represent an improvement
over simple grid-based averaging given our criteria
for selecting an optimal averaging operator.
Remotely sensed data generally represent aver-
age reflected radiant flux per unit ground area (per
solid angle of the sensor), convolved with the
instrument point-spread function over a square or
rectangular pixel. The field of view is often ellipti-
cal, and consequently the square or rectangular
output is the result of some subsampling and
averaging. Despite the tendency for remote sensing
products to deliver rectangular pixels, some
examples from the hydrological literature have
demonstrated the usefulness of other shapes for
spatially explicit modeling. Namely, triangular
irregular networks (TINs) (Kumler 1994; Ivanov
and others 2004; Vivoni and others 2004, 2005b)
are explored as a means to generate Voronoi
polygons for spatial averaging, and we investigate if
TINs present advantages for spatial averaging over
regular grid-based averaging schemes. TINs can be
generated efficiently (Goodrich and others 1991)
and, along with the Delaunay triangulation that
forms their basis, are supported in common soft-
ware packages such as ArcGIS (ERSI, Redlands,
CA) and MATLAB (MathWorks, Natick, MA). TINs
can also be implemented without the subjectivity
imposed by user parameterization. We envision
that TINs can provide a connection among studies
on ecosystem structure, computational hydrology,
and other effects of topography [for example,
radiation distribution or topographic exposure
(TOPEX)] on controlling vegetation distribution
and function, and thus present a formal method for
efficiently co-classifying and upscaling multiple
processes from the plot to the landscape scale.
We argue that there is a quantifiable optimum
pixel size and shape that can be determined by
exploring the IC of spatially averaged land surface
features. Specifically, we hypothesize that, given
the importance of topography in controlling plant
distribution in Arctic landscapes (Walker and oth-
ers 1995) and for the arrangement of LAI and
surface topography in our study area, TINs based
on hydrologic similarity will be superior averaging
operators than simple grids (Hypothesis 1). In this
sense, rectangular averaging operators represent a
null hypothesis. Furthermore, TINs conditioned
upon topographic features related to statistical
clumping of LAI for our study ecosystems (Spada-
vecchia and others 2008), namely TOPEX at small
spatial scales, will be superior to those based on
hydrologic similarity alone (Hypothesis 2). We
investigate the spatial averaging of LAI at two
canonical scales in sub-Arctic tundra: the sub-me-
ter or ‘micro-scale’ using intensively sampled data
from van Wijk and Williams (2005), and the scale
of tens-of-meters (‘macro-scale’) based on extend-
ing the findings of van Wijk and Williams (2005) to
a larger area (Williams and others 2008). We then
use insight from the averaging exercises to illustrate
an ‘optimum’ TIN that couples hydrologic and vege-
tative similarity at the macro-scale, and use this
aggregation procedure to demonstrate the impor-
tance of information preservation for upscaling eco-
logical function using a validated ecosystem C flux
model (Shaver and others 2007) driven by meteoro-
logical measurements made near the study site.
METHODS
Study Sites
Changes in IC incurred by the spatial averaging of
LAI are explored using two published datasets of
LAI distribution from a tundra ecosystem near
Abisko, Sweden (410,130 m east, 7,583,892 m
north, UTM zone 34 W). Long-term average
Figure 1. A conceptual example of the effect of spatial averaging operators, in this case (A) squares and (B) Voronoi
polygons created from a TIN, on altering the probability distribution, and thus IC, of LAI in a sub-Arctic tundra landscape.
The right-hand graphs represent a conceptual probability distribution of LAI (top) and how it may be modified by spatial
averaging (bottom). It is envisioned that certain sizes and/or shapes will retain the IC of fine-scale LAI measurements in a
way that also maximizes average pixel size. A properly conditioned TIN may also be useful for quantifying the effective
length and volume of the edge between different vegetative types as indicated by the blue lines. Polygons may be combined
by removing unnecessary edges as demonstrated with the yellow line. Source: Image courtesy of Mathew Williams.
rainfall at Abisko is 400 mm y-1 and the average
temperature is -1C (Christensen and others
2004). The study domain comprises a gentle 5
downward slope from SW to NE. Topographical
variations also occur at higher spatial frequency in
heath-dominated patches, with mound/pit struc-
tures at a spatial scale of several meters (van Wijk
and Williams 2005; Spadavecchia and others 2008;
Williams and others 2008). Vegetation at the study
site is dominated by a low heath characterized by
Empetrum nigrum L. Betula nana L. often grows in
sheltered dips (van Wijk and Williams 2005). Some
wooded areas are also present as the macro-scale
dataset lies close to the transition zone from tundra
to birch woodland. These woodland areas are
characterized by Betula pubescens ssp. tortuosa
(Ledeb.) Nyman with an understory that is com-
monly comprised of Vaccinium species including
V. myrtillus L., V. uliginosum L., and V. vitis-idaea
L. Wooded areas are excluded from the example of
C flux modeling, as the model was parameterized
for pan-Arctic tundra ecosystems. A stream run-
ning through the center of the study area is bor-
dered by shrubby riparian vegetation characterized
by B. nana and Salix spp. Soils are rocky and well
drained (Jonasson and others 1999).
LAI Measurements
The ‘macro-scale’ dataset (Williams and others
2008) includes 228 NDVI measurements (Skye
Instruments 2 Channel Sensor SKR1800, Skye
Instruments, Llandrindod Wells, UK) of 85 m2 plots
that were converted into LAI estimates using the
methods described in the study of van Wijk and
Williams (2005). Briefly, van Wijk and Williams
(2005) combined 5625 LAI-2000 (LI-COR, Lincoln,
NE, USA) and Skye SKR1800 NDVI measurements
with direct LAI measurements from 81 destructive
harvests to create a LAI map of the micro-scale
study area at the scale of 0.2 9 0.2 m2. The ‘micro-
scale’ dataset (van Wijk and Williams 2005) in-
cludes LAI estimates from these 5625 paired LAI
and NDVI measurements. The study domain was
nearly 600 9 600 m2, which encompasses the
40 9 40 m2 domain of the micro-scale dataset
(Spadavecchia and others 2008; Williams and oth-
ers 2008). Both sets of measurements were taken
during the peak growing season at Abisko.
Methods: IC
Three metrics are employed to quantify the alter-
ation in IC incurred by spatial averaging:
the normalized Shannon entropy (ES,n) (Shannon
1948; Wesson and others 2003) and the numerical
and analytical Kullback–Leibler divergence (DKL)
(Kullback and Leibler 1951; Kullback 1997).
The ES,n is simply the absolute value of the
Shannon entropy divided by the log of the number
of histogram bins (N) into which the distribution is
divided such that potential values are bounded
between 0 and 1.
ES;n ¼
PN
i¼1 p ið Þ ln p ið Þ
ln Nð Þ


; ð1Þ
where p(i) is the discrete probability distribution of
any variable x, here LAI or NDVI. The ES,n takes a
maximum value of 1 for a uniform distribution,
and a minimum of 0 for a Dirac delta function
(Katul and others 2001; Stoy and others 2006). pdfs
of different size and shape may have the same ES,n,
for example, two identically shaped distributions
with different ranges. In order to avoid this limi-
tation, we scaled the bins’ locations between 0 and
the maximum measured LAI for the micro- and
macro-scale datasets, respectively.
The DKL (also called the Kullback–Leibler differ-
ence, information divergence or gain, or relative
entropy) quantifies the difference between two
arbitrary pdfs q and m:
DKL ¼
XN
i¼1
q ið Þ ln q ið Þ
m ið Þ ; ð2Þ
where, in our case, q represents the probability
density function of the LAI measurements at either
the micro-scale or macro-scale and m is the resul-
tant pdf of LAI after spatial averaging (Kullback and
Leibler 1951; Kullback 1997).
N of 10 bins was chosen for both the ES,n and DKL
as the length of the data series becomes small when
the size of the spatial averaging operators become
large. Results may be sensitive to N. In order to
avoid these limitations, analytical expressions that
do not require binning can be employed. A logical
choice for modeling p(LAI) is the gamma distribu-
tion as LAI is a continuous variable bounded by
zero. The DKL for the gamma distribution (CDKL)
can be quantified using
CDKL ¼ log C a0ð Þb
a0
0
C að Þba00
 
þ a a0ð Þw að Þ þ a b b0b0
;
ð3Þ
where a and b are the shape and scale parameters
of the gamma distribution determined using max-
imum-likelihood estimation, C(a) is the gamma
function for a, and w(a) is the digamma function
for a (Mathiassen and others 2002).
Similarly, NDVI is constrained between 0 and 1,
and a logical choice for modeling its pdf is the beta
distribution, for which the DKL is:
BDKL ¼ log B g;qð Þ
B g0;q0ð Þ
 
 g g0ð Þw g0ð Þ
 q q0ð Þw q0ð Þþ g g0þ q q0ð Þw g0þ q0ð Þ
ð4Þ
where B g; qð Þ is the incomplete beta function for
the beta shape parameters g and q.
Rectangular Averaging Operators
We first consider simple grids as spatial averaging
operators, beginning with squares oriented in the
ordinal directions, then rotated 45 to test if grid
cell orientation has a discernable impact on IC after
spatial aggregation. We then explore 3:1 rectangles
oriented both north–south and east–west, as well as
45 rotated versions of these rectangles to test
whether spatial averaging with respect to topo-
graphical features, namely the direction of the
slope, which runs SE to NW, can result in an im-
proved spatial averaging operator based on our
criteria. These results demonstrate that topographic
features play a role in spatial clumping of vegeta-
tion at the study sites, and motivate a study of
spatial averaging using TINs.
Triangulated Irregular Networks
TINs are commonly used to visualize and quantify
topography (Peuker and others 1978), and have
recently been used to create computationally effi-
cient representations of watershed topography for
hydrological models (Ivanov and others 2004;
Vivoni and others 2004, 2005a, b; Hancock 2006).
TINs are based on Delaunay triangulation, which
maximizes the minimum angle of all triangles (that
is, avoids creating triangles that are highly obtuse
or with a severely acute internal angle) for a given
set of nodes. Unique corresponding Voronoi poly-
gons can be created by connecting the perpendic-
ular bisectors of the Delaunay triangles. These
Voronoi polygons are investigated as spatial aver-
aging operators here.
We begin our investigation of spatial averaging
with TINs by using the topographic convergence
index (k, Beven and Kirkby 1979) to select node
density, following the approach described by Viv-
oni and others (2005b). Vivoni and others (2005b)
argued that TIN node density (dnodes) should follow
a functional relationship with the topographic
feature of interest, that is, dnodes (or mean node
spacing) = f(k), to add spatial resolution in areas of
higher flow for modeling watershed hydrology. In
our case, k roughly follows a gamma distribution at
both micro- and macro-scales, and a logical choice
of f(k) is a cumulative distribution function of the
gamma distribution. Nodes were randomly placed
until they reached the density that results in a
mean pixel size that represents a range between
large and small pixels. We chose 10 bins, similar to
the 7 bins chosen in the Vivoni and others (2005b)
study. Note that our approach is similar to that
described by Figure 3 of Vivoni and others (2005b),
but we use the number of nodes per k bin, rather
than mean point spacing, as our metric of node
density. After creating the TIN, LAI values were
determined to be within a particular Voronoi poly-
gon using a point-in-polygon algorithm (MATLAB,
Natick, MA, USA).
A neutral case [f(k)0] was also investigated to test
the effects of randomly positioned Voronoi poly-
gons versus TINs conditioned on topographic indi-
ces. f(k)0 maintains constant dnodes per topographic
index bin, yet uses the same random node place-
ment algorithm and bin size. Due to the random
nature of the node placement, a Monte Carlo
method with 30 iterations was chosen to quantify
the statistics and possible results for each mean
pixel size for both f(k)1 and f(k)0.
After the preliminary study using TINs condi-
tioned on k, we explore TINs based on TOPEX.
Statistically higher clumping was observed at the
micro-scale at low and high TOPEX as quantified by
Moran’s local indicators of spatial association
(LISA) (Anselin 1995; Spadavecchia and others
2008). TOPEX was normally distributed across the
micro-scale study domain, so we chose a form of
f(TOPEX) based on the normal distribution for this
analysis. Namely, f(TOPEX) has low dnodes at higher
and lower TOPEX to create larger Voronoi poly-
gons in areas where the probability of vegetation
clumping is greater. The null case f(TOPEX)0, sim-
ilar to f(k)0, is also examined.
Digital Elevation Map and Topographic
Indices
A detailed digital elevation map (DEM) with
appropriate spatial resolution is required to quan-
tify k and TOPEX. The DEM used for the macro-
scale analysis was created from airborne LiDAR
(light detection and ranging) data collected in July,
2005 using an Optech Airborne Laser Terrain
Mapper 3033 (Optech Inc., Vaughan, ON, Canada).
The point cloud was gridded on 4 m grid cells using
minimum values of the last pulse data recorded.
The DEM used for the micro-scale analysis was
created using surveying techniques described in the
study of van Wijk and Williams (2005).
k is defined as ln(d/tan v) where d is the upslope
contributing area determined using the flowdirec-
tion and flowaccumulation commands standard in
ArcGIS and v is the local slope (Beven and Kirkby
1979). TOPEX was determined using TOPO-
SCALE.aml written by Niklaus Zimmerman and
available at www.wsl.ch/staff/niklaus.zimmermann/
programs/aml4_1.html.
PLIRTLE Model
The ‘PLIRTLE’ ecosystem C flux model of Shaver
and others (2007) is used for the analysis of spatial
aggregation on upscaled estimates of biogeochem-
ical cycling. PLIRTLE derives its name from its
representation of photosynthesis (P) as a function
of LAI and irradiation (here photosynthetic proton
flux density, PPFD), and of ecosystem respiration
(ER) as a function of air temperature (T) and LAI.
The model for P model follows the aggregated
canopy photosynthesis model of Rastetter and
others (1992). P is assumed to follow a saturating
response to PPFD and is integrated through the
canopy using the Beer–Lambert law for light
attenuation:
P ¼ Pmax
k
ln
Pmax þ EoPPFD
Pmax þ EoPPFDekLAI
  
; ð5Þ
where k is the light extinction coefficient (assumed
here to be 0.5, Shaver and others 2007), Eo is the
light sensitivity of photosynthesis, and Pmax is
maximum photosynthesis.
Shaver and others (2007) found that ER models
that include two sources—one sensitive to LAI and
one not, the latter presumably from deeper soil
horizons—fare better than assuming a single sub-
strate pool when modeling chamber-based flux
measurements from a pan-Arctic dataset. We use
the model ‘ER2’ (Shaver and others 2007):
ER ¼ LAI R0ebT þ Rx; ð6Þ
where R0 is base respiration at 0C, Rx is the LAI, T
is insensitive component of ER, and b is the tem-
perature sensitivity of ER. We use the parameter set
derived for pan-Arctic measurements in both Too-
lik Lake, AK and Abisko for the C cycle simulation
(Table 1). The choice of the pan-Arctic rather than
the local measurement set follows from the find-
ings of Shaver and others (2007), who demon-
strated similar PLIRTLE parameter sets across
tundra vegetation types in Alaska and Sweden and
thus functional convergence in the C cycle of Arctic
tundra.
Meteorological Measurements
Meteorological measurements for applying PLIRTLE
were made on the nearby ABACUS Arctic tundra
meteorological tower located at 411,191.84 m east,
7,577,785.175 m north, 751.907 m asl. PPFD was
measured using an SKP 215 Quantum Sensor (Skye
Instruments, Llandrindod Wells, UK) at a height of
2 m.Twasmeasuredusing aHMP45 sensor (Vaisala,
Helsinki, Finland) at a height of 2 m. Periods of
missingmeteorological data were gapfilled using the
linear relationship between PPFD and T data
from the meteorological station at the nearby Abi-
sko Scientific Research Station (410,130 m east,
7,583,892 mnorth [UTMzone 34 W], 427.06 masl)
and the ABACUS Abisko Birch meteorological
and eddy covariance tower (410,615.906 m east,
7,580,681.110 m north, 569.950 m asl). A period
from June 9, 2007 to September 22, 2007 that
roughly corresponds to the growing season atAbisko
was chosen for this analysis.
RESULTS
Results from the IC analysis on the spatial averag-
ing of LAI with rectangular pixels are presented
first, and are then compared with results from the
TINs conditioned on topographic features (Vivoni
and others 2004, 2005b). The case study on up-
scaling modeled C flux estimates using IC-pre-
serving and naı¨ve spatial averaging approaches
follows in the ‘‘Discussion’’ section.
Rectangular Averaging Operators
The ES,n of spatially averaged LAI changed by not
more than 0.07 (7% of the potential range) at the
macro-scale for all rectangular shapes examined at
pixel sizes up to 103.9 m2 (Figure 2A); this is the
difference in ES,n between the LAI data as indicated
by the ‘x,’ and ES,n after spatial aggregation of LAI
Table 1. The Pan-Arctic Parameter Set for the
PLIRTLE Model with ER Model 2 and Fixed Light
Extinction Coefficient (k) after Shaver and others
(2007)
Parameter Value
Pmax 15.831
1
k 0.5
E0 0.036
R0 0.602
1
b 0.074
Rx 0.547
1
1lmol CO2 m
-2 s-1.
within square pixels. ES,n dropped by up to 17%,
depending on shape, upon averaging with rectan-
gles larger than 103.9 m2. The DKL of rectangular
pixels increased rapidly from zero at a pixel sizes
larger than approximately 103.2 m2 (Figure 2C). The
increase in CDKL with larger rectangular pixels was
not as pronounced (Figure 2E), but occurred at a
similar pixel size. Large rectangles with east–west
and southwest–northeast orientation, those roughly
perpendicular to the stream, retained ES,n nearer to
fine-scale measurements and returned lower DKL
and CDKL than other shapes of the same size.
There was a steady decline in the ES,n with
increasing pixel size when averaging micro-scale
LAI data (Figure 2B). (Pixels smaller than 10-1 m2
were not investigated as pixel sizes this small are
unlikely candidates for ecological or hydrological
studies.) The ES,n of square pixels decreased less
than other rectangular shapes with increasing pixel
size. The DKL of squares and NW–SE (along stream)
rectangles were smaller than other rectangular
shapes at the largest pixel size investigated
(Figure 2D). The change in CDKL for the micro-
scale dataset was roughly equal for all shapes and
sizes investigated, but squares had the lowest CDKL
at the largest pixel size (Figure 2F).
The causes of the alterations in IC (Figure 2)
were the changes in p(LAI) after spatial averaging
102 103 104 105
0.45
0.50
0.55
0.60
0.65
0.70
0.75
x
E S
,n
Macro-Scale
A
10-2 100 102
0.30
0.35
0.40
0.45
0.50
0.55
0.60
0.65 x
Micro-Scale
B
10 2 10 3 10 4 10 5
0
0.5
1
1.5
2
2.5
3
3.5
4
D
KL
x
C
10-2 100 102
0
1
2
3
4
5
6
x
D
102 103 104 105
0
0.1
0.2
0.3
0.4
0.5
Γ
D
KL
x
pixel size (m2)
E
10-2 100 102
0
0.1
0.2
0.3
0.4
0.5
x
pixel size (m2)
F
Squares
N-S Rect.
E-W Rect.
Diamonds
NW-SE D
SW-NE D
Figure 2. The change in IC upon
spatial averaging of LAI using
various rectangular shapes and
orientations for the macro-scale
and micro-scale study areas near
Abisko, Sweden. The normalized
Shannon entropy (ES,n),
numerical Kullback–Leibler
divergence (DKL) and analytical
Kullback–Liebler divergence for
the gamma distribution (CDKL)
are the IC metrics explored. The
xs signify the ES,n, DKL, and CDKL
at the smallest spatial scale of the
respective measurements: 85 m2
(approximately 102 m2) in the
case of the coarse scale data and
0.04 m2 for the fine scale data.
demonstrated in Figure 3. Choosing a relatively
small pixel size for averaging the macro-scale
measurements set resulted in p(LAI) that retained
the canonical shape of the data (Figure 3A). In
contrast, larger square pixels decreased the p(LAI)
in the smallest bin and returned no values for the
five largest potential bins; these were ‘averaged
out.’ An anomalous peak at an LAI near 2 m2 m-2
is also observed after averaging with large square
pixels. This feature is not present in the measure-
ments. The ES,n, DKL, and CDKL quantify these
deviations from the original LAI distribution.
A similar pattern can be observed at the micro-
scale. Choosing relatively small square pixels as
averaging operators altered the shape of measured
p(LAI) less than larger pixels. p(LAI) decreased in
the lowest LAI bin and increased in bin near LAI
= 0.5 (Figure 3B) upon averaging with larger
square pixels. p(LAI) increased in LAI bins larger
than approximately 2.5 due to the spatial clumping
of patches with relatively high LAI (Spadavecchia
and others 2008).
Spatial Averaging Using TINs
TINs conditioned on k outperformed rectangular
spatial averaging operators, on average, based on
our criteria that large pixels that preserve the ES,n,
DKL, and CDKL of measured fine-scale LAI are
superior for land surface aggregation (Figure 4).
The change in IC incurred by employing square
averaging operators frequently exceeded that of
any of the 30 iterations of the TIN node placement
algorithm with larger mean pixel sizes at the micro-
scale (Figure 4). TINs did not represent an
improvement over rectangular pixels for pixel sizes
smaller than about 103.4 m2 for the macro-scale
data set or approximately 1 m2 for the micro-scale
dataset. Differences in the IC metrics between f(k)
and f(k)0 were trivial at both macro-scale and mi-
cro-scale.
It was noted by Spadavecchia and others (2008)
that TOPEX, rather than k, was significantly related
to vegetation clumping at the micro-scale; LAI was
clumped at both higher and lower TOPEX. TINswith
node density based on a normal distribution (lower
dnodes) at low and high TOPEX [f(TOPEX)] were
analyzed, along with a null case that did not vary
dnodes by TOPEX bin [f(TOPEX)0]. The ES,n and CDKL
of f(TOPEX) changed less at large mean pixel sizes,
on average, than f(TOPEX)0 (Figure 5). f(TOPEX)0
showed an increase in ES,n across many mean pixel
sizes, indicating that p(LAI) was approaching a
uniform distribution, which also resulted in the
observed increase in CDKL (Figure 5). Alterations in
the DKL were nearly identical for f(TOPEX)1 and
f(TOPEX)0 (Figure 5B).
It should be noted that some iterations of the TIN
node placement algorithm resulted in IC alterations
that were equivalent to or ‘worse,’ based on the IC-
preserving criteria, than rectangular averaging oper-
ators for allmeanpixel sizes investigated for averaging
the macro-scale dataset, and for larger pixel sizes in
themicro-scale dataset (Figures 4 and 5).At the same
time, all mean pixel sizes investigated included TIN
iterations that were ‘superior’ averaging operators,
and the outcome was better than aggregating using
square averaging operators more often than not.
DISCUSSION
Hypothesis 1
The first hypothesis stated that TINs conditioned on
topographic features will be superior averaging
operators than rectangular shapes, given our cri-
terion that landscape grids for process-based eco-
system studies should preserve IC and maximize
pixel size. We found support for this hypothesis, in
most cases, for the ecosystems studied here. Spatial
averaging of LAI using TINs better-preserved, on
average, the IC of measurements at the macro- and
micro-scales. The change in IC incurred by rect-
angular averaging operators was greater than that
of all iterations of the TIN mesh generation algo-
rithm at some mean pixel sizes. On the other hand,
some TIN iterations changed IC more than rectan-
gular averaging operators due to the random node
0 2 4 6
0
0.10
0.20
0.30
0.40
0.50
LAI
p(L
AI
)
a
Measured
Small Pixel
Large Pixel
0 1 2 3 4
0
0.10
0.20
0.30
0.40
0.50
LAI
b
Figure 3. The change in the probability density of LAI after
spatial averaging with relatively small and large square
pixels for the (A) macro-scale and (B) micro-scale LAI
measurements. The small pixels demonstrated here are
1.02 9 103 m2at themacro-scaleand0.79 m2at themicro-
scale. Large pixels are approximately 1.37 9 104 m2 at the
macro-scale and 13.4 m2 at the micro-scale.
placement algorithm. Any application of TINs for
land surface representation that uses random node
selection should be carefully analyzed to ensure
that the outcome is desirable.
Hypothesis 2
We hypothesized that TINs conditioned on the
landscape features that were most related to LAI
clumping, namely TOPEX (Spadavecchia and oth-
ers 2008), would be superior to TINs based on
topographic convergence (k) at the micro-scale.
Mean IC after averaging using f(TOPEX) was
broadly similar to f(k) and f(k)0, with a lower range
(Figures 4 and 5), and had lower CDKL than f(TO-
PEX)0 over much of its range. The largest statisti-
cally significant clumps of LAI similarity occurred
in the upstream area near the stream and in an
exposed micro-plateau to the east of the stream
(Spadavecchia and others 2008). Most random TIN
iterations did not draw polygons near the edges of
these features; hence, hypothesis 2 did not hold
due to the random TIN node placement. Non-
random TIN node placement may represent an
improvement for simplifying the spatial patterns
that occur across landscapes.
102 103 104
0.40
0.45
0.50
0.55
0.60
0.65
0.70
0.75
0.80
x
E S
,n
Macro-Scale
A
102 103 104
0
1
2
3
4
5
6
7
8
x
D K
L
C
mean f(λ  )
mean f(λ    )0
Square
102 103 104
0
0.02
0.04
0.06
0.08
0.10
0.12
0.14
0.16
x
Γ D
KL
E
Mean pixel size (m2)
10-2 100 102
0.45
0.50
0.55
0.60
0.65
0.70
0.75
0.80
x
Micro-Scale
B
10-2 100 102
0
0.5
1
1.5
2
2.5
x
D
10-2 100 10 2
0
0.05
0.10
0.15
0.20
0.25
0.30
0.35
0.40
x
Mean pixel size (m2)
F
Figure 4. Same as Figure 2, but
for spatial averaging using TINs
conditioned on topographic
convergence index (k). f(k) uses
an increasing gamma
distribution of node density as
explained in the text. f(k0)
employs a node placement
density that does not vary with k
bins. ‘Square’ follows Figure 2.
The TINs were created with
random node placement; 30
iterations of the node placement
routine were used to explore the
variability in normalized
Shannon entropy (ES,n),
numerical Kullback–Liebler
divergence (DKL) and analytical
Kullback–Liebler divergence for
the gamma distribution (CDKL)
incurred by spatial averaging for
larger pixel sizes. The vertical bars
represent one standard deviation
about the mean ES,n for the
upper panels, and the minimum
and maximum of all observed
DKL and CDKL.
TINs as Averaging Operators
Are the Voronoi polygons themselves the optimum
shape for land surface representation? Not neces-
sarily; highly irregular shapes that correspond more
closely to vegetation distribution and/or topogra-
phy in landscapes with high relief such as the one
investigated here may effectively represent the
variability in land surface properties with even
fewer pixels. The TINs objectively created different
sized pixels based on land surface characteristics,
which removed the role of the user for pixellating
the landscape. Along this line of reasoning, there is
no reason why Voronoi polygons could not be
combined in cases where adjacent polygons repre-
sent similar land surface characteristics to capture
the irregular shapes that may describe land surface
patches. Such a case is demonstrated by the yellow
line in Figure 1B; if there is no discernable advan-
tage to include an edge it can be removed to further
decrease the number of pixels used to represent the
land surface.
On the same note, If TINs effectively average
land surface patches, their edges may provide an
estimate of the location and length of edges be-
tween vegetation types and/or land surface fea-
tures. Quantifying ‘edge’ may be critical for
upscaling ecosystem function. For example, light-
saturated photosynthesis in the transition zone
between E. hermaphroditum and V. uliginosum pat-
ches near the study site was nearly 20% higher,
than photosynthesis in the center of patches dom-
inated by E. hermaphroditum (Fletcher and others
unpublished data). Upscaled estimates of photo-
synthesis in the Abisko landscape may be highly
biased if vegetation function at plot edges is not
quantified. Edges with narrower or wider areas of
influence could be easily incorporated into the
Voronoi polygons as demonstrated by the blue bars
in Figure 1B. In this way, TINs can be used to
preserve the IC of land surface features without
ignoring important dynamics of vegetation func-
tion such as increased competition at the edges of
vegetation patches or other land surface features
such as streams or ridges. In addition, concepts
from information theory can be applied to ensure
that the representation of the land surface for
modeling approaches that of the finest grain of
observation. To model land surface function at
multiple scales, hierarchies of Voronoi polygons
may be created to move from individual vegetation
patches to larger scales of land surface similarity
(Gold and Angel 2006).
The demonstration that TINs based on k have the
potential to simplify landscape complexity while
preserving IC of vegetation features opens the
possibility that TINs for watershed hydrology
(Vivoni and others 2004, 2005a) can be coupled to
hydrologic models that incorporate vegetation dis-
tribution. For example, if the intuitive assumption
is made that model patches with greater LAI sup-
port more transpiration, TIN-based watershed
models for both subsurface and soil-atmosphere
water transport can be created. Such models may
be used for upscaling ecological and hydrological
10-2 100 102
0.45
0.50
0.55
0.60
0.65
0.70
0.75
0.80
x
E S
,n
A
10-2 100 102
0
0.5
1
1.5
2
2.5
x
B
D
KL
mean f(TOPEX)
mean f(TOPEX  )
  0Square
10-2 100 102
0
0.05
0.10
0.15
0.20
0.25
0.30
0.35
0.40
x
Mean pixel size (m2)
C
ΓD
KL
Figure 5. Same as Figure 4, but using TINs conditioned
TOPEX for the micro-scale dataset. TOPEX was identified
as most significant for LAI clumping as determined
Moran’s LISA (Spadavecchia and others 2008).
function while minimizing computational load and
preserving fine-scale information. An example of
such an ‘optimal’ TIN based on k is presented in
Figure 6A. The CDKL of this TIN is <0.05, and the
mean pixel size is approximately 8500 m2. A
comparison of the pdf of the LAI measurements
versus TIN-aggregated LAI is presented in Fig-
ure 6B. A spatially explicit land surface represen-
tation of this sort may help simplify some of the
nonlinear impacts of fine-scale patterns in water
and radiation fluxes and help address the sub-grid
scaling problem (Kustas and Norman 2000; Badiya
Roy and Avissar 2002).
Case Study: Upscaling Modeled
Landscape-Level C Flux
In order to demonstrate an application of IC-pres-
ervation while upscaling estimates of ecological
function, we present an analysis of modeled C flux
using the PLIRTLE model for the late growing-
season period at Abisko using measured meteoro-
logical inputs with measured and aggregated NDVI,
and estimated and aggregated LAI. The macro-scale
measurements in treeless areas, corresponding to
tundra vegetation, were used. NDVI was converted
to LAI either before or after the averaging step
using the NDVI-LAI transfer function for the Skye
NDVI sensor positioned 1.5 m above ground level
(Williams and others 2005):
LAI ¼ 0:00067e9:237NDVI: ð7Þ
NDVI and LAI were considered static in time for
this application meant to represent a growing-sea-
son period without transient LAI dynamics.
Averaging measured NDVI using large (7335 m2)
square pixels resulted in mean LAI estimates that
were 11% smaller (0.723) than LAI estimates from
the average of the NDVI measurements, which
resulted in modeled NEE 45% greater (closer to
zero) than the estimates that resulted from the
NDVI measurements themselves (Table 2). (The
micrometeorological convention where flux from
atmosphere to biosphere is considered negative is
used here.) This is despite the mean of measured
and aggregated NDVI being trivially different (ca.
-0.01%).
In order to alleviate these errors, we created a
TIN that preserved the IC of NDVI by iterating
through the TIN node selection algorithm until an
output in which the BDKL (equation 4) between
measured and averaged NDVI was <0.005 was
chosen. The mean NDVI and resulting mean LAI
between the measured and aggregated approaches
was nearly identical, and the difference between
NEE estimates from the NDVI measurements and
averages was on the order of 1% (Table 2). This is
despite similar mean pixel size between the TIN
(7767 m2) and square grid (7335 m2).
Averaging LAI itself using square pixels or opti-
mized TIN outputs resulted in minor differences in
model NEE output, on the order of 5 to 8%,
because the relationship between LAI and NEE in
PLIRTLE is roughly linear over the expected range
of LAI in tundra ecosystems (Table 2, Figure 7A),
0 1 2 3 4 5 6
0
0.05
0.10
0.15
0.20
0.25
0.30
0.35
0.40
0.45
0.50
LAI (m   m   )
p(L
AI
)
ES,n = 0.60 (0.58)
DKL = 1.59
ΓDKL = 0.0043
Measured LAI
Aggregated LAI
2   -2
Figure 6. (Top) An example of a TIN for spatial averaging
of LAI conditioned on hydrologic similarity via the
topographic convergence index, k, at the macro-scale
(shading, see right-hand side scale). The magnitude and
position of measured LAI (Figure 2A) are displayed as
white shapes. This TIN is near ‘optimal’ given the crite-
rion that averaging operators should preserve the IC
while maximizing pixel size; the analytical Kullback–
Liebler divergence for the gamma distribution (CDKL) for
this TIN is <0.005. The similarity of the measurements
and aggregates is also revealed by a comparison of the
probability distributions of measured and aggregated LAI
(bottom).
in accordance with recent findings on the roughly
linear relationship between LAI and flux in tundra
ecosystems (Street and others 2007).
The major difference between the square and IC-
preserving upscaling routine began with the slight
difference in the distribution of NDVI after aver-
aging in comparison with the measurements
(Figure 7B). There were fewer NDVI aggregates in
the highest and lowest bins and more in the bin
encompassing 0.73; that is, the shape of the NDVI
distribution was nearer that of a normal distribu-
tion. This change in NDVI distribution propagated
to differences in the estimated distribution of LAI
(Figure 7C) and consequently NEE (Figure 7D).
The importance of accurately averaging NDVI is
further shown in Figure 7A, which presents the
PLIRTLE-modeled NEE estimates that result from
different NDVI and LAI model inputs using the
2007 growing-season meteorological data. PLIRTLE
requires an LAI estimate or measurement, but the
nonlinearity in the NDVI-LAI conversion can
propagate to result in very large differences in
modeled NEE (Table 2, Williams and others 2008).
Preserving the information contained in the NDVI
distribution when applying the NDVI-LAI transfer
function is a logical way to upscale NEE estimates
with minimal bias.
Future Studies at Abisko
A major uncertainty for co-classifying ecological/
hydrological/topographic features in the Arctic
landscape studied here is the controls on forest
patch distribution near the Arctic treeline. Spada-
vecchia and others (2008) restricted their study to
tundra vegetation types given incomplete infor-
mation on the spatial structure of the treeline. The
irregular transition between the forest/wetland
matrix and tundra that occurs with elevation at
Abisko must be investigated to accurately model
ecological and hydrological processes at this inter-
face. Such modeling efforts should find the balance
between simplifying the spatial complexity of the
land surface for efficient modeling while accurately
characterizing vegetation and surface features.
Future Studies on IC-Based Upscaling
The particular results of the experimental findings
apply to our study site, which is characteristic of
tundra ecosystems (Spadavecchia and others
2008). There is no guarantee that conditioning
networks based on the topographic indices chosen
here will be effective for upscaling ecological
function in other ecosystems, but Vivoni and others
(2005a) successfully applied their variation of the
approach to watersheds with different characteris-
tics. Other likely examples where topographically
conditioned TINs may be effective for averaging
ecological variability include natural riverine and
montane environments where vegetation is clus-
tered around topographic features. Heavily man-
aged landscapes, such as industrial agriculture, may
show weaker relationships between topographic
indices and vegetation for the application of topo-
graphically based TINs. Regardless, upscaling point
or plot-scale measurements may benefit from the
IC-based approach that we demonstrated for rela-
tionships that are nonlinear, including the transfer
functions for remotely sensed indices and LAI in
agricultural (Haboudane and others 2004) and
other global ecosystems (Myeni and others 1997).
Choosing an index that has a relationship with LAI
that is not so strikingly nonlinear, for example, the
enhanced vegetation index may be useful for
reducing aggregation errors (Boegh and others
2002), particularly for mixed forest-tundra pixels
(Liu and Kafatos 2005).
Table 2. PLIRTLE (Shaver and others 2007) Model Results for the Net Ecosystem Exchange of C After
Spatial Averaging of (Top) Leaf Area Index (LAI) Measurements and (Bottom) NDVI Measurements
Mean NDVI Mean LAI (m2 m-2) NEEmod (g C m
-2 g s-1) NEEpixels (g C m
-2 g s-1)
Averaging LAI
Fine-scale – 1.20 -155.2 -105.2
Square pixels – 1.08 (-10.0%) -128.0 (17.5%) -111.2 (-5.70%)
TIN – 1.21 (0.00833%) -158.0 (-0.0175%) -96.5 (8.27%)
Averaging NDVI
Fine-scale 0.757 0.814 -57.2 -51.1
Square pixels 0.749 (-.0106%) 0.723 (-11.2%) -31.4 (45.1%) -28.0 (45.2)
TIN 0.758 (.00132%) 0.812 (.00246%) -56.5 (1.24%) -50.7 (0.783%)
‘Fine-scale’ refers to mean model results from all treeless (that is, tundra) measurements in the macro-scale dataset using meteorological inputs for the 2007 growing season.
NEEmod refers to model results after applying the mean of the aggregated LAI values, and NEEpixels refers to the mean model result after first applying the model individually to
each pixel.
Information Theory for Upscaling
Ecosystem Function
Concepts from information theory have long been
used to describe the distribution of organisms; the
Shannon diversity is simply the Shannon entropy
for the case of species. Information theory has also
been used to characterize mass and energy flows
among the networks and trophic levels that char-
acterize ecosystems (Ulanowicz 2001; Jørgensen
and others 2007), and provides the theoretical basis
for model selection algorithms such as the Akaike
information criterion (Akaike 1974). IC has been
used less frequently for landscape-level studies
(Chen and Blong 2002), research into biogeo-
chemical processes, or land surface modeling, par-
ticularly for addressing the ‘upscaling’ problem
(Rastetter and others 1992; Pelgrum 2000). This
study represents one example of how simple con-
cepts from information theory can be used for
linking plot-level measurements to process-based
studies at larger spatial scales, taking the view that
‘upscaling’ represents the transfer of information
between scales in space. This effort is related to
previous investigations that sought to minimize
within-pixel variance (Band and others 1991) by
noting that variance can be defined by the
parameters of the distributions used here to define
information (for example, variance for the gamma
distribution equals ab2). The example with the
ecosystem model demonstrated the potential
importance of preserving the information con-
tained in the spatial distribution of an ecological
characteristic (in this case NDVI) if the model
transfer function (to LAI) is nonlinear. Similarly,
quantifying information loss or gain when ‘down-
scaling’ global and regional models to finer spatial
scales may be forthcoming for efficiently quanti-
fying the accuracy of the disaggregation routine.
Combining upscaled and downscaled estimates of
ecosystem function ultimately rely on the infor-
mation (in the form of data) that is available, and
how these data are combined to present the best
estimate of ecosystem function at multiple spatial
scales. We envision that IC preservation will be a
logical method for reducing errors while extrapo-
lating estimates of ecosystem function to larger
scales in space (Brunsell and others 2007, 2008).
ACKNOWLEDGMENTS
We acknowledge the funding from the US National
Science Foundation (Grant numbers OPP-0096523,
OPP-0352897, DEB-0087046, and DEB-00895825),
from the University of Edinburgh, and from the
Natural Environment Research Council. PS, MW,
and AP-B were supported by the ABACUS project.
LS was supported by a NERC studentship to the
Centre for Terrestrial Carbon Dynamics. RB was
supported by the University of Edinburgh research
funding. Funding for the NERC ARSF flight
that carried the ATM sensor used for DEM genera-
tion was provided by Bob Baxter and Brian Huntley
at the University of Durham.Wewould like to thank
Willem Bouten for use of the LAI-2000, Lorna
Street and Sven Rasmussen for field assistance, Ben
Poulter for ArcGIS assistance, Terry Callaghan and
0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90
0
0.1
0.2
0.3
0.4
0.5
p(N
DV
I)
NDVI
Measurements
Square pixels
TIN
0.2 0.4 0.6 0.8 1 1.2 1.4 1.6
0
0.1
0.2
0.3
0.4
p(L
AI
)
LAI (m  2     m     -2    )
-250 -200 -150 -100 -50 0 50 100 150
0
0.1
0.2
0.3
0.4
p(N
EE
)
NEE (g C m   -2           gs   -1)
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
-600
-400
-200
0
200
400
LAI or NDVI
N
EE
 (g
 C
 m
-
2  g
s-
1 )
NDVI
LAI
A
B
D
C
Figure 7. A Modeled net ecosystem exchange of C
(NEE) from the PLIRTLE model (Shaver and others 1992)
with pan-Arctic parameterization (Table 1) and 2007
growing-season meteorological data for a range of LAI, as
well as a range of NDVI that has been converted to LAI
using the 1.5 m NDVI-LAI transfer function described in
the study of Williams and others (2008). B The pdf of
tundra macro-scale measured NDVI versus NDVI aggre-
gated using square pixels and an optimally designed TIN
at a tundra ecosystem near Abisko, Sweden. C The pdf of
LAI after conversion from NDVI using the 1.5 m NDVI-
LAI relationship in the study of Williams and others
(2008). D The pdf of modeled net ecosystem exchange of
C (NEE) from the PLIRTLE model.
Gus Shaver for general support, andMathias Disney
for valuable comments on the manuscript.
REFERENCES
Akaike H. 1974. A new look at the statistical model identification.
IEEE Trans Automat Contr 19:716–23.
Anselin L. 1995. Local indicators of spatial association—LISA.
Geogr Anal 27:93–115.
Badiya Roy S, Avissar R. 2002. Impact of land use/land cover
change on regional hydrometeorology in Amazonia. J Geo-
phys Res 107. doi:10.1029/2000JD000266.
Baldocchi DD. 2008. ‘Breathing’ of the terrestrial biosphere:
lessons learned from a global network of carbon dioxide flux
measurements systems. Aust J Bot 56:1–26. Turner Review.
Band LE, Peterson DL, Running SW, Coughlan JC, Lammers R.
1991. Forest ecosystem processes at the watershed scale: basis
for distributed simulation. Ecol Model 56:171–96.
Beven KJ, Kirkby MJ. 1979. A physically-based variable con-
tributing area model of basin hydrology. Hydrol Sci Bull 24:
43–69.
Bliss LC. 1962. Adaptations of Arctic and alpine plants to envi-
ronmental conditions. Arctic 15:117–44.
Boegh E, Soegaard H, Broge N, Hasager CB, Jensen NO, Schelde
K. 2002. Airborne multispectral data for quantifying leaf area
index, nitrogen concentration and photosynthetic efficiency
in agriculture. Remote Sens Environ 81:179–93.
Brunsell NA, Ham JM, Owensby CE. 2008. Assessing the multi-
resolution information content of remotely sensed variables
and elevation for evapotranspiration in a tall-grass prairie
environment. Remote Sens Environ 112:2977–87.
Brunsell NA, Young CB. 2007. Land surface response to
precipitation events using MODIS and NEXRAD data. Int J
Remote Sens 29:1965–82.
Burnham KP, Anderson DR. 2002. Model selection and multi-
model inference: a practical information-theoretic approach.
New York: Springer, p 488.
Canadell JG, Le Quere C, Raupach MR, Field CB, Buitenhuis ET,
Ciais P, Conway TJ, Gillett NP, Houghton RA, Marland G.
2007. Contributions to accelerating atmospheric CO2 growth
from economic activity, carbon intensity, and efficiency of
natural sinks. Proc Natl Acad Sci U S A 104:18353–4.
Chen K, Blong R. 2002. Integrating remotely sensed images and
areal census data for building new models across scales.
Geoscience and Remote Sensing Symposium, 2002. IGARSS
¢02. 2002 IEEE International, vol 4, pp 2385–7.
Christensen TR, Johansson T, A˚kerman HJ, Mastepanov M,
Malmer N, Friborg T, Crill P, Svensson BH. 2004. Thawing
sub-Arctic permafrost: effects on vegetation and methane
emissions. Geophys Res Lett 31:L04501.
Entekhabi D, Eagleson PS. 1989. Land surface hydrology
parameterization for atmospheric general circulation models
including subgrid scale spatial variability. J Clim 2:816–31.
Essery RLH, Best MJ, Betts RA, Cox PM, Taylor CM. 2003.
Explicit representation of subgrid heterogeneity in a GCM
land-surface scheme. J Hydrometeorol 4:530–43.
Fletcher BJ, Press MC, Baxter R, Phoenix GK. (unpublished
data). Plant growth and photosynthesis across transition zones
between Arctic vegetation patches: separation of ecological
and physiological optima. Funct Ecol.
Gold C, Angel P. 2006. Voronoi hierarchies. In: Raubal M, Miller
HJ, Frank AU, Goochild MF, Eds. Geographic information
science: 4th international conference, GIScience 2006. Mu¨n-
ster, Germany: Springer. p 419.
Goodrich DC, Woolhiser DA, Keefer TO. 1991. Kinematic rout-
ing using finite elements on a triangular irregular network.
Water Resour Res 38. doi:10.1029/2001WR000854.
Gurney KR, Law RM, Denning AS, Rayner PJ, Pak BC, Baker D,
Bousquet P, Bruhwiler L, Chen Y-H, Ciais P, Fung IY, Heimann
M, John J. 2004. Transcom3 inversion intercomparison:model
mean results for the estimation of seasonal carbon sources and
sinks. Global Biogeochem Cycles 18:GB1010.1011–8.
Haboudane D, Miller JR, Pattey E, Zerco-Tejada PJ, Strachan IB.
2004. Hyperspectral vegetation indices and novel algorithms
for predicting green LAI of crop canopies: modeling and val-
idation in the context of precision agriculture. Remote Sens
Environ 90:337–52.
Hancock GR. 2006. The impact of different gridding methods on
catchment geomorphology and soil erosion over long time-
scales using a landscape evolution model. Earth Surf Processes
Landforms 31:1035–50.
Heinsch FA, Zhao M, Running SW, Kimball JS, Nemani RR,
Davis KJ, Bolstad PV, Cook BD, Desai AR, Ricciuto DM, Law
BE, Oechel WC, Kwon H, Luo H, Wofsy SC, Dunn AL,
Munger JW, Baldocchi DD, Xu L, Hollinger DY, Richardson
AD, Stoy PC, Siqueira MBS, Monson RK, Burns S, Flanagan
LB. 2006. Evaluation of remote sensing based terrestrial pro-
ductivity from MODIS using AmeriFlux tower eddy flux
network observations. IEEE Trans Geosci Remote Sens 44:
1908–25.
Ivanov VY, Vivoni ER, Bras RL, Entekhabi D. 2004. Catchment
hydrologic response with a fully distributed triangulated
irregular network model. Water Resour Res 40:W11102.
doi:11110.11029/12004WR003218.
Jarvis PG, McNaughton KG. 1986. Stomatal control of transpi-
ration: scaling up from leaf to region. Adv Ecol Res 15:1–49.
Jonasson S,MichelsenA, Schmidt IK, Nielsen EV. 1999. Responses
in microbes and plants to changed temperature, nutrient and
light regimes in the Arctic. Ecology 80:1828–43.
Jørgensen SE, Marques JC, Mu¨ller F, Nielsen SN, Patten PC,
Tiezzi E, Ulanowicz RE. 2007. A new ecology: systems per-
spective. Amsterdam: Elsevier, p 275.
Katul GG, Lai C-T, Albertson JD, Vidakovic B, Scha¨fer KVR,
Hsieh CI, Oren R. 2001. Quantifying the complexity in map-
ping energy inputs and hydrologic state variables into land-
surface fluxes. Geophys Res Lett 28:3305–7.
Kullback S. 1997. Information theory and statistics. Mineola,
NY: Dover Publications, p 416.
Kullback S, Leibler RA. 1951. On information and sufficiency.
Ann Math Stat 22:79–86.
Kumler MP. 1994. An intensive comparison of triangulated
irregular networks (TINs) and digital elevation models
(DEMs). Cartographica 31: Monograph 45, 41–48.
Kustas WP, Norman JM. 2000. Evaluating the effects of subpixel
heterogeneity on pixel average fluxes. Remote Sens Environ
74:327–42.
Leuning R, Kelliher FM, DePury DG, Schulze E-D. 1995. Leaf
nitrogen, photosynthesis, conductance and transpiration:
scaling from leaves to canopies. Plant Cell Environ 18:1183–
200.
Liu X, Kafatos M. 2005. Land-cover mixing and spectral vege-
tation indices. Int J Remote Sens 26:3321–7.
Mathiassen JR, Skavhaug A, Bø K. 2002. Texture similarity
measure using Kullback-Leibler divergence between gamma
distributions. Computer Vision—ECCV 2002. Berlin: Springer,
p 19–49.
Mauser W, Tenhunen JD, Schneider K, Ludwig R, Stolz R, Geyer
R, Falge EM. 2001. Remote sensing, GIS and modelling:
assessing spatially distributed water, carbon and nutriend
balances in the Ammer River catchment, in southern Bavaria.
In: Tenhunen JD, Lenz R, Hantschel R, Hunter S, Eds. Eco-
system approaches to landscape management in central Eur-
ope. Berlin: Springer.
Monteith JL, Unsworth MH. 1990. Principles of environmental
physics. London: Edward Arnold, p 291.
Mu¨ller C, Lucht W. 2007. Robustness of terrestrial carbon and
water cycle simulations against variations in spatial resolution. J
Geophys Res 112:D06105. doi:06110.01029/02006JD007875.
Myeni RB, Nemani RR, Running SW. 1997. Estimation of global
leaf area index and absorbed PAR using radiative transfer
models. IEEE Trans Geosci Remote Sens 35:1380–93.
O’Neill RV, Rust B. 1979. Aggregation error in ecological models.
Ecol Model 7:91–105.
Pelgrum H. 2000. Aggregation of a nonlinear land surface model
for heterogeneous terrain. Remote sensing and hydrology.
Santa Fe, NM: IAHS.
Peuker TK, Fowler RJ, Little JJ, Mark DM. 1978. The triangu-
lated irregular network. Proceedings of the DTM symposium.
American Society of Photogrammetry—American Congress
on Surveying and Mapping. Saint Lois, MO. pp 24–31.
Potter CS, Klooster SA, Nemani R, Genovese V, Hiatt S, Flade-
land M, Gross P. 2006. Estimating carbon budgets for U.S.
ecosystems. EOS Trans Am Geophys Union 87:85–96.
Quaife T, Lewis P, de Kauwe M, Williams M, Law BE, Disney M,
Bowyer P. 2008. Assimilating canopy reflectance data into an
ecosystem model with an ensemble Kalman filter. Remote
Sens Environ 112:1347–64.
Rahman AF, Gamon JA, Sims DA, Schmidts M. 2003. Optimum
pixel size for hyperspectral studies of ecosystem function in
southern California chaparral and grassland. Remote Sens
Environ 84:192–207.
Rastetter EB, King AW, Cosby BJ, Hornberger GM, O’Neill RV,
Hobbie JE. 1992. Aggregating fine-scale ecological knowledge
to model coarser-scale attributes of ecosystems. Ecol Appl
2: 55–70.
Shannon CE. 1948. A mathematical theory of communication.
Bell Syst Tech J 27:379–423. 623–56.
Shaver GR, Billings WD, Chapin FS, Giblin AE, Nadelhoffer KJ,
Oechel WC, Rastetter EB. 1992. Global change and the carbon
balance of Arctic ecosystems. Bioscience 42:433–41.
Shaver GR, Chapin FS, Gartner BL. 1986. Factors limiting
growth and biomass accumulation in Eriophorum vaginatum L.
in Alaskan tussock tundra. J Ecol 74:257–78.
Shaver GR, Street LE, Rastetter EB, van Wijk MT, Williams M.
2007. Functional convergence in regulation of net CO2 flux in
heterogeneous tundra landscapes in Alaska and Sweden. J
Ecol 95:802–17.
Spadavecchia L, Williams M, Bell R, Stoy PC, Huntley B, van
Wijk MT. 2008. Topographic controls on the leaf area index of
a Fennoscandian tundra ecosystem. J Ecol doi:10.1111/
j.1365-2745.2008.01424.x.
Stoy PC, Katul GG, Siqueira MBS, Juang J-Y, Novick KA, Oren
R. 2006. An evaluation of methods for partitioning eddy
covariance-measured net ecosystem exchange into photo-
synthesis and respiration. Agric For Meteorol 141:2–18.
Street LE, Shaver GR, Williams M, van Wijk MT. 2007. What is
the relationship between changes in canopy leaf area and
changes in photosynthetic CO2 flux in Arctic ecosystems? J
Ecol 95:139–50.
Sullivan PF, Arens SJT, Chimner RA, Welker JM. 2008. Tem-
perature and microtopography interact to control carbon cy-
cling in a high Arctic fen. Ecosystems 11:61–76.
Tenhunen JD, Geyer R, Valentini R, Mauser W, Cernusca A.
1999. Ecosystem studies, land-use change, and resource
management. In: Tenhunen J, Kabat P, Eds. Integrating
hydrology, ecosystem dynamics, and biogeochemistry in
complex landscapes. West Sussex: Wiley. p 1–19.
Ulanowicz RE. 2001. Information theory in ecology. Comput
Chem 25:393–9.
van Wijk MT, Williams M. 2005. Optical instruments for mea-
suring leaf area index in low vegetation: application in Arctic
ecosystems. Ecol Appl 15:1462–70.
van Wijk MT, Williams M, Shaver GR. 2005. Tight coupling
between leaf area index and foliage N content in Arctic plant
communities. Oecologia 142:421–7.
Vivoni ER, Ivanov VY, Bras RL, Entekhabi D. 2004. Generation
of triangulated irregular networks based on hydrological
similarity. J Hydrol Eng 9:288–302.
Vivoni ER, Ivanov VY, Bras RL, Entekhabi D. 2005. On the ef-
fects of triangulated terrain resolution on distributed hydro-
logic model response. Hydrol Processes 19:2101–22.
Vivoni ER, Teles V, Ivanov VY, Bras RL, Entekhabi D. 2005.
Embedding landscape processes into triangulated terrain
models. Int J Geogr Inf Sci 19:429–57.
Walker DA, Auerbach NA, Lewis BE, Shippert MM. 1995. NDVI,
biomass, and landscape evolution of glaciated terrain in
northern Alaska. Polar Rec 31:169–78.
Walko RL, Avissar R. 2006. The ocean-land-atmosphere model
(OLAM): a new generation of earth system model. EOS Trans.
AGU 87 Fall Meeting Suppl., Abstract A33F-05.
Wesson KH, Katul GG, Siqueira MBS. 2003. Quantifying
organization of atmospheric turbulent eddy motion using
nonlinear time series analysis. Bound-Layer Meteorol 106:
507–25.
Wilby RL, Wigley TML. 1997. Downscaling general circulation
model output: a review of methods and limitations. Prog Phys
Geogr 21:530–48.
Williams M, Bell R, Spadavecchia L, Street LE, van Wijk MT.
2008. Upscaling leaf area index in an Arctic landscape through
multi-scale observations. Glob Chang Biol 14. doi:10.1111/
j.1365-2486.2008.01590.x.
Williams M, Rastetter EB. 1999. Vegetation characteristics and
primary productivity along an Arctic transect: implications for
scaling-up. J Ecol 87:885–98.
Williams M, Rastetter EB, Shaver GR, Hobbie JE, Carpino E,
Kwiatkowski BL. 2001. Primary production of an Arctic
watershed: an uncertainty analysis. Ecol Appl 11:1800–16.
Williams M, Schwarz PA, Law B, Irvine J, Kurpius MR. 2005. An
improved analysis of forest carbon dynamics using data
assimilation. Glob Chang Biol 11:89–105.