Guatemala Population Map Metadata Report

Prediction Weighting Layer Used in Population Redistribution

The data presented below represent the predicted number of people per ~100 m pixel as estimated using the random forest (RF) model as described in Stevens, et al. (In Press). The following pages contain a description of the RF model and its covariates, their sources and any metadata collected for each covariate. The prediction weighting layer is used to dasymetrically redistribute the census counts and project counts to match estimated populations based on UN estimates for the final population maps provided by AfriPop, AsiaPop and AmeriPop.

plot of chunk predict_density

Guatemala Census Data and Observed Population Density

These data are the population density values used to estimate the RF model used to create the prediction weighting layer you see above. Values represent population density as measured by people per hectare and calculated from population counts within each census unit. These values are used as the dependent variable during model estimation.

Guatemala Census, 2012

Folder: Census
File Name: AdminPop12.shp
Source: Instituto Nacional de Estadística, Guatemala, 2012, http://www.geohive.com/cntry/guatemala.aspx
Description: These high spatial resolution census block data were attained through in-country partners for 2012.
Class: polygon
Derived Covariates:
area, buff, zones,

class       : SpatialPolygonsDataFrame 
nfeatures   : 335 
extent      : 258599, 690576, 1518820, 1970102  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 31

plot of chunk census_data


Random Forest Model and Diagnostics

These output and figures outline the estimated RF model that is used to predict the population density weighting layer. The model is fitted to the population density values for the preceding census data using covariates aggregatedfrom the ancillary data sources summarized following the model diagnostics.


Call:
 randomForest(x = x_data, y = y_data, ntree = popfit$ntree, mtry = popfit$mtry,      nodesize = length(y_data)/1000, importance = TRUE, proximity = TRUE) 
               Type of random forest: regression
                     Number of trees: 500
No. of variables tried at each split: 13

          Mean of squared residuals: 0.24
                    % Var explained: 80

plot of chunk random_forestplot of chunk random_forestplot of chunk random_forest

Covariate Metadata

Remotely-sensed, Classified Landcover, 2000/2009

Folder: Landcover
File Name: GTM_RF_INPUT_LC.tif
Source: Geocover, 2000 (30m), http://www.mdafederal.com/geocover; GlobCover, 2009 (300m), http://due.esrin.esa.int/globcover/; MODIS 500m Global Urban Extent, 2001-2002, http://sage.wisc.edu/people/schneider/research/data.html
Description: Land cover was derived from GeoCover 2000, re-coded to GlobCover 2009 categories to be consistent with current WorldPop datasets; GlobCover 2009 was used to fill no-data areas, due to presence of clouds/shadows, in the GeoCover 2000; the MODIS 500m Global Urban Extent map was used to distinguish between urban areas and rural settlements
Class: raster
Derived Covariates:
prp011, cls011, dst011, prp040, cls040, dst040, prp130, cls130, dst130, prp140, cls140, dst140, prp150, cls150, dst150, prp160, cls160, dst160, prp190, cls190, dst190, prp200, cls200, dst200, prp210, cls210, dst210, prp230, cls230, dst230, prp240, cls240, dst240, prp250, cls250, dst250, prpBLT, clsBLT, dstBLT,

class       : RasterBrick 
dimensions  : 4528, 4347, 19683216, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 257729, 692429, 1518754, 1971554  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=tmerc +lat_0=0 +lon_0=-90 +k=0.9996 +x_0=500000 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : C:\WorldPop\RFmap\data\GTM\Landcover\Derived\landcover.tif 
names       : landcover 
min values  :         0 
max values  :       240 

plot of chunk covariate_reports


MODIS 17A3 2010 Estimated Net Primary Productivity, 1km

Folder: NPP
File Name: DEFAULT: MODIS 17A3 2010
Source: United States Geological Survey (USGS)
Description: MODIS 17A3 version-55 derived estimates of net primary productivity for the year 2010, estimated for 1km pixel sizes and subset and resampled to match the available land cover and final population map output requirements.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 4528, 4347, 19683216, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 257729, 692429, 1518754, 1971554  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=tmerc +lat_0=0 +lon_0=-90 +k=0.9996 +x_0=500000 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : C:\WorldPop\RFmap\data\GTM\NPP\Derived\npp.tif 
names       :   npp 
min values  :     0 
max values  : 18667 

plot of chunk covariate_reports


Suomi NPP VIIRS-Derived 2012 Lights at Night, 15 arc-second

Folder: Lights
File Name: DEFAULT: VIIRS 2012
Source: http://ngdc.noaa.gov/eog/viirs/download_viirs_ntl.html
Description: These 'Lights at Night' data were derived from imagery collected by the Suomi National Polar-orbiting Partnership (NPP) Visible Infrared Imaging Radiometer Suite (VIIRS) sensor. Data were collected in 2012 on moonless nights and though background noise associated with fires, gas-flares, volcanoes or aurora have not been removed it represents the best-available data for night-time light production.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 4529, 4348, 19692092, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 257629, 692429, 1518654, 1971554  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=tmerc +lat_0=0 +lon_0=-90 +k=0.9996 +x_0=500000 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : C:\WorldPop\RFmap\data\GTM\Lights\Derived\lights.tif 
names       : lights 
min values  : -0.083 
max values  :    605 

plot of chunk covariate_reports


WorldClim/BioClim Mean Annual Temperature 1950-2000, 30 arc-second

Folder: Temp
File Name: DEFAULT: BIO1
Source: http://www.worldclim.org/current
Description: WorldClim/BioClim 1950-2000 mean annual precipitation (BIO12) and mean annual temperature (BIO1) estimates (Hijmans et al., 2005) were downloaded, mosaicked and subset to match the extent of our land cover data for the mapping of this region.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 4529, 4348, 19692092, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 257629, 692429, 1518654, 1971554  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=tmerc +lat_0=0 +lon_0=-90 +k=0.9996 +x_0=500000 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : C:\WorldPop\RFmap\data\GTM\Temp\Derived\temp.tif 
names       : temp 
min values  :   42 
max values  :  283 

plot of chunk covariate_reports


WorldClim/BioClim Mean Annual Precipitation 1950-2000, 30 arc-second

Folder: Precip
File Name: DEFAULT: BIO12
Source: http://www.worldclim.org/current
Description: WorldClim/BioClim 1950-2000 mean annual precipitation (BIO12) and mean annual temperature (BIO1) estimates (Hijmans et al., 2005) were downloaded, mosaicked and subset to match the extent of our land cover data for the mapping of this region.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 4529, 4348, 19692092, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 257629, 692429, 1518654, 1971554  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=tmerc +lat_0=0 +lon_0=-90 +k=0.9996 +x_0=500000 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : C:\WorldPop\RFmap\data\GTM\Precip\Derived\precip.tif 
names       : precip 
min values  :    578 
max values  :   5372 

plot of chunk covariate_reports


Roads (COD-FOD registry)

Folder: Roads
File Name: COD_FOD_caminos_gtm_GCSWGS84_Clip.shp
Source: Common Operational Datasets (CODs)/Fundamental Operational Datasets (FODs), Downloaded 2014-02-24, https://cod.humanitarianresponse.info/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the https://cod.humanitarianresponse.info/ website, extracted from the Common/Fundamental Operational Datasets (COD-FOD) registry.
Class: linear
Derived Covariates:
prp, cls, dst,

class       : SpatialLinesDataFrame 
nfeatures   : 10760 
extent      : 262605, 683929, 1519973, 1970095  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 7

plot of chunk covariate_reports


Rivers (COD-FOD registry)

Folder: Rivers
File Name: COD_FOD_rios_gtm_GCSWGS84_Clip.shp
Source: Common Operational Datasets (CODs)/Fundamental Operational Datasets (FODs), Downloaded 2014-02-24, https://cod.humanitarianresponse.info/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the https://cod.humanitarianresponse.info/ website, extracted from the Common/Fundamental Operational Datasets (COD-FOD) registry.
Class: linear
Derived Covariates:
prp, cls, dst,

class       : SpatialLinesDataFrame 
nfeatures   : 5633 
extent      : 263751, 690449, 1518916, 1970051  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 6

plot of chunk covariate_reports


Populated places with ~20000 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Populated
File Name: CIUDAD_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 75 
extent      : 278672, 650636, 1578295, 1871659  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports


Inland Waterbodies

Folder: Waterbodies
File Name: DEFAULT: hydro/watrcrsl
Source: National Geospatial-Intelligence Agency (NGA), http://geoengine.nga.mil/geospatial/SW_TOOLS/NIMAMUSE/webinter/rast_roam.html
Description: The VMAP0 data area downloaded as separate files, grouped roughly by continent, and merged into individual shapefiles for subsetting and further processing for population mapping efforts. These data were obtained directly from the original VMAP0 data sources provided by the NGA and pre-processed using Military Analyst in ArcGIS 10.0.
Class: polygon
Derived Covariates:
cls, dst, prp,

class       : SpatialPolygonsDataFrame 
nfeatures   : 253 
extent      : 256691, 690231, 1534481, 1973672  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 10

plot of chunk covariate_reports


Protected Areas

Folder: Protected
File Name: DEFAULT: WDPAfgdb_Sept2012.gdb
Source: World Database on Protected Areas, Downloaded September, 2012, UNEP, http://www.wdpa.org, http://protectedplanet.net
Description: These data are compiled by UNEP and distributed via the Protected Planet website. All protected areas were downloaded regardless of International Union for Conservation of Nature (IUCN) or any other designation, so they include sanctuaries, national parks, game reserves, World Heritage Sites, etc.
Class: polygon
Derived Covariates:
cls, dst, prp,

class       : SpatialPolygonsDataFrame 
nfeatures   : 309 
extent      : 250788, 693414, 1514496, 1980098  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 26

plot of chunk covariate_reports


Urban Extents

Folder: Urban
File Name: DEFAULT: schneider-urban.shp
Source: Schneider, et al., United Nations
Description: These data were constructed from MODIS-derived imagery and provided to WorldPop researchers by Schneider, et al. as part of a global urban extents datasets.
Class: polygon
Derived Covariates:
cls, dst, prp,

class       : SpatialPolygonsDataFrame 
nfeatures   : 129 
extent      : 255833, 693508, 1538203, 1870845  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 2

plot of chunk covariate_reports


Elevation and Derived Slope, 3 second

Folder: Elevation
File Name: DEFAULT: Void-Filled DEM.gdb
Source: HydroSHEDS Void-Filled DEM (Lehnert, et al., 2006), http://hydrosheds.cr.usgs.gov/dataavail.php
Description: The HydroSHEDS data are the result of an effort to provide a globally consistent dataset consisting of NASA's Shuttle Radar Topography Mission (SRTM) data and have been processed, void-filled and corrected for use at large scales.
Class: raster
Derived Covariates:
, slope,

class       : RasterBrick 
dimensions  : 4529, 4348, 19692092, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 257629, 692429, 1518654, 1971554  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=tmerc +lat_0=0 +lon_0=-90 +k=0.9996 +x_0=500000 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : C:\WorldPop\RFmap\data\GTM\Elevation\Derived\elevation.tif 
names       : elevation 
min values  :       -14 
max values  :      4191 

plot of chunk covariate_reports


Digitized Building Locations (OSM), 2013

Folder: Buildings
File Name: OSM_buildings_Rep_Clip.shp
Source: Open Street Map, Downloaded 2013-12-06, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: polygon
Derived Covariates:
prp, cls, dst,

class       : SpatialPolygonsDataFrame 
nfeatures   : 1744 
extent      : 269262, 674902, 1522655, 1905022  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 3

plot of chunk covariate_reports


Points of Interest Locations (OSM), 2013

Folder: Points
File Name: OSM_points_cleaned_Clip.shp
Source: Open Street Map, Downloaded 2013-12-06, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 3835 
extent      : 269271, 653770, 1519231, 1967699  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 4

plot of chunk covariate_reports


Delineated Land Uses (OSM), 2013

Folder: Uses
File Name: OSM_landuse_Rep_Clip.shp
Source: Open Street Map, Downloaded 2013-12-06, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: polygon
Derived Covariates:
prp, cls, dst,

class       : SpatialPolygonsDataFrame 
nfeatures   : 280 
extent      : 268838, 653816, 1520901, 1905781  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 3

plot of chunk covariate_reports


Schools (COD-FOD registry), 2000

Folder: Education
File Name: COD_FOD_escuelas_gtm_GCSWGS84_Clip.shp
Source: Common Operational Datasets (CODs)/Fundamental Operational Datasets (FODs), Downloaded 2014-02-24, https://cod.humanitarianresponse.info/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the https://cod.humanitarianresponse.info/ website, extracted from the Common/Fundamental Operational Datasets (COD-FOD) registry.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 23508 
extent      : 260866, 683204, 1520181, 1943793  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 22

plot of chunk covariate_reports


Health Centers (COD-FOD registry), 2000

Folder: Health
File Name: COD_FOD_centros_salud_pdialogo_gtm_GCSWGS84_Clip.shp
Source: Common Operational Datasets (CODs)/Fundamental Operational Datasets (FODs), Downloaded 2014-02-24, https://cod.humanitarianresponse.info/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the https://cod.humanitarianresponse.info/ website, extracted from the Common/Fundamental Operational Datasets (COD-FOD) registry.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 4065 
extent      : 263653, 667212, 1520187, 1930614  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 7

plot of chunk covariate_reports


Populated places with ~8000 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Villa
File Name: VILLA_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 38 
extent      : 305832, 625711, 1555942, 1747580  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports


Populated places with ~4000 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Pueblo
File Name: PUEBLO_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 317 
extent      : 263712, 633824, 1539561, 1887098  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports


Populated places with ~1000 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Aldea
File Name: ALDEA_ASENTAMIENTO_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 2716 
extent      : 265683, 683887, 1520175, 1952647  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports


Populated places with ~500 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Colonia
File Name: COLONIA_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 1815 
extent      : 269029, 651802, 1538456, 1881111  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports


Populated places with ~150 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Caserio
File Name: CASERIO_FINCA_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 17916 
extent      : 260762, 687260, 1520012, 1962406  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports


Populated places with ~50 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Otra
File Name: OTRA_PARAJE_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 3285 
extent      : 263905, 685907, 1532890, 1967633  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports


Populated places with ~0 inhabitants (Instituto Nacional de Estadística, Guatemala), 2002

Folder: Hacienda
File Name: HACIENDA_LABOR_GRANJA_gtm_GCSWGS84_Clip.shp
Source: Instituto Nacional de Estadística, Guatemala, 2002
Description: These census data were attained through in-country partners for 2002.
Class: point
Derived Covariates:
prp, cls, dst,

class       : SpatialPointsDataFrame 
nfeatures   : 1140 
extent      : 268100, 671742, 1528480, 1900670  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
nvariables  : 15

plot of chunk covariate_reports