Malawi Population Map Metadata Report

Prediction Weighting Layer Used in Population Redistribution

The data presented below represent the predicted number of people per ~100 m pixel, as estimated using the random forest (RF) model described in Stevens et al. (2015). The following pages describe the RF model and its covariates, their sources, and any metadata collected for each covariate. The prediction weighting layer is used to dasymetrically redistribute the census counts, and to project those counts to match UN population estimates, for the final population maps provided by WorldPop.

Stevens, F. R., Gaughan, A. E., Linard, C., & Tatem, A. J. (2015). Disaggregating Census Data for Population Mapping Using Random Forests with Remotely-Sensed and Ancillary Data. PLOS ONE, 10(2), e0107042. doi:10.1371/journal.pone.0107042
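
The dasymetric redistribution described above can be illustrated with a minimal Python sketch. It is not the WorldPop implementation: the function names (redistribute, scale_to_total), the toy weights, and the rule for units whose weights sum to zero are all assumptions made for illustration. Each census unit's count is split across its pixels in proportion to the weighting layer, and the result can then be rescaled to a target total such as a UN estimate.

```python
from collections import defaultdict


def redistribute(weights, zones, census_counts):
    """Split each census unit's count over its pixels in proportion to the weights.

    weights: per-pixel values of the weighting layer (flattened).
    zones: per-pixel census unit id, same length as weights.
    census_counts: dict mapping census unit id -> population count.
    """
    # Sum the weights inside each census unit.
    zone_totals = defaultdict(float)
    for w, z in zip(weights, zones):
        zone_totals[z] += w

    people = []
    for w, z in zip(weights, zones):
        total = zone_totals[z]
        # Assumption: a unit whose weights sum to zero receives no people here;
        # a production pipeline would need an explicit rule for this case.
        people.append(census_counts[z] * w / total if total > 0 else 0.0)
    return people


def scale_to_total(people, target_total):
    """Rescale per-pixel counts so they sum to a target (e.g. a UN estimate)."""
    current = sum(people)
    return [p * target_total / current for p in people]


# Hypothetical toy data: five pixels in two census units.
weights = [0.5, 1.5, 2.0, 4.0, 1.0]
zones = ["A", "A", "A", "B", "B"]
counts = {"A": 400, "B": 100}

pixels = redistribute(weights, zones, counts)
adjusted = scale_to_total(pixels, 550)
```

Within each unit the pixel values sum back to the census count, so the weighting layer only controls where people are placed inside a unit, not how many there are.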

plot of chunk predict_density

Malawi Census Data and Observed Population Density

These data are the population density values used to fit the RF model that produces the prediction weighting layer shown above. Values are population densities in people per hectare, calculated from the population count within each census unit. They serve as the dependent variable during model estimation.
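
As a small worked example of the density calculation, the Python sketch below converts a census unit's count and area into people per hectare. The function name and the example figures are hypothetical; the report does not state how unit areas were measured.

```python
SQUARE_METRES_PER_HECTARE = 10_000.0


def density_per_hectare(population, area_m2):
    """Population density in people per hectare for one census unit."""
    return population / (area_m2 / SQUARE_METRES_PER_HECTARE)


# Hypothetical census unit: 1,200 people living in 3 square kilometres.
example_density = density_per_hectare(1200, 3_000_000)

# A 100 m x 100 m pixel covers exactly one hectare.
pixel_area_ha = (100 * 100) / SQUARE_METRES_PER_HECTARE
```

Because a 100 m pixel covers one hectare, people per hectare and people per pixel are numerically the same on a grid of this resolution.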

Malawi Census, 2008

Folder: Census
File Name: MWI_adm3_2008_12666.shp
Source: Malawi Census, 2008, provided by Andy Tatem
Description: These high spatial resolution census block data were obtained through in-country partners for 2008.
Class: polygon
Derived Covariates:
area, buff, zones,

class       : SpatialPolygonsDataFrame 
features    : 12666 
extent      : 464484, 814001, 8105168, 8964939  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 25

plot of chunk census_data


Random Forest Model and Diagnostics

The output and figures below outline the estimated RF model used to predict the population density weighting layer. The model is fitted to the population density values from the preceding census data, using covariates aggregated from the ancillary data sources summarized after the model diagnostics.


Call:
 randomForest(x = x_data, y = y_data, ntree = popfit$ntree, mtry = popfit$mtry,      nodesize = length(y_data)/1000, importance = TRUE, proximity = TRUE) 
               Type of random forest: regression
                     Number of trees: 500
No. of variables tried at each split: 16

          Mean of squared residuals: 0.35
                    % Var explained: 83
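
To make the two summary figures easier to read, the Python sketch below shows how a "mean of squared residuals" and a "% Var explained" value relate to each other. It assumes the usual pseudo R-squared definition, 1 - MSE / Var(y) with the variance taken over n rather than n - 1, which is how the randomForest regression summary is generally understood to be computed from out-of-bag predictions. The function names and toy values are hypothetical and are not taken from the fitted model.

```python
def mean_squared_residual(observed, predicted):
    """Average squared difference between observed and predicted values."""
    n = len(observed)
    return sum((y - p) ** 2 for y, p in zip(observed, predicted)) / n


def percent_var_explained(observed, predicted):
    """Pseudo R-squared as a percentage: 100 * (1 - MSE / Var(y))."""
    n = len(observed)
    mean_y = sum(observed) / n
    # Variance with divisor n (assumption: matches the summary's convention).
    var_y = sum((y - mean_y) ** 2 for y in observed) / n
    return 100.0 * (1.0 - mean_squared_residual(observed, predicted) / var_y)


# Hypothetical toy data, not values from the Malawi model.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 2.5, 4.0]

mse = mean_squared_residual(observed, predicted)
pve = percent_var_explained(observed, predicted)
```

Under that assumption, the rounded figures printed above (0.35 and 83) would imply a response variance of roughly 0.35 / 0.17, or about 2.06, in whatever units the response was modelled.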

plot of chunk random_forest
plot of chunk random_forest
plot of chunk random_forest

Covariate Metadata

Suomi NPP VIIRS-Derived 2012 Lights at Night, 15 arc-second

Folder: Lights
File Name: DEFAULT: VIIRS 2012
Source: http://ngdc.noaa.gov/eog/viirs/download_viirs_ntl.html
Description: These 'Lights at Night' data were derived from imagery collected by the Suomi National Polar-orbiting Partnership (NPP) Visible Infrared Imaging Radiometer Suite (VIIRS) sensor. Data were collected in 2012 on moonless nights. Background noise associated with fires, gas flares, volcanoes or aurora has not been removed, but these remain the best available data on night-time light production.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8613, 3567, 30722571, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463703, 820403, 8103924, 8965224  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\Lights\Derived\lights.tif 
names       : lights 
min values  : -0.047 
max values  :    619 
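
The dimensions, resolution and extent lines in each raster summary are linked: the number of columns and rows follows from the extent divided by the cell size. The Python sketch below (with a hypothetical helper name, grid_shape) reproduces the figures printed for the lights layer and for the temperature layer summarized later in this report, which is a quick way to check that a derived raster is on the expected 100 m grid.

```python
def grid_shape(xmin, xmax, ymin, ymax, res_x, res_y):
    """Return (nrow, ncol, ncell) implied by an extent and a cell size."""
    ncol = round((xmax - xmin) / res_x)
    nrow = round((ymax - ymin) / res_y)
    return nrow, ncol, nrow * ncol


# Extents and resolutions copied from the raster summaries in this report.
lights_shape = grid_shape(463703, 820403, 8103924, 8965224, 100, 100)
temp_shape = grid_shape(452403, 835303, 8091924, 8976624, 100, 100)
```

The two layers share a resolution but not an extent, so they would need to be cropped or aligned to a common grid before being stacked as model covariates.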

plot of chunk covariate_reports


WorldClim/BioClim Mean Annual Temperature 1950-2000, 30 arc-second

Folder: Temp
File Name: DEFAULT: BIO1
Source: http://www.worldclim.org/current
Description: WorldClim/BioClim 1950-2000 mean annual precipitation (BIO12) and mean annual temperature (BIO1) estimates (Hijmans et al., 2005) were downloaded, mosaicked and subset to match the extent of our land cover data for the mapping of this region.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8847, 3829, 33875163, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 452403, 835303, 8091924, 8976624  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\Temp\Derived\temp.tif 
names       : temp 
min values  :  112 
max values  :  269 
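
The minimum and maximum above are easier to interpret once converted to degrees. The Python sketch below assumes the WorldClim version 1.4 convention of storing temperature as degrees Celsius multiplied by 10; the report itself does not state the unit, so the scale factor is an assumption, and the function name is hypothetical.

```python
def bio1_to_celsius(stored_value, scale=10.0):
    """Convert a stored BIO1 value to degrees Celsius (assumed scale factor)."""
    return stored_value / scale


# Minimum and maximum taken from the raster summary above.
min_celsius = bio1_to_celsius(112)
max_celsius = bio1_to_celsius(269)
```

Under that assumption the layer spans roughly 11.2 to 26.9 degrees Celsius.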

plot of chunk covariate_reports


WorldClim/BioClim Mean Annual Precipitation 1950-2000, 30 arc-second

Folder: Precip
File Name: DEFAULT: BIO12
Source: http://www.worldclim.org/current
Description: WorldClim/BioClim 1950-2000 mean annual precipitation (BIO12) and mean annual temperature (BIO1) estimates (Hijmans et al., 2005) were downloaded, mosaicked and subset to match the extent of our land cover data for the mapping of this region.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8847, 3829, 33875163, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 452403, 835303, 8091924, 8976624  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\Precip\Derived\precip.tif 
names       : precip 
min values  :    587 
max values  :   2443 

plot of chunk covariate_reports


Open Street Map (OSM) Road Network, 2017

Folder: Roads
File Name: OSM_MWI_Roads.shp
Source: http://www.openstreetmap.org/
Description: These data were downloaded as part of a per-country package of OSM data layers made available as shapefiles through the http://www.BBBike.org/community.html website.
Class: linear
Derived Covariates:
cls, dst,

class       : SpatialLinesDataFrame 
features    : 155665 
extent      : 459527, 815610, 8095244, 8966102  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 10

plot of chunk covariate_reports


Open Street Map (OSM) River Lines, 2017

Folder: Rivers
File Name: OSM_MWI_Rivers.shp
Source: http://www.openstreetmap.org/
Description: These data were downloaded as part of a per-country package of OSM data layers made available as shapefiles through the http://www.BBBike.org/community.html website.
Class: linear
Derived Covariates:
cls, dst,

class       : SpatialLinesDataFrame 
features    : 6768 
extent      : 463672, 813748, 8095191, 8961909  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 5

plot of chunk covariate_reports


Populated Places

Folder: Populated
File Name: DEFAULT: Merged pop/builtupp, pop/builtupa, pop/mispopp
Source: National Geospatial-Intelligence Agency (NGA), http://geoengine.nga.mil/geospatial/SW_TOOLS/NIMAMUSE/webinter/rast_roam.html
Description: The VMAP0 data were downloaded as separate files, grouped roughly by continent, and merged into individual shapefiles for subsetting and further processing for population mapping efforts. These data were obtained directly from the original VMAP0 data sources provided by the NGA and pre-processed using Military Analyst in ArcGIS 10.0. Point data sources are buffered to 100 m, and then all polygon data sources are merged into a single shapefile prior to processing.
Class: polygon
Derived Covariates:
cls, dst,

class       : SpatialPolygonsDataFrame 
features    : 259 
extent      : 473944, 811168, 8109840, 8960583  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 13

plot of chunk covariate_reports


Open Street Map (OSM) Water Bodies, 2017

Folder: Waterbodies
File Name: OSM_MWI_Waterbodies.shp
Source: http://www.openstreetmap.org/
Description: These data were downloaded as part of a per-country package of OSM data layers made available as shapefiles through the http://www.BBBike.org/community.html website.
Class: polygon
Derived Covariates:
cls, dst,

class       : SpatialPolygonsDataFrame 
features    : 1003 
extent      : 472530, 815453, 8125192, 8937566  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 4

plot of chunk covariate_reports


Protected Areas

Folder: Protected
File Name: DEFAULT: WDPAfgdb_Sept2012.gdb
Source: World Database on Protected Areas, Downloaded September, 2012, UNEP, http://www.wdpa.org, http://protectedplanet.net
Description: These data are compiled by UNEP and distributed via the Protected Planet website. All protected areas were downloaded regardless of International Union for Conservation of Nature (IUCN) or any other designation, so they include sanctuaries, national parks, game reserves, World Heritage Sites, etc.
Class: polygon
Derived Covariates:
cls, dst,

class       : SpatialPolygonsDataFrame 
features    : 74 
extent      : 459749, 809309, 8123312, 8949770  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 26

plot of chunk covariate_reports


Urban Extents

Folder: Urban
File Name: DEFAULT: schneider-urban.shp
Source: Schneider, et al., United Nations
Description: These data were constructed from MODIS-derived imagery and provided to WorldPop researchers by Schneider et al. as part of a global urban extents dataset.
Class: polygon
Derived Covariates:
cls, dst,

class       : SpatialPolygonsDataFrame 
features    : 82 
extent      : 462857, 813537, 8161091, 8781378  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 2

plot of chunk covariate_reports


Elevation and Derived Slope, 3 arc-second

Folder: Elevation
File Name: DEFAULT: Void-Filled DEM.gdb
Source: HydroSHEDS Void-Filled DEM (Lehner et al., 2006), http://hydrosheds.cr.usgs.gov/dataavail.php
Description: The HydroSHEDS data are the result of an effort to provide a globally consistent dataset derived from NASA's Shuttle Radar Topography Mission (SRTM) data; they have been processed, void-filled and corrected for use at large scales.
Class: raster
Derived Covariates:
, slope,

class       : RasterBrick 
dimensions  : 8839, 3820, 33764980, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 452903, 834903, 8092424, 8976324  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\Elevation\Derived\elevation.tif 
names       : elevation 
min values  :        25 
max values  :      2965 

plot of chunk covariate_reports


Open Street Map (OSM) Buildings, 2017

Folder: Buildings
File Name: OSM_MWI_Buildings.shp
Source: http://www.openstreetmap.org/
Description: These data were downloaded as part of a per-country package of OSM data layers made available as shapefiles through the http://www.BBBike.org/community.html website.
Class: polygon
Derived Covariates:
cls, dst,

class       : SpatialPolygonsDataFrame 
features    : 887625 
extent      : 480445, 807514, 8103807, 8951345  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 5

plot of chunk covariate_reports


Open Street Map (OSM) Residential, 2017

Folder: Residential
File Name: OSM_MWI_Residential.shp
Source: http://www.openstreetmap.org/
Description: These data were downloaded as part of a per-country package of OSM data layers made available as shapefiles through the http://www.BBBike.org/community.html website.
Class: polygon
Derived Covariates:
cls, dst,

class       : SpatialPolygonsDataFrame 
features    : 10869 
extent      : 463480, 807131, 8104128, 8962169  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 4

plot of chunk covariate_reports


Open Street Map (OSM) Village Points, 2017

Folder: Villages
File Name: OSM_MWI_Villiages.shp
Source: http://www.openstreetmap.org/
Description: These data were downloaded as part of a per-country package of OSM data layers made available as shapefiles through the http://www.BBBike.org/community.html website.
Class: polygon
Derived Covariates:
cls, dst,

class       : SpatialPolygonsDataFrame 
features    : 86 
extent      : 541967, 782657, 8181745, 8946692  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 5

plot of chunk covariate_reports


GHSL Beta June 2015 38m, Malawi Corrected

Folder: GHSL
File Name: ghsl_mwi_prj_2014_euc_dte.tif
Source: https://ec.europa.eu/jrc/en/scientific-tool/global-human-settlement-layer
Description: These data were provided at 38 m under a cooperation agreement with the GHSL project under the ECJRC as the 2014 beta versions and were converted to binary datasets at 38 m. Data shown are the Euclidean distance derived from the binary settlement layer.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8613, 3567, 30722571, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463703, 820403, 8103924, 8965224  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\GHSL\Derived\ghsl.tif 
names       :  ghsl 
min values  :  -985 
max values  : 44854 
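
Several covariates in this report are distances computed from a binary layer, and their minima are negative. One common reading, assumed here and not confirmed by the report, is a signed distance to edge: positive outside the mapped feature and negative inside it. The Python sketch below illustrates that convention with a brute-force search on a tiny hypothetical grid. The function name, the cell-centre distance definition, and the toy settlement mask are all assumptions; a real workflow would use a GIS distance tool on the full raster.

```python
import math


def signed_distance_to_edge(grid, cell_size=100.0):
    """Distance from each cell to the nearest cell of the opposite class.

    grid: list of rows holding 1 (feature) or 0 (no feature).
    Returns distances in map units, negative inside the feature
    (assumed sign convention).
    """
    rows, cols = len(grid), len(grid[0])
    inside_cells = [(r, c) for r in range(rows) for c in range(cols) if grid[r][c] == 1]
    outside_cells = [(r, c) for r in range(rows) for c in range(cols) if grid[r][c] != 1]

    result = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            inside = grid[r][c] == 1
            targets = outside_cells if inside else inside_cells
            if not targets:
                # The grid holds a single class, so there is no edge to measure.
                distance = math.inf
            else:
                distance = cell_size * min(
                    math.hypot(tr - r, tc - c) for tr, tc in targets
                )
            result[r][c] = -distance if inside else distance
    return result


# Hypothetical 5 x 5 settlement mask with a 3 x 3 built-up block in the middle.
settlement = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
dte = signed_distance_to_edge(settlement)
```

With 100 m cells, the centre of the block comes out at -200 and the grid corners at about 141, so values grow more negative deeper inside a settlement and more positive further away from one.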

plot of chunk covariate_reports


Facebook Binary Built Area

Folder: FB
File Name: FB_DTE_MWI.tif
Source: Facebook data, provided by Tobias Tiecke. Based on house_finder_footprint_perc_mwi_v3
Description: Based on privately distributed housing locations from Facebook. A binary layer of built vs. non-built area.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8606, 3544, 30499664, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 464703, 819103, 8103824, 8964424  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\FB\Derived\fb.tif 
names       :    fb 
min values  :  -500 
max values  : 16274 

plot of chunk covariate_reports


Global Urban Footprint Distance to Edge

Folder: GUF
File Name: MWI_GUF_DTE.tif
Source: http://www.dlr.de/eoc/en/desktopdefault.aspx/tabid-11725/20508_read-47944/
Description: Global Urban Footprint data were downloaded at 2.8 arc-second resolution from the web portal and subset for Malawi. Shown is the Euclidean distance to edge.
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8613, 3567, 30722571, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463703, 820403, 8103924, 8965224  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\GUF\Derived\guf.tif 
names       :   guf 
min values  :  -728 
max values  : 38323 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 011 Cultivated Terrestrial Areas and Managed Lands Distance to Edge

Folder: R011dte
File Name: mwi_grid_100m_ccilc_dst011_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R011dte\Derived\r011dte.tif 
names       : r011dte 
min values  :     -13 
max values  :      35 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 040 Terrestrial Vegetation Woody / Trees Distance to Edge

Folder: R040dte
File Name: mwi_grid_100m_ccilc_dst040_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R040dte\Derived\r040dte.tif 
names       : r040dte 
min values  :     -12 
max values  :      35 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 130 Terrestrial Vegetation Shrubs Distance to Edge

Folder: R130dte
File Name: mwi_grid_100m_ccilc_dst130_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R130dte\Derived\r130dte.tif 
names       : r130dte 
min values  :    -1.4 
max values  :      40 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 140 Terrestrial Vegetation Herbaceous Distance to Edge

Folder: R140dte
File Name: mwi_grid_100m_ccilc_dst140_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R140dte\Derived\r140dte.tif 
names       : r140dte 
min values  :      -3 
max values  :      46 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 150 Terrestrial Vegetation Distance to Edge

Folder: R150dte
File Name: mwi_grid_100m_ccilc_dst150_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R150dte\Derived\r150dte.tif 
names       : r150dte 
min values  :   -0.19 
max values  :     388 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 160 Aquatic Vegetation Distance to Edge

Folder: R160dte
File Name: mwi_grid_100m_ccilc_dst160_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R160dte\Derived\r160dte.tif 
names       : r160dte 
min values  :    -4.5 
max values  :      56 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 190 Urban Area Distance to Edge

Folder: R190dte
File Name: mwi_grid_100m_ccilc_dst190_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R190dte\Derived\r190dte.tif 
names       : r190dte 
min values  :    -1.5 
max values  :      66 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 200 Bare Area Distance to Edge

Folder: R200dte
File Name: mwi_grid_100m_ccilc_dst200_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8610, 3564, 30686040, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 463803, 820203, 8104024, 8965024  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R200dte\Derived\r200dte.tif 
names       : r200dte 
min values  :    -1.1 
max values  :     288 

plot of chunk covariate_reports


ESA Annual CCI 2010 - 210 Water Bodies Distance to Edge

Folder: R210dte
File Name: MWI_grid_100m_ccilc_dst210_2010.tif
Source: http://cci.esa.int/content/land-cover-data
Description: European Space Agency's Climate Change Initiative Data Portal: Distance to outer edge of distinct land cover type
Class: raster
Derived Covariates:
,

class       : RasterBrick 
dimensions  : 8653, 3684, 31877652, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : 462503, 830903, 8101324, 8966624  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=utm +zone=36 +south +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\WorldPop\data\MWI\R210dte\Derived\r210dte.tif 
names       : r210dte 
min values  :  -11254 
max values  :  110562 

plot of chunk covariate_reports