Associate Professor of Pediatrics
Introduction: Although many studies have investigated the probability of Ebola virus disease (EVD) outbreaks while other studies have simulated the size and speed of EVD outbreaks, few have investigated the environmental and population-level predictors of Ebola transmission once an outbreak is underway. Identifying strong predictors of transmission could help guide and target limited public health resources during an EVD outbreak. We examined several environmental and population-level demographic predictors of EVD risk from the West African epidemic. Methods: We obtained district-level estimates from the World Health Organization EVD case data, demographic indicators obtained from the Demographic and Health surveys, and satellite-derived temperature, rainfall, and land cover estimates. A Bayesian hierarchical Poisson model was used to estimate EVD risk and to evaluate the spatial variability explained by the selected predictors. Results: We found that districts had greater risk of EVD with increasing proportion of households not possessing a radio (RR 2.79, 0.90-8.78; RR 4.23, 1.16-15.93), increasing rainfall (RR 2.18; 0.66-7.20; 5.34, 1.20-23.90), and urban land cover (RR 4.87, 1.56-15.40; RR 5.74, 1.68-19.67). Discussion: The finding of radio ownership and reduced EVD transmission risk suggests that the use of radio messaging for control and prevention purposes may have been crucial in reducing the EVD transmission risk in certain districts, although this association requires further study. Future research should examine the etiologic relationships between the identified risk factors and human-to-human transmission of EVD with a focus on factors related to population mobility and healthcare accessibility, which are critical features of epidemic propagation and control.
The recent Ebola virus disease (EVD) epidemic in West Africa was the largest EVD outbreak in history, spreading across Guinea, Liberia, and Sierra Leone, infecting an estimated 28,600 individuals, and claiming over 11,000 lives.
EVD has caused numerous outbreaks, the majority in equatorial Africa, since the first human outbreaks were detected in 1976 in the Democratic Republic of Congo and South Sudan.
Several studies have identified predictors of Ebola ecological niches (occurrence of environmental conditions that support its presence in a particular location),
Previous work has investigated the probability of Ebola outbreaks, and mathematical modelling studies have estimated the size, speed, and spatio-temporal patterns of EVD using simulated data.
The study included the three West African countries with widespread EVD transmission: Guinea, Liberia, and Sierra Leone. Guinea has an estimated population of 11,780,000
Publicly available data from the World Health Organization (WHO) included weekly counts of confirmed EVD by prefecture (Guinea, n=34), county (Liberia, n=15), and district (Sierra Leone, n=14) as of May 13, 2015.
Data for potential predictors were obtained from multiple sources including satellite sensor-derived environmental data and national Demographic and Health Surveys (DHS). Rainfall, temperature, and land cover measures were obtained from the Tropical Rainfall Measuring Mission (TRMM) and moderate resolution imaging spectroradiometer (MODIS) instruments onboard the Terra satellite. The TRMM product (TRMM3B42RT) provided weekly accumulated rainfall estimates with a spatial resolution of 0.25° × 0.25°, which was then weighted by the surface area daytime and nighttime land surface temperature (LST) estimates were obtained from MODIS (MOD11A2) using eight-day composite images at a 1 km × 1 km resolution. TRMM and LST estimates were obtained between April 28, 2014 and May 3, 2015 and 17 land cover classifications were provided by MODIS (MCD12Q1) at a 500 m × 500 m resolution for the most recent year available (2012). Eight of the land cover classifications were considered in the analysis as the remaining nine categories had negligible presence in Guinea, Liberia, and Sierra Leone. Density of waterways and roadways were estimated (km per km2), as was the average elevation and land area from shapefiles obtained from DIVA-GIS. Shapefiles from the WorldPop project were acquired, which provided projected population estimates. All estimates were obtained at the district level and Universal Transverse Mercator zone 28 projection was used.
National DHS from Guinea (2012), Liberia (2013), and Sierra Leone (2013) were used for subnational estimates on household education, wealth, occupation, household structure, and possessions and amenities. For Liberia and Sierra Leone, these estimates were obtained for the county and district levels respectively. For Guinea, DHS estimates were only available at the regional level (n=8), therefore prefectures within each region were given the same DHS-derived values.
Linearity between covariates and the outcome was first assessed and for ease of interpretation and to avoid modelling complex non-linear terms, all continuous variables were reclassified into tercile intervals. Simple transformations (e.g., exponential, log, quadratic) were insufficient to produce a linear relationship between the covariate and outcome. Multicollinearity between ternary covariates was then examined using Cramer’s V
We regressed the cumulative total of EVD cases (outcome) on the selected covariates using a Poisson model with the total population per district as an offset, given the population differences between the districts. We used the glmulti
Having identified a covariate subset, we quantified the amount of variation in EVD cases that this covariate set explained by using a spatial autoregressive modelling approach.
where
T2=second tercile, T3=third terciles; *Kilometers of roadway per 100 km2 of land area; †Number of people per km2 of land area; ‡Proportion (%) of total land surface area; §Households without radio possession; ¶Mean years of education of head of household.
Parameter
Parameter value
x1
Rainfall T2
x2
Rainfall T3
x3
Roadway T2*
x4
Roadway T3
x5
Population T2†
x6
Population T3
x7
Urban T2‡
x8
Urban T3
x9
Radio T2§
x10
Radio T3
x11
Education T2¶
x12
Education T3
Spatial hierarchical models were fit using Bayesian estimation via the R-INLA package.
The analysis was conducted in R version 3.2.1 software and STATcompiler was used for DHS indicators included in Table 1, to obtain country-level values.
Table 2 summarizes selected covariates at the country-level for Guinea, Liberia, and Sierra Leone. Sierra Leone had the largest number of confirmed cases as well as the highest road and waterway density. Liberia and Guinea were similar in case burden, with Liberia having the largest portion of households headed by females and households without toilets. Guinea had the highest elevation as well as the highest proportion of households with electricity. *Total confirmed EVD cases as of May 13, 2015; †Kilometer of roadway or waterway per 100 km2 of land area; ‡Proportion (%) of total land surface area; §Proportion (%) of households headed by males who have completed secondary education; ¶Proportion (%) of households that drink surface water source such as river, canal, dam, irrigation channel, lake, pond, and stream.Covariate Guinea Liberia Sierra Leone Total confirmed EVD cases* 3,144 3,339 9,394 Average rainfall accumulation (cm) 3.2 3.6 4.8 Average elevation (m) 3.2 172.1 166.4 Roadway density (km)† 10.0 11.2 16.8 Waterway density (km)† 11.2 9.0 15.7 Cropland (%)‡ 9.5% 5.1% 13.3% Female headed households (%) 17.3% 35.2% 28.0% Secondary education (%)§ 1.5% 10.5% 4.8% Households (%) without toilets 19.5% 45.2% 21.4% Households (%) with drinking water 10.2% 15.4% 18.2% Households (%) with electricity 26.2% 9.8% 13.5% Households (%) with radios 61.5% 58.9% 58.8%
Figure 1 displays the correlation between the covariates that were included in a final model, which was less that 0.4 for all.
Figure 1: The color and circle size vary with correlation, with increasing circle size and intensity in the color blue represent increasing correlation.
Table 3 presents the median posterior rate ratios for the covariates that were included in the final model. We report medians of the marginal posterior distributions for each parameter as our point estimates, as commonly done in Bayesian analyses. The spatial analysis indicates that districts were more at risk of EVD with increasing rainfall (RRRainfall2 2.18; 95% credible interval 0.66-7.20; RRRainfall3 5.34, 1.20-23.90), urban land cover (RRUrban2 4.87, 1.56-15.40; RRUrban3 5.74, 1.68-19.67), households not possessing a radio (RRRadio2 2.79, 0.90-8.78; RRRadio3 4.23, 1.16-15.93), and years of education (RREducation3 1.58, 0.40-6.25). Districts with low density were at higher risk than those with medium population density (RRPopulation2 0.64, 0.18-2.32) and similarly, districts with low or high roadway density (RRRoadway3 1.22, 0.35-4.26) were at higher risk those with medium roadway density (RRRoadway2 0.61, 0.19-1.96). *Kilometers of roadway per 100 km2 of land area; †Number of people per km2 of land area; ‡Proportion (%) of total land surface area. §Mean years of education of head of household.Covariate Rate rate (95% credible interval) Weekly rainfall accumulation (cm) <3.2 1.00 3.2-4.2 2.18 (0.66, 7.20) >4.2 5.34 (1.20, 23.90) Roadway density* <0.09 1.00 0.09-0.11 0.61 (0.19, 1.96) >0.11 1.22 (0.35, 4.26) Population density† <33.6 1.00 33.6-68.0 0.64 (0.18, 2.23) >68.0 0.98 (0.22, 4.35) Urban land cover (%)‡ <0.02 1.00 0.02-0.09 4.87 (1.56, 15.40) >0.09 5.74 (1.68, 19.67) Household not possessing radios (%) <38.1 1.00 38.1-47.6 2.79 (0.90, 8.78) >47.6 4.23 (1.16, 15.93) Years of education§ <2.1 1.00 2.1-3.2 0.98 (0.25, 3.79) >3.2 1.58 (0.40, 6.25)
In the full model, the spatial residuals are very small in magnitude and appear to be spatially random whereas the uncorrelated non-spatial residuals suggest that unmeasured non-clustered variation remains. In other words, the covariate set explains a meaningful amount of spatial variation but there are additional unmeasured or unmeasurable factors that explain the different rates of EVD per district.
In this study, we identified several environmental and demographic spatial predictors of EVD risk at the district level for Guinea, Liberia, and Sierra Leone, which require further study to determine causality. We found that lack of radio ownership was a strong predictor of EVD risk (RRRadio2 2.79, 0.90-8.78; RRRadio3 4.23, 1.16-15.93) at the district level. Radio campaigns in all three countries used serial dramas and popular music to disseminate risk communication, prevention, and social mobilization messages, which may have reduced EVD transmission risk.
The correlation between rainfall and EVD transmission risk is supported by previous work which found associations between increased rainfall or humidity and EVD outbreaks.
We included roadway and waterway densities as proxies for population mobility, which is thought to have been an important influence in the explosive nature of West African EVD epidemic.
Surprisingly, population density had no association with EVD risk with the exception of the second tercile having a weak protective effect (RRPopulation2 0.64, 0.18-2.32). This suggests that lower population densities were at increased risk for EVD transmission, which could be a consequence of clinical and public health service provision issues in remote areas. Conversely, our finding of increased EVD risk in more urban areas (RRUrban2 4.87, 1.56-15.40; RRUrban3 5.74, 1.68-19.67), while controlling for population density, may reflect the population mobility and the increased mixing between susceptible and infected individuals in urban areas.
Previous EVD outbreaks had much fewer cases and differences have been noted in historical outbreaks when EVD was introduced into the general population versus into a healthcare setting.
There are different ways in which measurement error could have influenced our findings. Remote sensing data was used in lieu of ground observations due to data availability and deriving measures of environmental characteristics from remotely sensed data requires assumptions about the values, which are also subject to measurement error.
Our work has shed new light on population-level spatial factors for EVD risk and future research should examine the etiologic relationships of these risk factors and EVD transmission. The potentially significant role of radio having reduced the EVD risk requires further study and is an important and modifiable risk factor for future outbreaks. Future research should incorporate higher spatial resolution (e.g., sub-prefectures, districts, chiefdoms) and a temporal dimension, as it would provide further understanding into aspects of population mobility and healthcare accessibility, which are critical features of epidemic propagation and control. In addition, these findings should be compared to other diseases that are transmitted from human-to-human in Guinea, Liberia, and Sierra Leone. This would provide further information into disease transmission patterns in Guinea, Liberia, and Sierra Leone and common risk factors among different diseases that could be used for integrated outbreak management.
The authors have declared that no competing interests exist.
The World Health Organization’s Ebola data is publicly available from the Ebola data and statistics page (http://apps.who.int/gho/data/node.ebola-sitrep.quick-downloads?lang=en). The Demographic and Health Survey (DHS) data is publicly available for registered users from the DHS Program (http://dhsprogram.com/data/available-datasets.cfm). Satellite images were provided by NASA for rainfall estimates (https://pmm.nasa.gov/data-access/downloads/trmm) and by USGS for temperature (https://lpdaac.usgs.gov/dataset_discovery/modis/modis_products_table/mod11a2) and land cover (https://lpdaac.usgs.gov/dataset_discovery/modis/modis_products_table/mcd12q1).
Kate Zinszer (kate.zinszer@umontreal.ca)