Predicting Influenza in Latin America: Using Voting Ensembles to Combine Google Search Activity and Geo-spatial Synchronicities from Historical Flu Activity
Clemente Lopez, César Leonardo
MetadataShow full item record
Novel influenza surveillance systems in the U.S. that combine historical influenza reports from hospitals, along with external influenza-related sources (such as web-based databases and reports from neighboring locations, among others) have shown to generate flu estimates in advance to the official health reports published by healthcare-based systems. However, in Latin America, several systems that harnessed web-based data sources in the past, such as Google Flu Trends, have shown that novel influenza surveillance are yet to deliver acceptable flu estimates. In this work, we aim to show that improved influenza estimates can be achieved for various countries in Latin America, and even in more refined geographic-scales in Mexico, by implementing methodological changes on the way that information sources such as Google Search activity and historical influenza activity reports are combined. A methodological framework which dynamically combines different influenza tracking techniques via a voting ensemble system is presented and used to generate improved influenza activity estimates in 15 countries in Latin America, 5 regions and 27 states in Mexico. Our results show that the voting ensemble outperforms the 3 different techniques implemented to harness the predictive power of local and external data sources, reaching lower error and higher correlation scores. This methodology may prove helpful to local public health officials who organize health interventions aimed at mitigating influenza outbreaks, and its adaptive power may also prove useful to extend its scope by also tracking other diseases such as Dengue and Zika.
The following license files are associated with this item: