Skip to content

πŸ‘€πŸ¦  Evaluating the predictive power of Wikipedia pageviews for mpox cases πŸ¦ πŸ‘€

Notifications You must be signed in to change notification settings

smkerr/mpox-wiki-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

From Clicks to Cases: Leveraging Wikipedia Pageviews to Predict Mpox Cases in the United States

Abstract

As the world becomes increasingly interconnected and climate change elevates the risk of zoonotic spillover events, the public becomes ever more susceptible to global-scale outbreaks. Traditional disease surveillance methods are prone to under-reporting and time lags. By contrast, Wikipedia pageviews offer a real-time and cost-effective open source resource for tracking online health-related information-seeking behavior with the potential for enhancing global disease surveillance. This paper investigates the value of anonymized country-level Wikipedia pageviews data for predicting case incidence during the 2022-2024 mpox outbreak in the United States. The methods employed in this study involve a combination of quan- titative techniques aimed at increasing understanding of the relationship between online behaviors and disease dynamics. A lag analysis correlating mpox cases and pageviews for mpox-related Wikipedia articles at different time lags was conducted to assess the variation in directionality between pageviews and cases across mpox-related articles. This was followed by a multivariate linear regression analysis aimed at predicting mpox incidence based on pageview data. Finally, impulse response and Granger-causality tests were performed to further analyze the directionality of the relationship between online activity and mpox cases. The study’s findings underscore the potential of Wikipedia traffic as a predictive tool for public health trends, revealing a bidirectional relationship between pageviews and mpox cases that unfolds over time. The predictive models struggled with accuracy, highlighting the need for further model refinement to adequately account for the complexity of online attention and disease dynamics.

➑️ Read the paper

➑️ View the poster

Repository structure

.
β”œβ”€β”€ README.md
β”œβ”€β”€ 1-proposal
β”‚   β”œβ”€β”€ data-report
β”‚   └── pre-analysis-plan
β”œβ”€β”€ 2-literature
β”œβ”€β”€ 3-data
β”‚   β”œβ”€β”€ mpox-cases
β”‚   β”œβ”€β”€ mpox-news
β”‚   β”œβ”€β”€ mpox-studies
β”‚   β”œβ”€β”€ wikipedia
β”‚   └── output
β”œβ”€β”€ 4-code
β”œβ”€β”€ 5-tables
β”œβ”€β”€ 6-figures
β”œβ”€β”€ 7-paper
└── 8-poster

About

πŸ‘€πŸ¦  Evaluating the predictive power of Wikipedia pageviews for mpox cases πŸ¦ πŸ‘€

Topics

Resources

Stars

Watchers

Forks