Data in Brief (Jun 2023)
A Twitter dataset for Monkeypox, May 2022
Abstract
After two years of struggling with the COVID-19 pandemic, the world is finally recovering from the crisis. Nonetheless, another virus, Monkeypox, is spreading quickly, including in non-endemic regions and continents, and threatens the world with a new pandemic. Twitter, a popular social media platform, has been used successfully for predicting and controlling outbreaks, and much prior research has addressed early warning systems, trend prediction, and the detection of misinformation and fake news. Because tweets are not accessible to all researchers, this work presents a publicly available dataset of 2,400,202 tweets gathered from May 1 to December 25, 2022. The dataset was collected with the Twitter developer Academic Research API, which returns all tweets matching a given query. To this end, the full-archive search was used with keywords related to Monkeypox and its equivalents in other languages, i.e., Monkeypox or “monkey pox” or “viruela dei mono” or “variole du singe” or “variola do macoco”. Retweets were excluded using the negation operator, and the tweet IDs and user IDs were extracted and shared publicly. Approximately 1.79 percent (43,047) of the tweets were geotagged. To visualize the geotagged tweets, the longitudes and latitudes of the bounding-box coordinates were averaged. This work will help researchers shed light on the news, patterns, and ongoing discussions of Monkeypox on social media, identify hotspots, and help contain the Monkeypox virus.
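The collection and geotag steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. The function names (`build_query`, `bbox_center`, `geotagged_share`) are hypothetical, the `-is:retweet` operator and the `[west, south, east, north]` bounding-box layout are assumptions based on the Twitter API v2 conventions, and the keywords are copied verbatim from the abstract.

```python
from typing import Sequence, Tuple

# Keywords copied verbatim from the abstract.
KEYWORDS = [
    "Monkeypox",
    "monkey pox",
    "viruela dei mono",
    "variole du singe",
    "variola do macoco",
]


def build_query(keywords: Sequence[str]) -> str:
    """Join keywords with OR and exclude retweets with a negated operator.

    Assumption: Twitter API v2 query syntax, where a leading '-' negates
    an operator and 'is:retweet' matches retweets.
    """
    # Multi-word keywords are quoted so they match as exact phrases.
    terms = [f'"{k}"' if " " in k else k for k in keywords]
    return "(" + " OR ".join(terms) + ") -is:retweet"


def bbox_center(bbox: Sequence[float]) -> Tuple[float, float]:
    """Average a bounding box into one (longitude, latitude) point.

    Assumption: bbox is [west_lon, south_lat, east_lon, north_lat].
    Averaging the four corners of such a box equals taking the midpoint
    of each pair of opposite edges.
    """
    west, south, east, north = bbox
    return ((west + east) / 2, (south + north) / 2)


def geotagged_share(geotagged: int, total: int) -> float:
    """Percentage of geotagged tweets, rounded to two decimals."""
    return round(100 * geotagged / total, 2)


query = build_query(KEYWORDS)
share = geotagged_share(43047, 2400202)
```

With the counts reported in the abstract, `geotagged_share(43047, 2400202)` reproduces the stated 1.79 percent. A simple average of longitudes is only a reasonable approximation for boxes that do not cross the antimeridian.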