Open access data repository for institutional/news media tweet dataset in the time of COVID-19 pandemic
Detailed information is available in the pre-print: https://arxiv.org/abs/2004.01791
Since Twitter now provides a new academic API that gives access to full historical data, this dataset is no longer updated as of Feb 20, 2021.
Thank you very much for your interest in this small project.
News media and government/international organization tweets across different countries (e.g., US, UK, China, Spain, France, Germany, etc.). Feel free to share this repo.
Data were collected using the Twitter REST API.
The first data collection took place on March 12, 2020, and the data were then updated on my PC every week. That is, the first run collected the most recent 3,200 tweets (the official limit) of every target account, and each weekly update added the tweets posted since the previous run.
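The collection procedure described above (an initial pull of up to 3,200 recent tweets per account, followed by weekly increments) can be sketched as follows. This is a minimal illustration, not the script actually used for this repository: `fetch_page` is a hypothetical callable standing in for one timeline request, and the fake timeline exists only so that the sketch runs offline. The `max_id` and `since_id` cursor names mirror the parameters of Twitter's v1.1 user timeline endpoint, which returns at most 200 tweets per request.

```python
from typing import Callable, List, Optional

TIMELINE_CAP = 3200  # official limit on tweets retrievable from one timeline
PAGE_SIZE = 200      # maximum number of tweets per timeline request


def collect_timeline_ids(
    fetch_page: Callable[[Optional[int], Optional[int]], List[int]],
    since_id: Optional[int] = None,
    cap: int = TIMELINE_CAP,
) -> List[int]:
    """Walk a timeline from newest to oldest and return tweet IDs.

    fetch_page(max_id, since_id) is a hypothetical stand-in for one API
    request; it must return IDs in descending order (newest first).
    """
    collected: List[int] = []
    max_id: Optional[int] = None
    while len(collected) < cap:
        page = fetch_page(max_id, since_id)
        if not page:
            break  # nothing older is available
        collected.extend(page)
        max_id = min(page) - 1  # next page: strictly older tweets only
    return collected[:cap]


def make_fake_timeline(ids: List[int]):
    """Build an offline fetch_page over a fixed list of tweet IDs."""
    ordered = sorted(ids, reverse=True)

    def fetch_page(max_id: Optional[int], since_id: Optional[int]) -> List[int]:
        page = [
            i for i in ordered
            if (max_id is None or i <= max_id)
            and (since_id is None or i > since_id)
        ]
        return page[:PAGE_SIZE]

    return fetch_page


# First run: the account has 5000 tweets, but only the newest 3200 are reachable.
first_run = collect_timeline_ids(make_fake_timeline(list(range(1, 5001))))

# One week later: 40 new tweets exist; since_id restricts the pull to those.
weekly_update = collect_timeline_ids(
    make_fake_timeline(list(range(1, 5041))), since_id=max(first_run)
)
```

Storing the highest ID seen per account after each run is what keeps the weekly pull small in this sketch; without `since_id`, every update would re-download the whole reachable timeline.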
##V1.46 Last update: from Feb 11 to Feb 17
##V1.45 update data from Feb 04 to Feb 10
##V1.44 update data from Jan 28 to Feb 03
- @GiuseppeConteIT (has resigned) and @socialstyrelsen tweeted 0 messages.
- I will no longer update `eu_leadership` from next week.
##V1.43 update data from Jan 21 to Jan 27
##V1.42 update data from Jan 14 to Jan 20
- @socialstyrelsen tweeted 0 messages.
##V1.41 update data from Jan 7 to Jan 13
- `election_us` has been removed from my tracking list.
##V1.40 update data from Dec 31 to Jan 6 (2021)
- @socialstyrelsen tweeted 0 messages.
##V1.39 update data from Dec 24 to Dec 30
- @Itamaraty_EN tweeted 0 messages.
##V1.38 update data from Dec 17 to Dec 23
Merry Xmas
##V1.37 update data from Dec 10 to Dec 16
- Due to their low tweeting frequency, @BrazilGovNews and @French_Gov have been removed from my tracking list.
##V1.36 update data from Dec 3 to Dec 9
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.35 update data from Nov 26 to Dec 2
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.34 update data from Nov 19 to Nov 25
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.33 update data from Nov 12 to Nov 18
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.32 update data from Nov 5 to Nov 11
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.31 update data from Oct 29 to Nov 4
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.30 update data from Oct 22 to Oct 28
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.29 update data from Oct 15 to Oct 21
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.28 update data from Oct 8 to Oct 14
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.27 update data from Oct 1 to Oct 7
- @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages.
##V1.26 update data from Sep 24 to Sep 30
- @BrazilGovNews, @Itamaraty_EN and @French_Gov tweeted 0 messages.
##V1.25 update data from Sep 17 to Sep 23
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.24 update data from Sep 10 to Sep 16
- @BrazilGovNews, @Itamaraty_EN, @SwedishPM and @French_Gov tweeted 0 messages.
##V1.23 update data from Sep 3 to Sep 9
- @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages.
- @foreignoffice will be removed from the next update.
##V1.22 update data from Aug 27 to Sep 2
- @BrazilGovNews, @Itamaraty_EN, @SwedishPM, @French_Gov and @foreignoffice tweeted 0 messages.
- It seems that the Twitter account @foreignoffice has run into a problem, and its tweets are no longer publicly available.
##V1.21 update data from Aug 20 to Aug 26
- @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages.
##V1.20 update data from Aug 13 to Aug 19
- @BrazilGovNews, @Itamaraty_EN, @socialstyrelsen and @French_Gov tweeted 0 messages.
##Extra update 1
- `example_doc_classifier.R` is the example script (written for `election_us`) that I used to subset all the collected data.
##V1.19 update data from Aug 6 to Aug 12
- @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages.
##V1.18 update data from Jul 30 to Aug 5
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.17 update data from Jul 23 to Jul 29
- @BrazilGovNews, @French_Gov and @SwedishPM tweeted 0 messages.
##V1.16: update data from Jul 16 to Jul 22
- @BrazilGovNews, @Itamaraty_EN, @French_Gov and @socialstyrelsen tweeted 0 messages.
##V1.15: update data from Jul 9 to Jul 15
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.14: update data from Jul 2 to Jul 8
- @BrazilGovNews, @Itamaraty_EN, @French_Gov, @socialstyrelsen and @SwedishPM tweeted 0 messages.
##V1.13: update data from Jun 25 to Jul 1
- New: two Italian news media accounts, @LaStampa and @Corriere.
- @BrazilGovNews and @French_Gov tweeted 0 messages.
##V1.12: update data from Jun 18 to Jun 24
- New: `SE_tweet_id` (Swedish government, PM and news media tweets).
- Attention: during 0618-0624, @BrazilGovNews tweeted 0 messages.
- Attention: during 0618-0624, @French_Gov tweeted 0 messages.
##V1.11: update data from Jun 11 to Jun 17
- New: `TR_tweet_id` (Turkish government, president and news media tweets).
- Attention: during 0611-0617, @BrazilGovNews tweeted 0 messages.
- Attention: during 0611-0617, @Itamaraty_EN tweeted 0 messages.
- Attention: during 0611-0617, @French_Gov tweeted 0 messages.
##V1.10: update data from Jun 4 to Jun 10
- Attention: during 0604-0610, @BrazilGovNews tweeted 0 messages.
- Attention: during 0604-0610, @Itamaraty_EN tweeted 0 messages.
- Attention: during 0604-0610, @French_Gov tweeted 0 messages.
##V1.09: update data from May 28 to Jun 3
- Attention: during 0528-0603, @BrazilGovNews tweeted 0 messages.
##V1.08: update data from May 21 to May 27
- Attention: during 0521-0527, @BrazilGovNews tweeted 0 messages.
##V1.07: update data from May 14 to May 20
- Attention: during 0514-0520, @BrazilGovNews tweeted 0 messages.
##V1.06: update data from May 7 to May 13.
- Attention: during 0507-0513, @BrazilGovNews tweeted 0 messages.
- Attention: during 0507-0513, @French_Gov tweeted 0 messages.
##V1.05: update data from April 30 to May 6.
- Attention: during 0430-0506, @BrazilGovNews tweeted 0 messages.
##V1.04: update data from April 23 to April 29.
- Attention: during 0423-0429, @BrazilGovNews tweeted 0 messages.
- Attention: during 0423-0429, @French_Gov tweeted 0 messages.
##V1.03: update data from April 16 to April 22.
- New: `BR_tweets` (Brazilian government, president and news media).
- Attention: during 0416-0422, @French_Gov tweeted 0 messages.
- Attention: during 0416-0422, @BorisJohnson tweeted 0 messages.
##V1.02: update data from April 9 to April 15.
- New: `EU_leadership` (@BorisJohnson, @EmmanuelMacron, @GiuseppeConteIT, @sanchezcastejon).
- New: `election_us` (@BernieSanders, @JoeBiden, @realDonaldTrump, @POTUS).
- New: `national_gov_foreign_office` (you can see this as a major update to the previous gov file; it includes 14 European/US/Chinese government/foreign office accounts).
- Minor change: @globaltimesnews moved from `ADDITIONAL_news_tweet_id` to `CHINA_news_tweet_id`.
- Minor change: @spiegelonline stopped tweeting on 20200108 and was removed from my collection query; its tweet IDs were saved in V1.0.
##V1.01: update data from April 2 to April 8.
##First online: April 2, 2020
Data were crawled by Twitter account user name (the same as the txt file name). Some of the accounts may have gone unmaintained for a long time (for example, @SanidadPublicaEs stopped tweeting in 2014 but became active again when COVID-19 became a global crisis).
I did NOT remove the historical data from before the coronavirus outbreak. For any questions, please contact me (see email below).
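Because pre-outbreak tweets are kept in the files, some users may want to drop them before hydrating. One possible approach, offered as a sketch rather than as part of this repository, relies on the fact that tweet IDs issued since November 2010 are Snowflake IDs whose upper bits encode the creation time in milliseconds. The function names and the cutoff date below are illustrative assumptions.

```python
from datetime import datetime, timedelta, timezone
from typing import Iterable, List

TWITTER_EPOCH_MS = 1288834974657  # Snowflake epoch: 2010-11-04 01:42:54.657 UTC
UNIX_EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def snowflake_to_datetime(tweet_id: int) -> datetime:
    """Creation time encoded in a Snowflake tweet ID.

    Only meaningful for IDs issued after the Snowflake switch (Nov 2010).
    """
    ms = (tweet_id >> 22) + TWITTER_EPOCH_MS
    return UNIX_EPOCH + timedelta(milliseconds=ms)


def datetime_to_snowflake(moment: datetime) -> int:
    """Smallest Snowflake ID that could have been issued at `moment`.

    `moment` must be timezone-aware.
    """
    ms = (moment - UNIX_EPOCH) // timedelta(milliseconds=1)
    return (ms - TWITTER_EPOCH_MS) << 22


def keep_since(tweet_ids: Iterable[int], cutoff: datetime) -> List[int]:
    """Keep only the IDs created at or after `cutoff`."""
    threshold = datetime_to_snowflake(cutoff)
    return [i for i in tweet_ids if i >= threshold]


# Illustrative cutoff (an assumption, not a date taken from this README).
cutoff = datetime(2020, 1, 1, tzinfo=timezone.utc)

# Synthetic IDs built from known dates so the example runs offline.
sample = [
    datetime_to_snowflake(datetime(2014, 6, 1, tzinfo=timezone.utc)),
    datetime_to_snowflake(datetime(2020, 3, 12, tzinfo=timezone.utc)),
    datetime_to_snowflake(datetime(2021, 2, 17, tzinfo=timezone.utc)),
]
kept = keep_since(sample, cutoff)
```

Filtering on the raw ID avoids hydrating tweets that will be discarded anyway; the timestamp has millisecond resolution, which is more than enough for a date cutoff.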
Two tools are recommended for hydrating the tweet IDs: Hydrator (https://github.com/DocNow/hydrator) or twarc (https://github.com/DocNow/twarc).
Please follow the instructions provided by each tool.
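Before hydrating, it can help to load an account's ID file, skip blank or malformed lines, and remove duplicates. The sketch below assumes one numeric tweet ID per line in each txt file, which is an assumption about the file layout; the file name `example_account.txt` is hypothetical, and the batch size of 100 corresponds to the per-request maximum of Twitter's v1.1 `statuses/lookup` endpoint. Hydrator and twarc do their own batching, so the `batched` helper only matters for custom tooling.

```python
import os
import tempfile
from typing import Iterator, List


def read_tweet_ids(path: str) -> List[str]:
    """Read tweet IDs from a text file, one per line (assumed layout).

    Blank lines, non-numeric lines and repeated IDs are skipped;
    the original order is preserved.
    """
    seen = set()
    ids: List[str] = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            token = line.strip()
            if token.isascii() and token.isdigit() and token not in seen:
                seen.add(token)
                ids.append(token)
    return ids


def batched(ids: List[str], size: int = 100) -> Iterator[List[str]]:
    """Yield consecutive chunks of at most `size` IDs."""
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


# Offline demonstration with a temporary, hypothetical ID file.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "example_account.txt")
    with open(demo_path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(str(n) for n in range(1000, 1250)))
        handle.write("\n1000\n\nnot-an-id\n")  # duplicate, blank, garbage
    demo_ids = read_tweet_ids(demo_path)
    demo_batches = list(batched(demo_ids))
```

IDs are kept as strings on purpose: they exceed the 53-bit integer precision of some tools and languages, and string handling avoids silent rounding when the list is passed on.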
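- 吉田光男. (2020). COVID-19 流行下におけるソーシャルメディア—日本での状況と研究動向・公開データセット— [Social media during the COVID-19 epidemic: the situation in Japan, research trends, and public datasets]. 人工知能, 35(5), 644-653.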
- Liang, S., Wong, D. F., & Zhang, Y. (2020, October). 新型冠状病毒肺炎相关的推特主题与情感研究 (Exploring COVID-19-related Twitter Topic Dynamics across Countries). In Proceedings of the 19th Chinese National Conference on Computational Linguistics (pp. 707-718).
- Shuja, J., Alanazi, E., Alasmary, W., & Alashaikh, A. (2020). Covid-19 open source data sets: A comprehensive survey. medRxiv.
- Yu, J., Lu, Y., & Muñoz-Justicia, J. (2020). Analyzing Spanish News Frames on Twitter during COVID-19—A Network Study of El País and El Mundo. International Journal of Environmental Research and Public Health, 17(15), 5414.
Jingyuan Yu
narcisoyu[at]gmail[dot]com
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.