diff --git a/extraction_medical_terms_from_online_news_and_twitter.ipynb b/extraction_medical_terms_from_online_news_and_twitter.ipynb index beb17e8fba71e2e922b4ea0bb04f66e734c4c4ea..e44207effb5327a475890b7fccd2e105f4a89715 100644 --- a/extraction_medical_terms_from_online_news_and_twitter.ipynb +++ b/extraction_medical_terms_from_online_news_and_twitter.ipynb @@ -152,7 +152,8 @@ "Only tweet IDs can be shared from which we can retrieve all tweet contents and metadata. It's called **hydrating tweet**.\n", "To do so, you can use the command line tool [twarc](https://github.com/DocNow/twarc). You must set your credentials and then hydrate tweets : `twarc hydrate tweet-ids.txt > tweets.jsonl`\n", "\n", - "For this workshop, we are going to use a tweets corpus in Licence CC0 (Public Domain) from [kaggle platform](https://www.kaggle.com/gpreda/pfizer-vaccine-tweets)." + "For this workshop, we are going to use a tweets corpus in Licence CC0 (Public Domain) from [kaggle platform](https://www.kaggle.com/gpreda/pfizer-vaccine-tweets).\n", + "**If you have already a kaggle account, you can download the dataset from the link below or you can download from this link [filesender](https://filesender.renater.fr/?s=download&token=1706766d-676e-4823-a1b4-665067e5fc81#), password will be given during the workshop**. Please, now upload this file in data directory" ] }, {