From 36dc7a662d19323af692db62ac1f2e48b50da53d Mon Sep 17 00:00:00 2001
From: Decoupes Remy <remy.decoupes@inrae.fr>
Date: Wed, 17 Mar 2021 23:43:36 +0100
Subject: [PATCH] add procedure to download data

---
 extraction_medical_terms_from_online_news_and_twitter.ipynb | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/extraction_medical_terms_from_online_news_and_twitter.ipynb b/extraction_medical_terms_from_online_news_and_twitter.ipynb
index beb17e8..e44207e 100644
--- a/extraction_medical_terms_from_online_news_and_twitter.ipynb
+++ b/extraction_medical_terms_from_online_news_and_twitter.ipynb
@@ -152,7 +152,8 @@
     "Only tweet IDs can be shared from which we can retrieve all tweet contents and metadata. It's called **hydrating tweet**.\n",
     "To do so, you can use the command line tool [twarc](https://github.com/DocNow/twarc). You must set your credentials and then hydrate tweets : `twarc hydrate tweet-ids.txt > tweets.jsonl`\n",
     "\n",
-    "For this workshop, we are going to use a tweets corpus in Licence CC0 (Public Domain) from [kaggle platform](https://www.kaggle.com/gpreda/pfizer-vaccine-tweets)."
+    "For this workshop, we are going to use a tweet corpus released under the CC0 (Public Domain) licence, available on the [Kaggle platform](https://www.kaggle.com/gpreda/pfizer-vaccine-tweets).\n",
+    "**If you already have a Kaggle account, you can download the dataset from the link above; otherwise, download it from [filesender](https://filesender.renater.fr/?s=download&token=1706766d-676e-4823-a1b4-665067e5fc81#) (the password will be given during the workshop).** Then upload the file to the data directory."
    ]
   },
   {
-- 
GitLab