feat(nlp-scraper): add link to datasets provided

4 months ago · 375bb5c1fb
3 changed files with 2 additions and 1 deletions
--- a/subjects/ai/nlp-scraper/README.md
+++ b/subjects/ai/nlp-scraper/README.md
@ -50,7 +50,8 @@ https://towardsdatascience.com/named-entity-recognition-with-nltk-and-spacy-8c4a
 ### **2. Topic detection:**

 The goal is to detect what the article is dealing with: Tech, Sport, Business,
-Entertainment or Politics. To do so, a labelled dataset is provided. From this
+Entertainment or Politics. To do so, a labelled dataset is provided: [training
+data](bbc_news_train.csv) and [test data](bbc_news_test.csv). From this
 dataset, build a classifier that learns to detect the right topic in the
 article. The trained model should be stored as `topic_classifier.pkl`. Make
 sure the model can be used easily (with the preprocessing pipeline built for
--- a/subjects/ai/nlp-scraper/bbc_news_tests.csv
+++ b/subjects/ai/nlp-scraper/bbc_news_tests.csv
--- a/subjects/ai/nlp-scraper/bbc_news_train.csv
+++ b/subjects/ai/nlp-scraper/bbc_news_train.csv