mirror of https://github.com/01-edu/Branch-AI.git
NLP-enriched News Intelligence platform
This is the correction for project 1: NLP-enriched News Intelligence platform.
Deliverables
project
│ README.md
│ environment.yml
│
└───data
│ │ topic_classification_data.csv
│
└───results
│ │ topic_classifier.pkl
│ │ learning_curves.png
│
└───nlp_engine
│
Validation
The project is validated if:
Scraper
- There are at least 300 news articles stored in the file system or the database.
- Run the scraper and fetch 3 documents. The scraper is not required to stop after a given number of documents; you can stop it manually once 3 have been fetched.
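For articles stored in the file system, the 300-article criterion can be checked with a short script. This is a minimal sketch, not part of the project: the function names, the one-file-per-article layout, and the `*.json` pattern are assumptions to adapt to how the scraper actually stores its output.

```python
from pathlib import Path


def count_articles(root, pattern="*.json"):
    """Count files matching `pattern` anywhere under `root`.

    Assumes one file per article; `pattern` is a hypothetical default.
    """
    return sum(1 for path in Path(root).rglob(pattern) if path.is_file())


def has_enough_articles(root, minimum=300, pattern="*.json"):
    """Return True if at least `minimum` article files are stored under `root`."""
    return count_articles(root, pattern) >= minimum
```

Calling `has_enough_articles("data")` would then check a directory named `data`; that path is only an example. Articles stored in a database need an equivalent row count instead.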
NLP engine
- Run the nlp_engine on the 3 documents you fetched.
Topic classifier
- The accuracy on the test set is greater than 95%.
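The accuracy criterion can be checked as below. This is a sketch under assumptions: the function names are hypothetical, and the true and predicted labels are taken as two plain lists of equal length. How the predictions are obtained from `topic_classifier.pkl` depends on how the model was saved, so that step is left out.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy on an empty test set")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def passes_accuracy_check(y_true, y_pred, threshold=0.95):
    """Return True only if accuracy is strictly greater than `threshold`."""
    return accuracy(y_true, y_pred) > threshold
```

The comparison is strict, so a test-set accuracy of exactly 95% does not pass.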