Tasks:
- Look at the `text.txt` file and notice that it is a video transcription.
- Run `main.py` by typing `python main.py --help`. Notice there are two tools: `extract` and `hashtags`.
- Next, convert the text into keywords using `python main.py extract`. What do you see?
- Now run `python main.py hashtags`. What do you see?
- Next, extend the command-line tool with another flag that limits the output to a maximum number of top keywords (a lower score is better).
- Grab some text from the internet, say from Wikipedia or a blog post, and create hashtags from it.
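
One way the limit-flag task could look is sketched below. This is not the course's `main.py`: the frequency-based scorer is a hypothetical stand-in for whatever keyword extractor the real tool uses, and the `--limit` and `--filename` flag names are assumptions chosen for illustration. The only property carried over from the task is that a lower score means a better keyword.

```python
import argparse
import re
from collections import Counter

# Hypothetical stand-in for the real keyword extractor.
# Score = 1 / frequency, so a lower score marks a more prominent keyword.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "it", "for", "on", "with"}


def extract_keywords(text):
    """Return (keyword, score) pairs sorted best-first (lowest score first)."""
    words = [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]
    counts = Counter(words)
    scored = [(word, 1.0 / count) for word, count in counts.items()]
    # Sort by score, then alphabetically so ties come out in a stable order.
    return sorted(scored, key=lambda pair: (pair[1], pair[0]))


def top_keywords(text, limit=None):
    """Keep only the `limit` best keywords; None keeps all of them."""
    keywords = extract_keywords(text)
    return keywords if limit is None else keywords[:limit]


def make_hashtags(text, limit=None):
    """Turn the best keywords into hashtags."""
    return ["#" + word for word, _ in top_keywords(text, limit)]


def build_parser():
    """Two subcommands, each with the assumed --limit and --filename flags."""
    parser = argparse.ArgumentParser(description="Keyword and hashtag sketch")
    sub = parser.add_subparsers(dest="command", required=True)
    for name in ("extract", "hashtags"):
        cmd = sub.add_parser(name)
        cmd.add_argument("--filename", default="text.txt")
        cmd.add_argument("--limit", type=int, default=None,
                         help="keep only the top N keywords")
    return parser


def main(argv=None):
    args = build_parser().parse_args(argv)
    with open(args.filename, encoding="utf-8") as handle:
        text = handle.read()
    if args.command == "extract":
        for word, score in top_keywords(text, args.limit):
            print(f"{word}\t{score:.3f}")
    else:
        print(" ".join(make_hashtags(text, args.limit)))
```

Calling `main()` from the usual `if __name__ == "__main__":` guard would turn this into a script. Sorting before slicing is the important design choice: the limit only makes sense once the keywords are ordered best-first.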
Tasks:
- Run `python etl.py` and notice there are three commands that work for ETL: `delete`, `etl`, and `query`.
- The ETL command extracts the keywords from the file and puts them in a SQLite database. Run it with `python etl.py etl`.
- Now that the database has been created and data loaded into it, query the database for the top results with `python etl.py query`.
- Change the command-line flag to return only five results.
- Create your own version of this tool on GitHub and extend the database with different metadata using another NLP tool.
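
The load, query, and delete steps above can be sketched with Python's built-in `sqlite3` module. This is a minimal illustration, not the course's `etl.py`: the `keywords` table, its columns, and the function names are assumptions, and the `limit` parameter stands in for whatever command-line flag the real tool exposes.

```python
import sqlite3


def load_keywords(conn, keywords):
    """Load step: store (keyword, score) pairs in an assumed `keywords` table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS keywords (keyword TEXT PRIMARY KEY, score REAL)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO keywords (keyword, score) VALUES (?, ?)", keywords
    )
    conn.commit()


def query_top(conn, limit=5):
    """Query step: return the best keywords, lowest score first."""
    cursor = conn.execute(
        "SELECT keyword, score FROM keywords ORDER BY score ASC LIMIT ?", (limit,)
    )
    return cursor.fetchall()


def delete_all(conn):
    """Delete step: drop the table so the ETL can be rerun from scratch."""
    conn.execute("DROP TABLE IF EXISTS keywords")
    conn.commit()


# Small demonstration against an in-memory database with made-up scores.
demo = sqlite3.connect(":memory:")
load_keywords(demo, [("cloud", 0.9), ("python", 0.1), ("data", 0.4)])
print(query_top(demo, limit=2))
delete_all(demo)
demo.close()
```

Passing the limit as a bound parameter keeps the query safe to drive from a command-line flag. Extending the table with extra metadata, for example from another NLP tool, would amount to adding columns to the `CREATE TABLE` statement and to the inserted tuples.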