This is a repository linked to the blog posts written by Datamarinier and Huapii. The posts were written by data scientists from both Datamarinier and Huapii, the main code was provided from Huapii, and the infrastructure on Google Cloud Platform was designed by Datamarinier.
Here are some notes on what the files and folders contain, but for more information on how to use them, do take a look at our blog ;)
model
: The model from Hugging FaceDockerfile
: The docker container settings for deployment on Cloud Runcreate_embeddings.py
: The script to create embeddings from master_skills_list.txtdownload_huggingface_model.py
: The script to download the model in the model folderhelper_script.sh
: A helper script to accompany Dockerfilemain.py
: The main function when deploying to Cloud Runmaster_emb_list.pkl
: The embeddings created with create_embeddings.pymaster_skills_list.txt
: The skill list to recommend fromrecommend_without_cloudrun.py
: The main function written to run without Cloud Runrequirements.txt
: The list of Python modules that are needed to run the code