Welcome to the Ultimate Data Science Interview Prep Repository!
This repository is your all-in-one resource for preparing and excelling in data science interviews. Whether you're brushing up on technical skills, mastering behavioral questions, or seeking interactive tools, I've got you covered!
What you'll find in this repository:
- Datasets
- In the
data
folder you will find a selection of datasets you can use.
- In the
- Interactive Jupyter Notebooks - use them to get hands-on practice with essencial data science methods.
- Check out the
data_science_training
folder to see the covered topics.
- Check out the
- Interview Tips and Cheatsheets
-
- Check out the
interview_tips_and_cheatsheets
folder to see the covered topics.
- Check out the
- You will find interview FAQ (HR, Behavioural and technical), as well as several cheatsheets.
-
- AI-Powered Chatbot for Technical Prep
- Use our chatbot as your interview mentor or to roleplay a mock interview.
- Clone repository
git clone https://github.com/LoboaTeresa/Data-Science-Dojo.git
- Install cuda toolkit. (example with conda)
conda install -c conda-forge cudatoolkit=11.2 cudnn=8.1.0
- Install the requirements. You need Python 3.8.
pip install -r requirements.txt
- Unzip the
data.zip
folder to access the datasets. - Explore the interactive notebooks inside the 'data_science_training' folder.
- Attempt the challenging coding assesments inside the 'assesments' folder. #TODO
- Roleplay an interview with our chatbot. Steps:
- 7.1. #TODO
- This repository is designed to help data scientists prepare for interviews.
- It’s not a resource for learning data science from scratch, so some basic knowledge is expected.
- It’s ideal for refreshing key concepts, reviewing libraries, and accessing a collection of assessments and interview questions (technical, behavioral, and HR).
This project is alive!🧟 This means it is a work in progress.
Future additions:
- Hierarchial clustering notebook
- Deep Learning notebooks
- NLP notebooks
- Computer Vision notebooks
- Langchain and LLMs notebooks
I would like to thank the active online communities on Glassdoor and Reddit, whose members generously share their interview experiences and insights. Your contributions have been invaluable in helping others navigate the challenges of data science interviews. Finally, thank all the authors in Medium and GeeksforGeeks blogs
Special thanks to Aaren Stubberfield and Leondra R. Gonzalez for their insightful book, Cracking the Data Science Interview, which has been a significant source of information. Their book, available on Amazon, is a must-read for anyone preparing for data science roles.
Your collective wisdom and resources have made this repository possible. Thank you!