This repository contains a comprehensive collection of Jupyter notebooks covering fundamental to advanced statistical concepts, implementations, and analyses. It serves as both a learning resource and a reference for statistical methods in data science.
The notebooks in this repository provide hands-on implementations and explanations of statistical concepts, progressing from basic descriptive statistics to advanced topics like hypothesis testing and regression analysis. Each notebook is designed to be self-contained while building upon previous concepts.
-
Descriptive Statistics (
abc_01_decriptive_stats.ipynb
)- Measures of central tendency
- Variability measures
- Distribution analysis
-
Combinatorics (
abc_02_combinatorics.ipynb
)- Permutations
- Combinations
- Probability fundamentals
-
Probability Distributions (
abc_03_distributions.ipynb
)- Normal distribution
- Binomial distribution
- Other common distributions
-
Sampling Methods (
abc_04_sampling.ipynb
,sampling.ipynb
)- Random sampling techniques
- Sample size determination
- Central Limit Theorem applications
-
Hypothesis Testing (
abc_05_hypothesis_testing.ipynb
)- T-tests
- Chi-square tests
- ANOVA
- Multiple testing corrections (FWER, FDR)
-
Regression Analysis (
abc_06_regression.ipynb
)- Linear regression
- Multiple regression
- Gradient descent implementation
- Clone this repository:
git clone https://github.com/macozu/Statistics.git
cd Statistics
- Create and activate a virtual environment (recommended):
python -m venv venv
source venv/bin/activate # On Windows use: venv\Scripts\activate
- Install required packages:
pip install -r requirements.txt
- Launch Jupyter Notebook:
jupyter notebook
- Python 3.8+
- Basic understanding of Python programming
- Familiarity with Jupyter notebooks
Key libraries used in this project:
- NumPy for numerical computations
- Pandas for data manipulation
- SciPy for statistical functions
- Matplotlib and Seaborn for visualization
- Statsmodels for statistical models
For a complete list of dependencies, see requirements.txt
.
Feel free to open issues or submit pull requests if you find any bugs or have suggestions for improvements.
This project is licensed under the MIT License - see the LICENSE file for details.