Skip to content

Exploratory data analysis of the 2023 developer survey conducted by stack overflow, using python.

Notifications You must be signed in to change notification settings

rahulkay122/StackOverflow-EDA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exploratory Data Analysis on Stack Overflow Developer Survey 2023

Project Overview

This project involves conducting an exploratory data analysis (EDA) on the Stack Overflow Developer Survey 2023 dataset. The goal is to clean, transform, and visualize the data and answer a few questions to uncover trends and insights within the developer community. The analysis was performed using Python libraries including Pandas, Matplotlib, and Seaborn.

Data Source

The Stack Overflow Developer Survey 2023 dataset is used for this analysis. It includes attributes related to developer demographics, technology preferences, job satisfaction, and more. The dataset can be found here: https://survey.stackoverflow.co/

Tools and Libraries

  • Python
  • Pandas
  • Matplotlib
  • Seaborn

Analysis Steps

  1. Data Cleaning:

    • Handled missing values
    • Standardized data formats
    • Removed duplicates
  2. Data Transformation:

    • Restructured data for better analysis
    • Created new calculated fields where necessary
  3. Data Wrangling:

    • Filtered and selected relevant columns for analysis
  4. Data Visualization:

    • Utilized Matplotlib and Seaborn to create various plots (e.g., histograms, scatter plots, bar charts)
    • Visualized trends and patterns in the data
    • A pairplot between the numerical columns was made to identify the correlation between them. Pairplot

Research Questions

  • Qualifications: Does a master's degree give you any significant advantage?
  • Coding Experience: How much does coding experience affect compensation?
  • Average Salary: Who is the highest-paid employee in India?
  • Working preferences: How much does remote working matter to employees?

Future Scope of the project

The survey was conducted globally, but the EDA was done only with the respondents from India to answer more subjective and relevant questions.

Further EDA can be done for each specific country, or the entire dataset disregarding the country to answer a larger array of questions

Conclusion

The exploratory data analysis provided valuable insights into the developer community within India, highlighting key demographic trends, technology usage, and the correlation among these. These findings can inform stakeholders and guide future decisions in the tech industry.

About

Exploratory data analysis of the 2023 developer survey conducted by stack overflow, using python.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published