Findings from Stack Overflow's Developer Survey of 2019
Pandas 1.0.1, numpy 1.17.4, scikit-learn 0.20.0. The code was developed using Python 3 (3.7.0) and the above libraries. It should mostly run on other versions of these libraries too.
For this project, I explored a few trends among developers in the IT industry. The data is from Stack Overflow's Developer Survey of 2019, which you can find here.
- On average, which age groups of developers earn the most in the industry?
- How do average salaries compare across organisations of different sizes?
- Do startups consist of a significant percentage of senior developers, or do younger developers dominate? Which country contributes the most developers working in startups?
- How are freelancers distributed when age group and country of residence are taken into account?
- Do the majority of developers wish to work remotely? What factors most influence this decision?
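As an illustration of the first question, here is a minimal sketch of averaging salary per age group. It is not the notebook's code: the column names `Age` and `ConvertedComp`, the `"NA"` marker for missing answers, and the 10-year buckets are assumptions, and the sample rows are made up rather than taken from the survey. It uses only the standard library so that it runs without any dependencies; a pandas `groupby` would express the same idea more compactly.

```python
from collections import defaultdict
from statistics import mean


def age_group(age, width=10):
    """Label an age with its bucket, e.g. 27 -> '20-29'."""
    low = int(age // width) * width
    return f"{low}-{low + width - 1}"


def mean_salary_by_age_group(rows, age_col="Age", salary_col="ConvertedComp"):
    """Average salary per age bucket, skipping rows with missing values.

    The default column names are assumptions about the survey CSV.
    """
    buckets = defaultdict(list)
    for row in rows:
        age, salary = row.get(age_col), row.get(salary_col)
        if age in (None, "", "NA") or salary in (None, "", "NA"):
            continue  # respondent left one of the two questions blank
        buckets[age_group(float(age))].append(float(salary))
    return {group: mean(values) for group, values in sorted(buckets.items())}


# Tiny made-up sample for illustration only, NOT survey data.
sample = [
    {"Age": "24", "ConvertedComp": "50000"},
    {"Age": "27", "ConvertedComp": "70000"},
    {"Age": "35", "ConvertedComp": "90000"},
    {"Age": "41", "ConvertedComp": "NA"},
]
result = mean_salary_by_age_group(sample)
print(result)
```

To try this on the real data, the rows could come from `csv.DictReader` opened on the survey file, provided the column names match the assumptions above.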
There is one notebook file, Developer Data Analysis.ipynb. If you wish to execute it, please provide the path to the input file in cell 2.
The main findings of this study can be found here. The Developer Data Analysis.ipynb file has cells explaining the answers and trends for each of the questions above.
Credits to Stack Overflow for conducting the survey and releasing the data publicly. You can find the licensing for the data and other descriptive information here.