Data Analyst Job Insights

Based on 15K+ real job ads

Rafael Belokurows

2023-06-01

Goal

  • Identifying trends in Data Analyst Job Ads

  • Are there common characteristics in jobs that offer higher salaries?

Tools used

  • Python to scrape, analyze and plot

    • spacy: text extraction

    • pandas: storing, processing, feature engineering

    • plotly: plots (duh)

    • pycaret: ML model building and evaluation

  • Quarto - using R - to present

Top Skills

  • SQL is the top skill in general
  • Python 2 for 1 compared with R
  • Tableau > Power BI
  • AWS the top cloud provider

Salary

Skills x Salary

Python means more $

Excel means less $

Salary x Skills (cont.)

Cloud and Engineer stuff👍🏻

Cliches 👎🏻

How prevalent are AI buzzwords?*

*In % of jobs they appear

Prediction 🔮

% of jobs with salary larger than a few thresholds — choosing my goal

ML model results

ML Results - Feature Importance

Skills are good for you (but not just any skills)

Verifying one prediction

My model says this job doesn’t pay so well

And my model was right 🥳

Takeaways

  • If training to be a Data Analyst, make sure you are proficient in at least:

    • SQL

    • Tableau or Power BI

    • Python (or R for some specific fields or skills)

    • One cloud platform, especially AWS

  • Be on lookout for positions with the following cliches on the job description. Those are probably workplaces that will challenge your mental health and you won’t even earn enough for it to be worth it:

    • “ability to work under pressure”

    • “multitasking”

    • “fast-paced environment”

  • European companies, please start disclosing salaries for your positions, thank you in advance!