diff --git a/images/datascience.svg b/images/datascience.svg
index 7618cd4..2ebb734 100644
--- a/images/datascience.svg
+++ b/images/datascience.svg
@@ -1,3 +1,3 @@
-

Data Scientist

Data Scientist
Matrices & Linear Algebra Fundamentals
Matrices & Linear Algebra Fu...
Database Basics
Database Basics
Relational vs. non-relational databases
Relational vs. non-relational databases
SQL + Joins (Inner, Outer, Cross, Theta Join)
SQL + Joins (Inner, Outer, Cross, Thet...
NoSQL
NoSQL
Tabular Data
Tabular Data
Data Frames & Series
Data Frames & Series
Extract, Transform, Load (ETL)
Extract, Transform, Load (ET...
Reporting vs BI vs Analytics
Reporting vs BI vs Analytics
Data Formats
Data Formats
JSON
JSON
XML
XML
Regular Expressions (RegEx)
Regular Expressions (RegEx)
Probability Theory
Probability Theory
Probability distribution
Probability distribution
Randomness, random variable and...
Conditional probability and...
(Statistical) independence
(Statistical) independence
iid
iid
cdf, pdf, pmf
cdf, pdf, pmf
Continuous distributions (pdf's)
Continuous distributions (pd...
Cumulative distribution function (cdf)
Cumulative distribution function (cd...
Probability density function (pdf)
Probability density function (pdf)
Probability mass function (pmf)
Probability mass function (pmf)
Normal / Gaussian
Normal / Gaussian
Uniform (continuous)
Uniform (continuous)
Beta
Beta
Dirichlet
Dirichlet
Exponential
Exponential
Uniform (discrete)
Uniform (discrete)
Discrete distributions (pmf's)
Discrete distributions (pmf'...
 χ2 (chi-squared)
 χ2 (chi-squared)
Binomial
Binomial
Multinomial
Multinomial
Hypergeometric
Hypergeometric
Poisson
Poisson
Expectation and mean
Important Laws
Important Laws
Summary statistics
Summary statistics
Estimation
Estimation
Hypothesis Testing
Hypothesis Testing
Confidence Interval (CI)
Confidence Interval (CI)
Monte Carlo Method
Monte Carlo Method
Geometric
Geometric
Variance and standard deviation (...
Covariance and correlation
Median, quartile
Interquartile range
Interquartile range
Percentile / quantile
Mode
Mode
Law of large numbers (LLN)
Law of large numbers (LLN)
Central limit theorem (CLT)
Central limit theorem (CL...
Maximum Likelihood Estimation (MLE)
Maximum Likelihood Estimation (ML...
Kernel Density Estimation (KDE)
Kernel Density Estimation (KDE)
p-Value
p-Value
Chi2 test
Chi2 test
F-test
F-test
t-test
t-test
Python Basics
Python Basics
Important libraries
Important libraries
Virtual Environments
Virtual Environments
Expressions
Expressions
Variables
Variables
Data Structures
Data Structures
Functions
Functions
Install packages (via pip, conda or similar)
Install packages (via pip, conda or si...
Codestyle, e.g. PEP8
Codestyle, e.g. PEP8
Numpy
Numpy
Pandas
Pandas
Ecosystem
Ecosystem
Manipulate Data Frames
Manipulate Data Frames
Subsetting Data
Subsetting Data
Reading CSV and raw data
Reading CSV and raw data
Fundamentals
Fundamentals
Statistics
Statistics
Python Programming
Python Programming
Chart Suggestions thought starter
Chart Suggestions thought st...
Exploratory Data Analysis /
Data Munging / - Wrangling
Exploratory Data Analysis /...
Python
Python
Matplotlib
Matplotlib
plotnine (like ggplot in R)
plotnine (like ggplot in R)
Vega-Lite
Vega-Lite
D3.js
D3.js
Tableau
Tableau
Dash
Dash
Dimensionality & Numerosity...
Visualization
Visualization
Normalization
Normalization
Data Scrubbing,
Handling Missing Values
Data Scrubbing,...
Unbiased Estimators
Unbiased Estimators
Binning sparse values
Binning sparse values
Feature Extraction
Feature Extraction
Denoising
Denoising
Sampling
Sampling
Principal Component Analysis (PCA)
Principal Component Analysis...

Machine Learning

Machine Learning

Data Engineer

Data Engineer
CSV
CSV
Awesome Public Datasets
Awesome Public Datasets
Kaggle
Kaggle
Jupyter Notebooks / Lab
Jupyter Notebooks / Lab
Web
Web
Dashboards
Dashboards
BI
BI
PowerBI
PowerBI
seaborn
seaborn
ipyvolume (3D data)
ipyvolume (3D data)
streamlit
streamlit
Data Sources
Data Sources
Some boxes link to additional resources
Some boxes link to additional ress...
Interactive version on
i.am.ai/roadmap

Interactive version on...
Viewer does not support full SVG 1.1
\ No newline at end of file
+

Data Scientist

Data Scientist
Matrices & Linear Algebra Fundamentals
Matrices & Linear Algebra Fu...
Database Basics
Database Basics
Relational vs. non-relational databases
Relational vs. non-relational databases
SQL + Joins (Inner, Outer, Cross, Theta Join)
SQL + Joins (Inner, Outer, Cross, Thet...
NoSQL
NoSQL
Tabular Data
Tabular Data
Data Frames & Series
Data Frames & Series
Extract, Transform, Load (ETL)
Extract, Transform, Load (ET...
Reporting vs BI vs Analytics
Reporting vs BI vs Analytics
Data Formats
Data Formats
JSON
JSON
XML
XML
Regular Expressions (RegEx)
Regular Expressions (RegEx)
Probability Theory
Probability Theory
Probability distribution
Probability distribution
Randomness, random variable and...
Conditional probability and...
(Statistical) independence
(Statistical) independence
iid
iid
cdf, pdf, pmf
cdf, pdf, pmf
Continuous distributions (pdf's)
Continuous distributions (pd...
Cumulative distribution function (cdf)
Cumulative distribution function (cd...
Probability density function (pdf)
Probability density function (pdf)
Probability mass function (pmf)
Probability mass function (pmf)
Normal / Gaussian
Normal / Gaussian
Uniform (continuous)
Uniform (continuous)
Beta
Beta
Dirichlet
Dirichlet
Exponential
Exponential
Uniform (discrete)
Uniform (discrete)
Discrete distributions (pmf's)
Discrete distributions (pmf'...
 χ2 (chi-squared)
 χ2 (chi-squared)
Binomial
Binomial
Multinomial
Multinomial
Hypergeometric
Hypergeometric
Poisson
Poisson
Expectation and mean
Important Laws
Important Laws
Summary statistics
Summary statistics
Estimation
Estimation
Hypothesis Testing
Hypothesis Testing
Confidence Interval (CI)
Confidence Interval (CI)
Monte Carlo Method
Monte Carlo Method
Geometric
Geometric
Variance and standard deviation (...
Covariance and correlation
Median, quartile
Interquartile range
Interquartile range
Percentile / quantile
Mode
Mode
Law of large numbers (LLN)
Law of large numbers (LLN)
Central limit theorem (CLT)
Central limit theorem (CL...
Maximum Likelihood Estimation (MLE)
Maximum Likelihood Estimation (ML...
Kernel Density Estimation (KDE)
Kernel Density Estimation (KDE)
p-Value
p-Value
Chi2 test
Chi2 test
F-test
F-test
t-test
t-test
Python Basics
Python Basics
Important libraries
Important libraries
Virtual Environments
Virtual Environments
Expressions
Expressions
Variables
Variables
Data Structures
Data Structures
Functions
Functions
Install packages (via pip, conda or similar)
Install packages (via pip, conda or si...
Codestyle, e.g. PEP8
Codestyle, e.g. PEP8
Numpy
Numpy
Pandas
Pandas
Ecosystem
Ecosystem
Manipulate Data Frames
Manipulate Data Frames
Subsetting Data
Subsetting Data
Reading CSV and raw data
Reading CSV and raw data
Fundamentals
Fundamentals
Statistics
Statistics
Python Programming
Python Programming
Chart Suggestions thought starter
Chart Suggestions thought st...
Exploratory Data Analysis /
Data Munging / - Wrangling
Exploratory Data Analysis /...
Python
Python
Matplotlib
Matplotlib
plotnine (like ggplot in R)
plotnine (like ggplot in R)
Vega-Lite
Vega-Lite
D3.js
D3.js
Tableau
Tableau
Dash
Dash
Dimensionality & Numerosity...
Visualization
Visualization
Normalization
Normalization
Data Scrubbing,
Handling Missing Values
Data Scrubbing,...
Unbiased Estimators
Unbiased Estimators
Binning sparse values
Binning sparse values
Feature Extraction
Feature Extraction
Denoising
Denoising
Sampling
Sampling
Principal Component Analysis (PCA)
Principal Component Analysis...

Machine Learning

Machine Learning

Data Engineer

Data Engineer
CSV
CSV
Awesome Public Datasets
Awesome Public Datasets
Kaggle
Kaggle
Jupyter Notebooks / Lab
Jupyter Notebooks / Lab
Web
Web
Dashboards
Dashboards
BI
BI
PowerBI
PowerBI
seaborn
seaborn
ipyvolume (3D data)
ipyvolume (3D data)
streamlit
streamlit
Data Sources
Data Sources
Some boxes link to additional resources
Some boxes link to additional ress...
Interactive version on
i.am.ai/roadmap

Interactive version on...
Viewer does not support full SVG 1.1
\ No newline at end of file
diff --git a/images/datascience.xml b/images/datascience.xml
index 988ba77..f407dfa 100644
--- a/images/datascience.xml
+++ b/images/datascience.xml
@@ -1,7 +1,7 @@
-
+
-
+
@@ -441,12 +441,12 @@
-
+
-
+