Skip to content

RickArko/DengueFever

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DengueFever

Forecast total cases of Dengue Fever in the cities of San Juan and Iquitos.


Checklist:

  1. EDA\Visualizations
  2. Analysis of Covariate Relation to Target
  3. Documentation, Observations, Statistical Tests
  4. Forecast Model

Exploratory Data Analysis

Results saved in src/eda

    python eda.py

Jupyter

    python -m ipykernel install --user --name=venv-azureml-37
    jupyter notebook

Baseline model (Simple AutoRegression)

Results saved in src/ar_models

    python ar_model.py

Modelling Thoughts

  1. Baseline (something simple to improve on - lagged inputs only)
  2. Complex (clearly exogeneous variables matter here, this is tricky though get to if time permits)

Features/Processing (initial thoughts):

1. Normalize (cases per 100k rather than total cases)
2. Missing Data
    - mean
    - same as last
    - mode for categoric
    - MICE
3. Standardize Timestamp Frequency
    - Not a big deal but make every period start on same day
    - Aggregate covariates appropriately
3. Features
    - AutoRegressive (Lags)
        - Quickly test a few simple AR Models (lagged observations clearly matter)
    - Exogeneous
        - Clearly lagged variables don't contain all the information (non-cyclical plot)
        - This is a mosquito born virus and we're provided lots of NOAA data... 
            - Rainfaill/Humidity and Temperature likely important
                - Linear Correlation is low (include plots)
            - Perhaps order matters?
                - (i.e. mild winter followed by humid wet/hot season)
                - El Nino? (perhaps more foreseeable compared to humidity/temp)

        - Including Exogeneous predictors is not easy in a time-series context
            - forecasts will be required for each co-variate, so it's important to establish a baseline and demonstrate meaningfull improvement

Installation/Configuration

Need some version of Python3.9 in $PythonPath then use pipenv to install dependencies:

    $PythonPath\python.exe -m pip install pipenv
    set PIPENV_VENV_IN_PROJECT="enabled"
    pipenv install
    pipenv shell
    cd src

About

Forecast Dengue

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages