Skip to content

aregm/icl

Repository files navigation

Infrastructure Control Language

Tool to easily run your data science, machine learning or deep learning experiments anywhere: locally, on-prem clusters, or hybrid clouds dynamically optimized for GPUs. Based on Infrastructure as Code, but pushed to the next level. Provides cosmic level of infrastructure control bringing old good days back when you run program on the machine you develop.

Still in engineering Alpha mode. Use with caution. But let us know what you think!

Quick start

The simplest way to start with ICL is to create a local ICL cluster in a Docker container:

git clone https://github.com/intel-ai/icl.git
cd icl
./scripts/deploy/kind.sh

The cluster's endpoints are accessible only from localhost:

In your browser, navigate to http://jupyter.localtest.me.

Install Python package infractl package with pip:

pip install infractl

ICL allows running a local Python program of Prefect flow in a cluster.

Create a Python file my_program.py:

if __name__ == '__main__':
    print('Hello from my_program')

Then execute the program in your cluster:

import infractl

await infractl.run(infractl.program('my_program.py'))

Create a Python file my_flow.py with Prefect flow definition:

from prefect import flow

@flow
def my_flow():
    print('Hello from my_flow')

Then execute the flow in your cluster:

import infractl

await infractl.run(infractl.program('my_flow.py'))

Note that the program and flow can be developed, tested, and executed locally. ICL allows executing the same program or flow in your local or remote ICL cluster.

Next steps

In addition to a local ICL cluster, there are scripts to create multi node ICL clusters in AWS, GCP. We also support deploying ICL into an existing Kubernetes or OpenShift clusters, provisioning bare-metal machines. Check our documentation at https://aregm.github.io/icl/.

ICL ecosystem

ICL can be a backbone of your company CI and ML infrastructure. Here is a component diagram which shows how it works for us.

flowchart LR
    subgraph Bare-metal
        bm["`ubuntu
        rocky
        raid
        datasets
        runners
        caches`"]
    end

    subgraph Clouds
        c["`aws
            gke
            idc
            onecloud
            dbaas
        `"]
    end

    subgraph VMs
        vm["`kvm
        qemu
        docker`"]
    end

    subgraph GitHub
        gh["`repos
        actions
        runners
        caches`"]
    end

    Bare-metal --> icl
    Clouds --> icl
    VMs --> icl
    GitHub --> icl

    subgraph icl["Infrastructure Control Language"]
        subgraph Kubernetes
            user["`arc
                grafana
                jupyterhub
                prefect
                pytorch
                ray`"]
            system["`
                prometheus
                kubernetes dashboard
                docker registry
                cert manager
                ingress
                rook-ceph
                shared volumes`"]
        end
        subgraph Deploy
            deploy["`maas
                terraform
                vagrant
                ansible
                multipass
                kubespray`"]
        end
        Deploy --> Kubernetes
    end

    subgraph Pipelines
        pb["product build"] -->

        check["`license check
            style check
            sanitizers
            llvm-tidy
            integration test`"] -->

        test["`functional tests
            cpu/gpu/sim tests
            issue and skiplist updates`"] -->

        perf["performance tests"] --> dashboard


        ib["infra build"] --> ctest["container tests"] --> iclu["icl definition updates"]
    end
    icl --> Pipelines


    subgraph Containers
         cnt["`self-hosted runners
         arc runners
         jupyter profiles
         dev dockers
         perf measurement dockers
         reproducers`"]
    end
    icl --> Containers

    subgraph Processes
        p["`sla
        access governance
        scans
        signing
        compliance checks
        retention
        backups`"]
    end
    icl --> Processes
Loading

About

Infrastructure Control Language

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published