Welcome to the AI Project Template repository! This template is designed to streamline the development and deployment of AI/ML/MLOps/LLMOps projects. It provides a comprehensive setup for experiment tracking, model versioning, monitoring, tracing, and efficient project management, ensuring that all critical aspects of an AI project are covered.
- GitHub Actions: CI/CD workflows for testing code, deploying models, running experiments, and more.
- Config Directory: Centralized configuration files for the project.
- Data Directory: Structured data management (raw, processed, and database data).
- IaC Directory: Infrastructure as Code (IaC) scripts for cloud deployment.
- Notebooks Directory: Jupyter notebooks for data analysis and experiments.
- Results Directory: Store experiment results, logs, and models.
- Source Code Directory: Modularized source code for models, pipelines, utilities.
- Tests Directory: Unit and integration tests using
pytest
. - pre-commit: Configuration for linting, type checking, and secret detection.
- Docker Compose: Setup for services like MLflow, Langfuse, databases, and storage.
- uv: An extremely fast Python package and project manager
- Pyproject.toml: Config for project, ruff linter, formatter, type checking, and testing.
Note
Not every element from this template is required for every project. Please adjust the repository structure and configuration files to fit your project's needs.
-
/config
: Store all configuration files needed for your project.- Example:
config.yaml
can be used for setting up hyperparameters, API keys, or other environment variables.
- Example:
-
/data
: Structured data folders for different stages of the project.raw/
: Raw data straight from the source.processed/
: Cleaned and preprocessed data ready for use in models.database/
: Database files if using local storage solutions.
-
/iac
: Infrastructure as Code scripts for automating cloud deployments (e.g., AWS CloudFormation, Terraform). -
/notebooks
: Jupyter notebooks for performing Exploratory Data Analysis (EDA), experimenting with models, and reporting.- Example:
00_example.ipynb
shows how a notebook looks in the project.
- Example:
-
/results
: Store experiment results, logs, and models. -
/src
: Main source code for your project./constants/
: Store project-wide constants (e.g., file paths, API endpoints)./models/
: Machine learning models (e.g., neural networks, decisi`on trees) scripts, classes, and functions./pipelines/
: Data and model pipelines./utils/
: Utility functions and helpers (e.g., data loaders, preprocessing functions).
-
/tests
: Unit and integration tests.
-
.env
: Store environment variables like API keys, database credentials, and sensitive information. -
.gitignore
: Standard.gitignore
file to exclude unnecessary files (e.g., environment files, data files). -
.pre-commit-config.yaml
: Configuration for pre-commit hooks for linting, formatting, and type checking. -
docker-compose.yaml
: A Docker Compose configuration to spin up services such as MLflow, Langfuse, databases, and any additional tools. -
pyproject.toml
: Configuration of the project, plus the configuration for formatting and linting. This file is used by uv to manage the project and ruff for linting and formatting. -
uv.lock
: Lock file for uv package manager automatically generated byuv
when installing packages.
git clone https://github.com/username/ai-project-template.git
cd ai-project-template
- For macOS and Linux:
curl -LsSf https://astral.sh/uv/install.sh | sh
- For Windows:
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
uv install python
uv sync --all-extras --dev
docker-compose up -d
.
├── config # Configuration files for your project
│ └── config.yaml # Example configuration file
├── data # Folder to store raw and processed data
│ ├── database # Databases or local data storage
│ ├── processed # Preprocessed/cleaned data
│ └── raw # Raw data inputs
├── iac # Infrastructure as Code (IaC) scripts for cloud deployment
├── notebooks # Jupyter notebooks for exploratory data analysis, experiments
│ └── 00_example.ipynb # Example notebook
├── results # Folder to store the results of experiments and models
├── src # Source code of your project
│ ├── constants # Constants used in the project
│ ├── models # Machine learning model scripts
│ ├── pipelines # ML pipelines for preprocessing and modeling
│ ├── utils # Utility functions
│ └── execution.py # Main execution script
├── tests # Unit and integration tests
│ └── test_example.py # Example test file using pytest
├── .env # Environment variables file
├── .gitignore # Standard .gitignore file
├── .pre-commit-config.yaml # Configuration for pre-commit hooks
├── docker-compose.yaml # Docker Compose setup for MLflow, Langfuse, and related services
├── pyproject.toml # Configuration for formatting, linting, type-checking, and testing
├── README.md # Documentation for the project (you're reading it!)
└── uv.lock # Lock file for uv package manager
Feel free to contribute to this project by submitting a pull request.