Ansible Deployment

Relevant Source Files

ansible/README.md ansible/group_vars/all/vault.yml.example ansible/playbook.yml ansible/roles/templar/defaults/main.yml ansible/roles/templar/templates/miner.service.j2

This document describes how to deploy Templar using Ansible, focusing on automated provisioning of miner nodes across single or multiple GPUs. For Docker-based deployment, see Docker Deployment.

Overview

The Templar Ansible playbook automates the deployment of miner nodes by:

Cloning the repository
Setting up Python virtual environments with CUDA support
Installing required system packages
Configuring environment variables
Deploying miners as continuously running services (optionally with systemd)
Supporting multi-GPU configurations with separate instances per GPU

Deployment Architecture

flowchart TD
    subgraph "Control Machine"
        AP["ansible-playbook"]
        IV["inventory file"]
        VF["vault.yml (secrets)"]
        PB["playbook.yml"]
    end
    
    subgraph "Target Machine"
        subgraph GNode["GPU Node"]
            P0["Python Virtual Env"]
            M0["Miner Instance 0"]
            S0["Systemd Service 0"]
        end
        
        subgraph MNode["Multi-GPU Node"]
            P1["Python Virtual Env 1"]
            M1["Miner Instance 1"]
            S1["Systemd Service 1"]
            
            P2["Python Virtual Env 2"]
            M2["Miner Instance 2"]
            S2["Systemd Service 2"]
        end
    end
    
    AP --> |"SSH"| GNode
    AP --> |"SSH"| MNode
    IV --> AP
    VF --> AP
    PB --> AP
    
    P0 --> M0
    M0 --> S0
    
    P1 --> M1
    M1 --> S1
    
    P2 --> M2
    M2 --> S2

Sources: ansible/playbook.yml , ansible/README.md

Prerequisites

Before using the Ansible deployment, ensure you have:

On the control machine (where you run Ansible):
- Ansible installed
- A Unix-like environment (Linux/macOS) with SSH access to target hosts
- Python 3 and pip
On target hosts (where miners will run):
- Ubuntu (recommended: 22.04)
- CUDA support already installed
- SSH server configured and accessible
- Python installed (the playbook will install it if missing)
- At least one CUDA-enabled GPU

Sources: ansible/README.md:19-29

Configuration

Inventory File

The inventory file defines your target hosts and their GPU configurations:

[bittensor_subnet]
# Single GPU example
192.168.123.213 ansible_user=root ansible_port=12345 wallet_hotkeys='["miner"]' cuda_devices='["cuda"]'

# Multi-GPU example
192.168.222.111 ansible_user=root ansible_port=23456 wallet_hotkeys='["miner_1", "miner_2", "miner_3", "miner_4"]' cuda_devices='["cuda:0", "cuda:1", "cuda:2", "cuda:3"]'

Note: The wallet_hotkeys and cuda_devices arrays must have matching lengths to ensure proper pairing.

Sources: ansible/README.md:32-52

Environment Variables and Secrets

Sensitive configuration settings are managed via Ansible Vault. These include R2 storage credentials, wallet configuration, and API keys.

Creating a Vault File

Create directory structure:
Terminal window
```
mkdir -p group_vars/all/
```

Create an encrypted vault file:

ansible-vault create group_vars/all/vault.yml

Add your configuration in YAML format:

env_vars:
  WANDB_API_KEY: "your_wandb_key"
  INFLUXDB_TOKEN: "your_influxdb_token"
  R2_ACCOUNT_ID: "your_r2_account_id"
  R2_GRADIENTS_ACCOUNT_ID: "your_r2_gradients_account_id"
  # Other R2 credentials...
  WALLET_NAME: "default"
  NETWORK: "finney"
  NETUID: "3"

# Miner configuration
cuda_devices: ["cuda:0"]
wallet_hotkeys: ["miner_0"]

Sources: ansible/group_vars/all/vault.yml.example , ansible/README.md:53-87

Configuration Diagram

flowchart TD
    subgraph "Ansible Configuration"
        IV["inventory"]
        VF["vault.yml"]
        DF["defaults/main.yml"]
    end
    
    subgraph "Environment Variables"
        R2["R2 Credentials"]
        WB["WandB API Key"]
        IF["InfluxDB Token"]
        WC["Wallet Configuration"]
    end
    
    subgraph "Package Installation"
        AP["APT Packages"]
        PP["Pip Packages"]
        UP["UV Pip Packages"]
    end
    
    subgraph "Miner Configuration"
        BS["actual_batch_size"]
        NU["netuid"]
        SN["subtensor_network"]
        WN["wallet_name"]
        WH["wallet_hotkeys"]
        CD["cuda_devices"]
    end
    
    VF --> R2
    VF --> WB
    VF --> IF
    VF --> WC
    VF --> WH
    VF --> CD
    
    DF --> AP
    DF --> PP
    DF --> UP
    DF --> BS
    DF --> NU
    DF --> SN
    DF --> WN
    
    IV --> WH
    IV --> CD

Sources: ansible/roles/templar/defaults/main.yml , ansible/group_vars/all/vault.yml.example

Running the Deployment

Basic Usage

From the ansible directory, run:

ansible-playbook -i inventory playbook.yml --ask-vault-pass

The -i inventory option specifies your inventory file
The --ask-vault-pass flag prompts for your vault password (if using encrypted vault)

Sources: ansible/README.md:92-99

Overriding Default Variables

You can override default variables in several ways:

Via Command Line:

ansible-playbook -i inventory playbook.yml -e "actual_batch_size=5 wallet_name=default" --ask-vault-pass

In Group/Host Variables Files: Create files in host_vars/your_host.yml with your custom variables.
In Inventory: Set variables directly in your inventory file.

Sources: ansible/README.md:101-117

Multi-GPU Setup

The playbook automatically provisions separate instances for each GPU specified in your inventory:

Multi-GPU Deployment Process

flowchart TD
    subgraph "playbook.yml"
        PreCheck["Verify arrays match in length"]
        Loop["Loop through GPUs"]
        Role["Include templar role"]
    end
    
    subgraph "Per-GPU Instance 0"
        DIR0["Create directory\ntemplar-0"]
        ENV0["Configure .env"]
        SVC0["Create systemd service"]
    end
    
    subgraph "Per-GPU Instance 1"
        DIR1["Create directory\ntemplar-1"]
        ENV1["Configure .env"]
        SVC1["Create systemd service"]
    end
    
    PreCheck --> Loop
    Loop --> Role
    Role --> DIR0
    Role --> DIR1
    DIR0 --> ENV0
    DIR1 --> ENV1
    ENV0 --> SVC0
    ENV1 --> SVC1

For each GPU:

A separate clone of the repository is created in a unique directory (templar-0, templar-1, etc.)
Environment variables are configured with GPU-specific settings
A dedicated systemd service is created (when systemd is enabled)

The instance directory naming follows the pattern: templar-<GPU index> where the index is extracted from the cuda:X device name.

Sources: ansible/playbook.yml , ansible/README.md:133-153

Systemd Service Configuration

When use_systemd is set to true, the playbook will create systemd services for each miner instance. The service:

Runs the miner with the specified GPU and wallet
Automatically restarts on failure
Starts on system boot

The systemd service template parameters:

Parameter	Description
`templar_dir`	Working directory for the instance
`wallet_name`	Name of the wallet to use
`wallet_hotkey`	Hotkey identifier for the wallet
`device`	CUDA device to use (e.g., `cuda:0`)
`netuid`	Network UID for the subnet
`subtensor_network`	Subtensor network (e.g., `finney`)
`actual_batch_size`	Batch size for training

Sources: ansible/roles/templar/templates/miner.service.j2 , ansible/roles/templar/defaults/main.yml:35-36

Customization

Package Installation

You can customize package installation by modifying:

apt_packages: System packages to install via APT
additional_apt_packages: Additional system packages
essential_pip_packages: Global pip packages
additional_pip_packages: Additional global pip packages
additional_uv_pip_packages: Additional packages to install in the virtual environment with uv pip

Sources: ansible/roles/templar/defaults/main.yml:49-69 , ansible/README.md:120-124

Miner Parameters

Default miner parameters can be customized:

actual_batch_size: Batch size for training
netuid: Network UID for the subnet
subtensor_network: Subtensor network name
wallet_name: Name of the wallet to use
wallet_hotkeys: Array of wallet hotkeys to use
cuda_devices: Array of CUDA devices to use

Sources: ansible/roles/templar/defaults/main.yml:38-47 , ansible/README.md:126-127

Troubleshooting

Use the -vvv flag with ansible-playbook for verbose output when troubleshooting:
Terminal window
```
ansible-playbook -i inventory playbook.yml --ask-vault-pass -vvv
```
Ensure SSH keys and network connectivity are correctly configured
Check that the cuda_devices and wallet_hotkeys arrays have the same length
Verify that your vault file contains all required environment variables

Sources: ansible/README.md:168-171