Docker Deployment

Relevant Source Files

.github/workflows/docker.yml docker/Dockerfile docker/compose.yml docker/docker-compose-test.yml scripts/entrypoint.sh

This document details how to deploy Templar nodes (miners and validators) using Docker containers. Docker provides a consistent, isolated runtime environment for Templar, simplifying deployment across different machines while ensuring all dependencies are properly configured.

For information about deploying with Ansible, see Ansible Deployment. For setting up a local Subtensor chain for development, see Local Chain Setup.

Prerequisites

Before deploying Templar using Docker, ensure you have:

Docker and Docker Compose installed on your system
NVIDIA drivers installed (for GPU support)
NVIDIA Container Toolkit installed and configured
A Bittensor wallet created and funded
Cloudflare R2 storage credentials configured
Weights & Biases API key (for logging and monitoring)

Docker Architecture Overview

Templar’s Docker deployment architecture consists of several components working together:

flowchart TD
    subgraph "Host Machine"
        GPU["NVIDIA GPU"]
        FS["File System"]
        DockD["Docker Daemon"]
    end
    
    subgraph "Docker Container"
        TE["Templar Entrypoint"]
        NNT["Node Type\n(Miner/Validator)"]
        PT["PyTorch + CUDA"]
        BT["Bittensor Client"]
    end
    
    subgraph "External Services"
        R2["Cloudflare R2 Storage"]
        BNET["Bittensor Network"]
        WB["Weights & Biases"]
    end
    
    GPU <--> DockD
    FS <--> DockD
    DockD <--> TE
    
    TE --> NNT
    NNT <--> PT
    NNT <--> BT
    
    BT <--> BNET
    NNT <--> R2
    NNT --> WB

Sources: docker/Dockerfile , scripts/entrypoint.sh

Docker Image

Templar uses a custom Docker image based on NVIDIA’s CUDA runtime environment:

flowchart TD
    subgraph "Templar Docker Image"
        BASE["nvidia/cuda:12.6.0-runtime-ubuntu22.04"]
        PY["Python 3 + Dependencies"]
        UV["uv Package Manager"]
        TCODE["Templar Codebase"]
        ENTRYPT["Entrypoint Script"]
    end
    
    BASE --> PY
    PY --> UV
    UV --> TCODE
    TCODE --> ENTRYPT
    
    VOL1["Volume: /root/.bittensor/wallets"]
    VOL2["Volume: /app/logs"]

The Docker image is automatically built and published to GitHub Container Registry via GitHub Actions when releases are made or manually triggered.

Sources: docker/Dockerfile , .github/workflows/docker.yml

Deployment Options

Standard Deployment with Docker Compose

The standard deployment uses compose.yml to run a single Templar node (miner or validator) with the following components:

The Templar node container
Watchtower for automatic updates

# Clone the repository and navigate to the docker directory
git clone https://github.com/tplr-ai/templar.git
cd templar/docker

# Create an .env file with your configuration
# Start the container
docker-compose up -d

Test Environment Deployment

For testing or development, the docker-compose-test.yml file provides a multi-node setup with:

Two miners (on different GPUs)
One validator (on a separate GPU)
A shared Docker network

# Navigate to the docker directory
cd templar/docker

# Start the test environment
docker-compose -f docker-compose-test.yml up -d

Sources: docker/compose.yml , docker/docker-compose-test.yml

Configuration

Environment Variables

The Templar Docker containers are configured via environment variables, which can be set in a .env file or passed directly to Docker Compose.

Variable	Description	Default Value	Required
`NODE_TYPE`	Node type (miner or validator)	miner	Yes
`WALLET_NAME`	Bittensor wallet name		Yes
`WALLET_HOTKEY`	Bittensor wallet hotkey		Yes
`CUDA_DEVICE`	CUDA device to use	cuda:0	Yes
`NETWORK`	Bittensor network	finney	Yes
`DEBUG`	Enable debug mode	false	No
`NETUID`	Network UID	268	Yes
`WANDB_API_KEY`	Weights & Biases API key		Yes
`R2_*`	Cloudflare R2 configuration variables		Yes
`GITHUB_USER`	GitHub username for Watchtower		For auto-updates
`GITHUB_TOKEN`	GitHub token for Watchtower		For auto-updates

GPU Configuration

NVIDIA GPU access is configured in the Docker Compose file:

deploy:
  resources:
    reservations:
      devices:
        - driver: nvidia
          device_ids: [ '0', '1', '2' ]
          capabilities: [ gpu ]

This configuration allows the container to access GPUs 0, 1, and 2. Modify the device_ids array to select specific GPUs.

Sources: docker/compose.yml:36-42 , docker/docker-compose-test.yml:31-37

Container Lifecycle

Container Startup Process

The container startup process is managed by the entrypoint script:

flowchart TD
    START["Container Start"] --> CHECK["Check Required Environment Variables"]
    CHECK --> CUDA["Check CUDA Availability"]
    CUDA --> WANDB["Login to Weights & Biases"]
    WANDB --> TYPE{"NODE_TYPE?"}
    
    TYPE -- "miner" --> MINER["Start Miner Process"]
    TYPE -- "validator" --> VAL["Start Validator Process"]
    TYPE -- "invalid" --> ERR["Error: Invalid NODE_TYPE"]
    
    ERR --> EXIT["Container Exit"]

The entrypoint script validates the environment, checks dependencies, and starts the appropriate node type.

Sources: scripts/entrypoint.sh

Persistence and Volumes

Two Docker volumes are used for persistence:

/root/.bittensor/wallets - Stores Bittensor wallet information
/app/logs - Stores application logs

These volumes persist data even when the container is restarted or updated.

Automatic Updates with Watchtower

The standard Docker Compose configuration includes a Watchtower container that automatically checks for updated Templar Docker images and deploys them:

flowchart LR
    subgraph "Docker Environment"
        W["Watchtower Container"]
        T["Templar Container"]
    end
    
    subgraph "GitHub Container Registry"
        I["Templar Docker Images"]
    end
    
    W -- "1. Check for updates" --> I
    W -- "2. Pull new image" --> I
    W -- "3. Stop container" --> T
    W -- "4. Start new container" --> T

Watchtower checks for updates every 30 minutes and automatically updates containers with the com.centurylinklabs.watchtower.enable=true label.

Sources: docker/compose.yml:46-57

Troubleshooting

Common Issues

CUDA not available - Ensure NVIDIA drivers and NVIDIA Container Toolkit are properly installed.
Missing environment variables - Check your .env file or Docker Compose environment configuration.
Wallet access issues - Verify your wallet volume is correctly mounted and permissions are set.
R2 connectivity problems - Verify your R2 credentials and check the container logs for connection errors.

Viewing Logs

# View logs for Templar node
docker logs templar-miner-YOUR_HOTKEY

# Follow logs in real-time
docker logs -f templar-miner-YOUR_HOTKEY

Multi-Node Setup

For running multiple nodes on the same machine:

Create a copy of the compose.yml file for each node
Give each container a unique name
Assign different GPUs to each container
Use different wallet hotkeys for each node

flowchart TD
    subgraph "Host Machine"
        GPU0["GPU 0"]
        GPU1["GPU 1"]
        GPU2["GPU 2"]
    end
    
    subgraph "Docker Containers"
        M1["Miner 1\nWallet: key1\nGPU: 0"]
        M2["Miner 2\nWallet: key2\nGPU: 1"]
        V1["Validator\nWallet: key3\nGPU: 2"]
    end
    
    GPU0 --- M1
    GPU1 --- M2
    GPU2 --- V1
    
    subgraph "Shared Resources"
        R2["R2 Storage"]
        BT["Bittensor Network"]
    end
    
    M1 --- R2
    M2 --- R2
    V1 --- R2
    
    M1 --- BT
    M2 --- BT
    V1 --- BT

The test Docker Compose file demonstrates this configuration pattern.

Sources: docker/docker-compose-test.yml