Migration Guide: PyTorch to MIND

Side-by-side examples showing how PyTorch patterns map to MIND. Every comparison highlights what MIND catches at compile time that PyTorch only finds at runtime.

Tensor Creation

PyTorch

import torch

x = torch.tensor([[1.0, 2.0], [3.0, 4.0]])
w = torch.randn(2, 3)
b = torch.zeros(3)

MIND

// Shape is part of the type — verified at compile time
let x: Tensor<f32, 2, 2> = tensor([[1.0, 2.0], [3.0, 4.0]])
param w: Tensor<f32, 2, 3>
param b: Tensor<f32, 3>

In MIND, tensor shapes are compile-time types. A shape mismatch is a compile error, not a runtime crash.
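To make "shape as part of the type" concrete outside MIND, here is a toy plain-Python sketch that carries a declared shape with the value and validates it at construction time. This is illustrative only; the names are invented and it approximates at runtime what MIND's type checker does at compile time.

```python
# Toy sketch (plain Python, not MIND): attach the declared shape to the
# value and reject mismatched data up front, instead of failing later
# inside some tensor op.
class Tensor:
    def __init__(self, data, shape):
        self.data = data
        self.shape = tuple(shape)
        if self._infer_shape(data) != self.shape:
            raise TypeError(
                f"declared shape {self.shape} does not match data "
                f"{self._infer_shape(data)}"
            )

    @staticmethod
    def _infer_shape(data):
        # Walk the nested lists to recover the actual shape.
        shape = []
        while isinstance(data, list):
            shape.append(len(data))
            data = data[0]
        return tuple(shape)

x = Tensor([[1.0, 2.0], [3.0, 4.0]], shape=(2, 2))  # ok: data is 2x2
```

The difference is *when* the check runs: here it runs when the program executes; in MIND the equivalent check runs when the program compiles.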

Shape Safety

PyTorch

# PyTorch: crashes at RUNTIME
x = torch.randn(32, 784)
w = torch.randn(256, 784)  # Wrong shape!
y = x @ w  # RuntimeError: mat1 and mat2 shapes
           # cannot be multiplied (32x784 and 256x784)

MIND

// MIND: caught at COMPILE TIME
let x: Tensor<f32, ?, 784> = input()
param w: Tensor<f32, 256, 784>       // Wrong shape!
let y = matmul(x, w)
// COMPILE ERROR: matmul inner dimensions
// do not match: 784 != 256
// hint: did you mean Tensor<f32, 784, 256>?

Shape bugs never reach production: the compiler rejects them at build time.
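The rule being enforced is small. A hedged Python sketch of the inner-dimension check (illustrative names, not MIND's actual implementation):

```python
def check_matmul(a_shape, b_shape):
    """Reject a matmul whose inner dimensions disagree, before any data moves."""
    if a_shape[-1] != b_shape[0]:
        raise TypeError(
            f"matmul inner dimensions do not match: {a_shape[-1]} != {b_shape[0]}"
        )
    # Result shape: leading dims of a, trailing dims of b.
    return a_shape[:-1] + b_shape[1:]

check_matmul((32, 784), (784, 256))  # -> (32, 256)
```

Running this check over declared types at compile time, rather than over live tensors at runtime, is the whole trick.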

Autodiff

PyTorch

# Runtime autograd tape
x = torch.randn(32, 784, requires_grad=True)
y = model(x)
loss = criterion(y, target)
loss.backward()  # Builds + walks tape at runtime
optimizer.step()

MIND

// Compile-time gradient generation
@grad
fn train_step(x: Tensor<f32, ?, 784>,
              target: Tensor<f32, ?, 10>,
              lr: f32) -> Tensor<f32, 1> {
    let pred = forward(x)
    let loss = cross_entropy(pred, target)
    for param in parameters {
        param = sub(param, mul_scalar(grad(loss, param), lr))
    }
    loss
}

MIND's @grad compiles gradients at build time. The optimizer sees the full graph and fuses ops.
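For contrast, "builds + walks the tape at runtime" can be sketched in a few lines of plain Python. This is a toy scalar reverse-mode autodiff, not PyTorch's implementation; every name below is illustrative.

```python
# Toy scalar autodiff: each multiply records a backward closure as the
# program runs (the "tape"), then backpropagation walks those closures.
class Var:
    def __init__(self, value):
        self.value = value
        self.grad = 0.0
        self.grad_fn = None  # set when this Var is produced by an op

    def __mul__(self, other):
        out = Var(self.value * other.value)

        def backward(g):
            # Product rule: route the incoming gradient to both inputs.
            self.accumulate(g * other.value)
            other.accumulate(g * self.value)

        out.grad_fn = backward
        return out

    def accumulate(self, g):
        self.grad += g
        if self.grad_fn is not None:
            self.grad_fn(g)

x = Var(3.0)
y = x * x          # the tape entry (y.grad_fn) is created at runtime
y.accumulate(1.0)  # walk it backward: d(x*x)/dx = 2x = 6 at x = 3
```

Because the tape only exists while the program runs, a runtime autograd system cannot see the whole graph ahead of time; compiling gradients at build time is what lets MIND's optimizer fuse across the forward and backward passes.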

Linear Layer

PyTorch

import torch.nn as nn

class MLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(784, 256)
        self.fc2 = nn.Linear(256, 10)

    def forward(self, x):
        x = torch.relu(self.fc1(x))
        return self.fc2(x)

MIND

module mlp

param W1: Tensor<f32, 784, 256>
param b1: Tensor<f32, 256>
param W2: Tensor<f32, 256, 10>
param b2: Tensor<f32, 10>

fn forward(x: Tensor<f32, ?, 784>)
    -> Tensor<f32, ?, 10> {
    let h = relu(add(matmul(x, W1),
                     broadcast(b1, [?, 256])))
    add(matmul(h, W2), broadcast(b2, [?, 10]))
}

Explicit parameter tensors instead of opaque Module objects. Every shape is visible and verified.
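The same forward pass can be written against explicit parameters in plain Python (pure-Python lists for illustration; a real port would use MIND's tensor ops, and none of these helper names come from either framework):

```python
def matmul(x, w):
    # Naive (rows of x) x (columns of w) product over nested lists.
    return [[sum(xi * wi for xi, wi in zip(row, col)) for col in zip(*w)]
            for row in x]

def add_bias(x, b):
    # Broadcast the bias vector across every row, as broadcast(b, [?, n]) does.
    return [[v + bv for v, bv in zip(row, b)] for row in x]

def relu(x):
    return [[max(0.0, v) for v in row] for row in x]

def forward(x, W1, b1, W2, b2):
    # Mirrors the MIND forward above: every parameter is a visible argument.
    h = relu(add_bias(matmul(x, W1), b1))
    return add_bias(matmul(h, W2), b2)
```

With parameters passed explicitly, every shape in the pipeline is visible at the call site, which is exactly what lets a compiler (rather than a runtime) verify them.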

Model Export

PyTorch

# Multiple export paths, each with quirks
torch.onnx.export(model, dummy_input, "model.onnx")
traced = torch.jit.trace(model, dummy_input)
traced.save("model.pt")
# TensorRT, CoreML need separate conversion

MIND

# Single source, multiple targets
mindc build model.mind                 # CPU
mindc build model.mind --target cuda   # GPU
mindc build model.mind --target metal  # Metal
mindc build model.mind --export onnx   # ONNX
# Same semantics. No translation bugs.

No rewrite needed. The compiler handles target-specific code generation from the same source.

What maps, what doesn't

| Feature | Maps? | Notes |
|---|---|---|
| Tensor operations (matmul, conv2d, etc.) | Yes | Full coverage |
| Shape-checked tensors | Yes | Compile-time (vs runtime) |
| Autograd / autodiff | Yes | Compile-time @grad |
| Custom CUDA kernels | Yes | Via CUDA backend |
| Distributed training (DDP) | Yes | NCCL backend |
| Model serving | Yes | Built-in HTTP/gRPC |
| Dynamic computation graphs | No | MIND is static-graph |
| Eager execution / REPL | No | Compiled, not interpreted |
| Python ecosystem (pandas, sklearn) | No | MIND has its own stdlib |
| Pre-trained model zoo (HuggingFace) | No | Requires reimplementation |

Estimated migration effort

| Model complexity | Effort | MIND LOC |
|---|---|---|
| Simple model (linear, MLP) | 1-2 hours | 50-100 lines |
| Medium model (CNN, RNN) | 2-4 hours | 100-300 lines |
| Complex model (Transformer) | 4-8 hours | 300-600 lines |
| Full training pipeline | 1-2 days | 500-1000 lines |

Need help migrating?

Our pilot program includes hands-on migration support. We'll help you port your first model and verify compliance artifacts.

Start a pilot