MiniTensor API Reference¶

This document consolidates the public MiniTensor surface area available through minitensor and its submodules, using the Python bindings and the Rust engine as the source of truth. It is intentionally exhaustive and meant to complement existing guides such as custom_operations.md, plugin_system.md, and performance.md.

1) Top-level module (`minitensor`)¶

Core exports¶

MiniTensor’s top-level module re-exports the Rust-backed core API and a handful of convenience aliases.

Export	Description
`Tensor` / `tensor`	Core tensor type (constructor + alias).
`Device` / `device`	Device handle type (CPU/GPU).
`cpu`, `cuda`	Convenience constructors for CPU/GPU devices.
`functional`	Functional API module (stateless ops).
`nn`	Neural network modules and losses.
`optim`	Optimizers.
`numpy_compat`	NumPy-style helpers (if built).
`plugins`	Plugin registry and utilities (if built).
`serialization`	Model serialization utilities (if built).
`minitensor.tensor`	Compatibility module containing tensor constructors and dtype helpers.

Versioning¶

__version__ reflects the backend version exposed by the Rust core (if available) or a default fallback version.
__version_tuple__ mirrors the structured version tuple.

Global configuration & graph controls¶

Function	Purpose
`get_default_dtype()`	Return the global default dtype string.
`set_default_dtype(dtype)`	Set the global default dtype.
`default_dtype(dtype)`	Context manager for temporary dtype overrides.
`manual_seed(seed)`	Seed the RNG used by random ops.
`get_gradient(tensor)`	Access a tensor’s gradient in the global graph.
`clear_autograd_graph()`	Clear the global autograd graph.
`is_autograd_graph_consumed()`	Inspect whether a graph has been consumed.
`mark_autograd_graph_consumed()`	Mark the current graph as consumed.
`available_submodules()`	Return availability of optional submodules.
`list_public_api()`	Return public API symbol lists by module.
`api_summary()`	Return version and API counts by module.
`search_api(query, module=None)`	Search available symbols by name.
`describe_api(symbol)`	Return a one-line description for a symbol.
`help()`	Render a formatted MiniTensor API reference.
`broadcast_shapes(*shapes)`	Compute the NumPy/PyTorch-style broadcast result for shape-like inputs without constructing tensors.
`can_broadcast(*shapes)`	Return whether shape-like inputs are broadcast-compatible.

Shape compatibility helpers¶

broadcast_shapes(*shapes) computes the shape that would result from NumPy/PyTorch-style broadcasting without creating input tensors. Each argument may be a non-negative integer-like scalar dimension (including objects with __index__, such as NumPy integer scalars) or an iterable shape such as a Python tuple/list or tensor.shape. Scalar tensor shapes are represented by an empty iterable, for example broadcast_shapes((), (2, 3)) == (2, 3).

Validation and edge cases:

Boolean dimensions are rejected even though Python bool is integer-like.
Negative dimensions raise ValueError; non-integer dimensions raise TypeError.
Zero-sized dimensions follow NumPy broadcasting rules: they can broadcast with missing dimensions or 1, but not with another non-one positive size.
Incompatible shapes raise ValueError. Use can_broadcast(*shapes) when a boolean compatibility check is preferable to exception handling.

Example:

import minitensor as mt

shape = mt.broadcast_shapes((5, 1, 4), (1, 3, 1), (3, 4))
assert shape == (5, 3, 4)
assert mt.broadcast_shapes(mt.zeros(2, 1, 4).shape, (3, 4)) == (2, 3, 4)
assert mt.can_broadcast((1, 3), (2, 3))
assert not mt.can_broadcast((2, 3), (4, 3))

Compatibility tensor module¶

minitensor.tensor is a lightweight compatibility module populated by the Python package. It exposes Tensor, the top-level tensor creation helpers, get_default_dtype(), set_default_dtype(), manual_seed(), and the default_dtype(...) context manager. Prefer top-level imports in new examples, but keep this module in mind when maintaining older code that imports from minitensor.tensor.

Custom operations (Python API)¶

The custom-ops system is exposed at the top level:

execute_custom_op_py(name, inputs)
is_custom_op_registered_py(name)
list_custom_ops_py()
register_example_custom_ops()
unregister_custom_op_py(name)

2) Tensor creation API¶

Every creation helper is available as either mt.<name>(...) or Tensor.<name>(...).

Random + distribution-based¶

rand, rand_like
randn, randn_like
truncated_normal, truncated_normal_like
uniform, uniform_like
randint, randint_like
randperm

Initialization schemes¶

xavier_uniform, xavier_uniform_like
xavier_normal, xavier_normal_like
he_uniform, he_uniform_like
he_normal, he_normal_like
lecun_uniform, lecun_uniform_like
lecun_normal, lecun_normal_like

Deterministic / structured¶

zeros, zeros_like
ones, ones_like
empty, empty_like
full, full_like
eye
arange
linspace
logspace

NumPy interop¶

from_numpy(array)
from_numpy_shared(array)
as_tensor(obj, dtype=None, requires_grad=None, copy=False)

3) Tensor properties & conversion helpers¶

Frequently used tensor attributes:

tensor.shape / tensor.ndim
tensor.dtype
tensor.device
tensor.requires_grad

Conversion helpers:

tensor.numpy() → NumPy array
tensor.item() → Python scalar (for 0-d tensors)
tensor.tolist() → Python list
tensor.astype(dtype) → dtype conversion

4) Tensor instance methods¶

The following instance methods are exercised by the test suite and are available on Tensor objects (many also have functional/top-level equivalents):

Shape and layout¶

reshape, view, transpose, permute
movedim, moveaxis, swapaxes, swapdims
squeeze, unsqueeze, expand
flatten, ravel

Indexing & reordering¶

index_select, gather, narrow
flip, roll

Linear algebra & matrix ops¶

dot, bmm
solve
diagonal, trace
triu, tril

Reductions, statistics, and equality¶

sum, mean, median, quantile, nanquantile
std(dim=None, unbiased=True, keepdim=False)
var(dim=None, unbiased=True, keepdim=False)
nansum, nanmean, nanmax, nanmin
logsumexp
array_equal(other)
allclose(other, rtol=1e-5, atol=1e-8, equal_nan=False)

std and var accept the same dimension forms as multi-axis reductions such as sum and mean: None reduces all axes, an integer reduces one axis, and a sequence such as a tuple/list reduces multiple axes. Negative axes are normalized, duplicate axes are treated as a single axis, and invalid axes raise IndexError. keepdim=True preserves reduced axes with length one; otherwise those axes are removed after the reduction. unbiased=True applies the sample variance correction (N / (N - 1)) over the total number of reduced elements, and reductions with one or fewer samples return NaN rather than emitting a Python warning.

Example:

import minitensor as mt

x = mt.arange(24, dtype="float32").reshape(2, 3, 4)
channel_var = x.var(dim=(1, 2), unbiased=False, keepdim=True)
row_std = x.std(dim=-1, unbiased=False)

assert channel_var.shape == (2, 1, 1)
assert row_std.shape == (2, 3)

Elementwise math & activation¶

softmax, log_softmax
softsign, rsqrt, reciprocal, sign
clip, clamp, clamp_min, clamp_max
round, floor, ceil
sin, cos, tan
asin, acos, atan
sinh, cosh, asinh, acosh, atanh
softplus, gelu, elu, selu, silu
hardshrink

Normalization¶

layer_norm(shape, weight=None, bias=None, eps=1e-5)

Autograd + in-place¶

backward() to trigger gradient computation.
fill_(value) for in-place fills.

5) Functional API (`minitensor.functional`)¶

MiniTensor provides stateless functional variants that mirror Tensor methods.

Forwarders exported at top level¶

Each of the following names is accessible from:

minitensor.<name>
minitensor.functional.<name>

cat, stack, split, chunk, index_select, gather, narrow, topk, sort, argsort,
median, quantile, nanquantile, nansum, nanmean, nanmax, nanmin, nan_to_num,
logsumexp, softmax, log_softmax, masked_softmax, masked_log_softmax, softsign, rsqrt,
reciprocal, sign, reshape, view, triu, tril, diagonal, trace, solve, flatten,
ravel, transpose, permute, movedim, moveaxis, swapaxes, swapdims, squeeze,
unsqueeze, expand, repeat, repeat_interleave, flip, roll, clip, clamp,
clamp_min, clamp_max, round, floor, ceil, sin, cos, tan, asin, acos, atan,
sinh, cosh, asinh, acosh, atanh, array_equal, allclose, where, one_hot, masked_fill

Equality helpers¶

array_equal(input, other) and allclose(input, other, rtol=1e-5, atol=1e-8, equal_nan=False) are available as both top-level helpers and functional helpers. Both accept MiniTensor tensors and tensor-like Python inputs (such as Python scalars/sequences and NumPy arrays) through the normal Python-to-tensor conversion path.

Behavior and validation:

array_equal requires equal shapes, promotes compatible numeric dtypes, and returns a Python bool indicating exact element equality after promotion.
allclose promotes compatible numeric dtypes and applies abs(a - b) <= atol + rtol * abs(b) for finite unequal floating-point values.
Exact equality is accepted before tolerance checks, so signed zeros and matching infinities compare as close. Opposite infinities and finite/non-finite mismatches compare as not close.
NaNs compare as not close unless equal_nan=True, in which case paired NaNs at the same positions are accepted.
rtol and atol must be finite, non-negative numbers.

Example:

import minitensor as mt

assert mt.array_equal([1, 2], mt.tensor([1.0, 2.0], dtype="float32"))
assert mt.allclose([0.0, float("inf")], [-0.0, float("inf")])
assert mt.allclose([float("nan")], [float("nan")], equal_nan=True)

One-hot encoding¶

one_hot(input, num_classes=None, dtype="float32") converts integer or boolean labels to a one-hot tensor whose final dimension is the class dimension. The helper is available as both minitensor.one_hot(...) and minitensor.functional.one_hot(...).

Supported label inputs:

Tensor values with int32, int64, or bool dtype.
Python integer scalars and nested Python integer/bool sequences.
NumPy integer/bool arrays through the existing Python-to-tensor conversion path.

Behavior and validation:

If num_classes is omitted, MiniTensor infers it as max(label) + 1; empty inputs therefore require an explicit num_classes.
num_classes must be non-negative when provided, and every label must be in [0, num_classes).
Negative labels and floating-point label tensors/scalars are rejected.
dtype controls the encoded output dtype and accepts the standard MiniTensor dtype strings: float32, float64, int32, int64, and bool.

Example:

import minitensor as mt

labels = mt.Tensor([[0, 2], [1, 2]], dtype="int64")
encoded = mt.one_hot(labels, dtype="int32")
assert encoded.shape_vec() == [2, 2, 3]

Tensor-centric math helpers¶

The functional namespace also exposes:

dot
bmm

Cross-pollination with `nn`¶

Lower-case callable symbols from minitensor.nn are mirrored into minitensor.functional for convenience (for example, activation functions that have a functional signature).

6) Neural network module (`minitensor.nn`)¶

Layers & containers¶

Module (base class)
DenseLayer
Conv2d
BatchNorm1d
BatchNorm2d
Dropout, Dropout2d
Sequential (container of modules)

Activations¶

ReLU
LeakyReLU
Sigmoid
Tanh
GELU
ELU
Softmax

Losses¶

MSELoss
MAELoss
HuberLoss
LogCoshLoss
SmoothL1Loss
CrossEntropyLoss
BCELoss
FocalLoss

Common utilities¶

layer.parameters() returns tensors for optimizers.
layer.zero_grad() clears gradients for trainable tensors.

7) Optimizers (`minitensor.optim`)¶

Built-in optimizers¶

SGD
Adam
AdamW
RMSprop

Base optimizer API¶

All optimizer classes share a common interface:

step() — apply parameter updates and clear the global autograd graph.
zero_grad(set_to_none: bool = False) — reset gradients.
lr property — read/write learning rate.

8) NumPy compatibility module (`minitensor.numpy_compat`)¶

Array creation¶

asarray(data, dtype=None, requires_grad=False)
zeros_like, ones_like, empty_like, full_like

Array manipulation¶

concatenate, stack, vstack, hstack
split, hsplit, vsplit

Math & comparisons¶

dot, matmul, cross, where
allclose(a, b, rtol=None, atol=None, equal_nan=False), array_equal(a, b)

Statistics¶

mean, nanmean, std, var, prod, sum, nansum
max, min, nanmax, nanmin

numpy_compat.std(tensor, axis=None, keepdims=None, ddof=None) and numpy_compat.var(tensor, axis=None, keepdims=None, ddof=None) accept a single integer axis or None; ddof=0 maps to population statistics and ddof=1 maps to unbiased sample statistics. Values outside 0 and 1 are rejected because the current tensor engine exposes a boolean unbiased flag rather than arbitrary correction values.

9) Serialization (`minitensor.serialization`)¶

Core types¶

ModelVersion — semantic version for serialized models.
ModelMetadata — name, description, architecture, shapes, custom metadata.
SerializationFormat — json(), binary(), messagepack().
SerializedModel — metadata + state dict.
StateDict — tensor parameters/buffers.
DeploymentModel — compact model format for inference.
ModelSerializer — save() / load() helpers.

Convenience functions¶

save_model(model, path, format=None)
load_model(path, format=None)

10) Plugin system (`minitensor.plugins`)¶

Versioning and metadata¶

VersionInfo (parse, current, compatibility checks)
PluginInfo (name, version, author, min/max supported versions)

Python-side plugins¶

CustomPlugin — plugin object with init/cleanup/custom-op callbacks.
PluginRegistry — register/unregister/list Python plugins.
CustomLayer — define custom layers in Python.
PluginBuilder — fluent builder for plugin metadata.

Dynamic loading (if compiled)¶

load_plugin(path)
unload_plugin(name)
list_plugins()
get_plugin_info(name)
is_plugin_loaded(name)

11) Debug utilities (`minitensor._core.debug`)¶

The compiled extension registers a debug submodule for backend diagnostics. The high-level Python package does not re-export it as minitensor.debug; access it through the core extension when needed by advanced diagnostics or tests. Debug APIs are intended for development and troubleshooting rather than stable end-user workflows.

12) Custom operations¶

MiniTensor supports custom ops in both Rust and Python. Refer to docs/custom_operations.md for:

The CustomOp trait and builder pattern.
Python registration and execution (execute_custom_op_py, etc.).
Example custom ops (Swish, GELU, power).

13) Notes on devices & backends¶

The core engine supports CPU execution and can be compiled with CUDA, Metal, or OpenCL backends where applicable. Device selection flows through the Device API and tensor creation functions.

14) Documentation maintenance¶

When public functionality changes, update this reference together with the focused guide for that area. The runtime helpers list_public_api(), search_api(...), describe_api(...), and help() are useful for auditing the compiled API after rebuilding the extension.

15) Where to go next¶

docs/index.md — documentation map and maintenance checklist.
docs/development.md — contributor setup, validation, and PR workflow.
docs/custom_operations.md — custom ops and autograd integration.
docs/plugin_system.md — plugin registry and compatibility handling.
docs/performance.md — performance tuning and profiling.
examples/ and examples/notebooks/ — end-to-end usage patterns.

MiniTensor API Reference¶

1) Top-level module (minitensor)¶

Core exports¶

Versioning¶

Global configuration & graph controls¶

Shape compatibility helpers¶

Compatibility tensor module¶

Custom operations (Python API)¶

2) Tensor creation API¶

Random + distribution-based¶

Initialization schemes¶

Deterministic / structured¶

NumPy interop¶

3) Tensor properties & conversion helpers¶

4) Tensor instance methods¶

Shape and layout¶

Indexing & reordering¶

Linear algebra & matrix ops¶

Reductions, statistics, and equality¶

Elementwise math & activation¶

Normalization¶

Autograd + in-place¶

5) Functional API (minitensor.functional)¶

Forwarders exported at top level¶

Equality helpers¶

One-hot encoding¶

Tensor-centric math helpers¶

Cross-pollination with nn¶

6) Neural network module (minitensor.nn)¶

Layers & containers¶

Activations¶

Losses¶

Common utilities¶

7) Optimizers (minitensor.optim)¶

Built-in optimizers¶

Base optimizer API¶

8) NumPy compatibility module (minitensor.numpy_compat)¶

Array creation¶

Array manipulation¶

Math & comparisons¶

Statistics¶

9) Serialization (minitensor.serialization)¶

Core types¶

Convenience functions¶

10) Plugin system (minitensor.plugins)¶

Versioning and metadata¶

Python-side plugins¶

Dynamic loading (if compiled)¶

11) Debug utilities (minitensor._core.debug)¶

12) Custom operations¶

13) Notes on devices & backends¶

14) Documentation maintenance¶

15) Where to go next¶

1) Top-level module (`minitensor`)¶

5) Functional API (`minitensor.functional`)¶

Cross-pollination with `nn`¶

6) Neural network module (`minitensor.nn`)¶

7) Optimizers (`minitensor.optim`)¶

8) NumPy compatibility module (`minitensor.numpy_compat`)¶

9) Serialization (`minitensor.serialization`)¶

10) Plugin system (`minitensor.plugins`)¶

11) Debug utilities (`minitensor._core.debug`)¶