zonnx

This tool is a standalone command-line utility responsible for converting machine learning models from the ONNX format to GGUF. It also provides functionality to download ONNX models directly from HuggingFace Hub.

Features

ONNX → GGUF conversion (fast, deterministic): Produce portable GGUF files compatible with the zerfoo runtime and llama.cpp.
Model inspection (ONNX and GGUF): Introspect model metadata, IOs, nodes and tensor stats. Output is JSON-friendly; --pretty planned.
HuggingFace integration: Download ONNX models and common tokenizer files in one step.
CGO-free builds: Ships as a single static binary. Easy to distribute and run in minimal containers.
Clean separation of concerns: Converter lives outside the training/runtime stack. No github.com/zerfoo/zerfoo imports in conversion code.

Architectural Principles

zonnx is designed as a standalone model converter, strictly decoupled from the zerfoo runtime. Its primary responsibility is to transform ONNX models into GGUF, which serves as the universal model format for zerfoo.

Key principles:

GGUF-Only Emission: zonnx emits only GGUF files. It does not contain any zerfoo runtime code, graph building logic, or direct dependencies on zerfoo's internal components (e.g., compute, graph, model, numeric, tensor).
Explicit Schema: The GGUF output captures all necessary model attributes and shapes directly, without relying on runtime inference of ONNX rules.
No zerfoo Imports: The zonnx codebase (outside of documentation, tests, and examples) must not import any packages from github.com/zerfoo/zerfoo.
No ONNX in zerfoo: Conversely, the zerfoo runtime must not contain any ONNX-specific code or dependencies. It consumes only GGUF models.

This strict separation ensures modularity, independent development, and maintainability of both the converter and the runtime.

Usage

Installation

Install the CLI directly:

go install github.com/zerfoo/zonnx/cmd/zonnx@latest

Or build from source at the repo root:

go build -o zonnx ./cmd/zonnx

Notes:

Requires Go specified in go.mod (currently go 1.25).
CGO is not required; the module is tested to build with CGO_ENABLED=0.

Quickstart

# 1) Download an ONNX model and tokenizer files from HuggingFace
zonnx download --model google/gemma-2-2b-it --output ./models

# 2) Convert ONNX → GGUF (flags must come before positional args)
zonnx convert -output ./models/model.gguf ./models/model.onnx

# 3) Inspect either format (flags before input)
zonnx inspect -pretty ./models/model.onnx
zonnx inspect -pretty ./models/model.gguf

Commands

`download`

Downloads an ONNX model and its associated tokenizer files from HuggingFace Hub.

Syntax:

./zonnx download --model <huggingface-model-id> [--output <output-directory>] [--api-key <your-api-key>]

Arguments:

--model <huggingface-model-id>: (Required) The ID of the HuggingFace model to download (e.g., openai/whisper-tiny.en).
--output <output-directory>: (Optional) The directory where the model and tokenizer files will be saved. Defaults to the current directory (.).
--api-key <your-api-key>: (Optional) Your HuggingFace API key for authenticated downloads.

API Key Configuration:

For models that require authentication (e.g., private models or models with restricted access), you can provide your HuggingFace API key in one of two ways:

Using the --api-key flag: Pass your API key directly as a command-line argument:
```
./zonnx download --model google/gemma-2-2b-it --api-key hf_YOUR_API_KEY
```
Replace hf_YOUR_API_KEY with your actual HuggingFace API key.
Using the HF_API_KEY environment variable: Set the HF_API_KEY environment variable before running the zonnx command:
```
export HF_API_KEY=hf_YOUR_API_KEY
./zonnx download --model google/gemma-2-2b-it
```
The --api-key flag takes precedence over the HF_API_KEY environment variable if both are provided.

When a model is downloaded, zonnx will automatically attempt to identify and download common tokenizer-related files (like tokenizer.json, vocab.txt, etc.) found in the same HuggingFace repository. These files will be saved alongside the ONNX model in the specified output directory.

`import`

Import ONNX and emit GGUF. This is a future-friendly alias for convert.

Status: planned; use convert today.

`export`

Export GGUF back to ONNX.

Status: planned; coming soon.

`inspect`

Inspect either ONNX or GGUF. Type can be inferred from extension or set explicitly.

Syntax:

zonnx inspect [-type onnx|gguf] [-pretty] <input-file>

Examples:

zonnx inspect -pretty ./path/to/model.onnx
zonnx inspect -type gguf -pretty ./path/to/model.gguf

Notes:

--pretty human-friendly printing is planned; JSON schema output is the target.

`convert`

Convert ONNX → GGUF. This is the primary conversion command.

Syntax:

zonnx convert [-output <output-file.gguf>] <input-file.onnx>

Example:

zonnx convert -output ./models/encoder.gguf ./models/encoder.onnx

Notes:

Flags must appear before the first positional argument when using Go's standard flag package.
The convert command accepts an alias --output in addition to -output.
If no output is specified, the default is <input-dir>/<input-base>.gguf.
Parent directories for the output path are created automatically.

Why GGUF?

GGUF is a compact, mmap-friendly model format designed for fast loading and efficient inference. Benefits:

Explicit shapes and attributes; no reliance on ONNX runtime semantics at load time.
Compatible with llama.cpp and the broader ecosystem.
Portable files, amenable to signing and caching.
Decouples model authoring/conversion from runtime execution.

Development

Test: make test (runs go test ./...)
Lint: make lint (runs golangci-lint run)
Lint (auto-fix): make lint-fix
Format: make format (gofmt + goimports + gofumpt if available)

The codebase is intentionally free of github.com/zerfoo/zerfoo imports in conversion paths to preserve a strict boundary between conversion and runtime.

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
.github		.github
cmd/zonnx		cmd/zonnx
docs		docs
internal/onnx		internal/onnx
pkg		pkg
testdata		testdata
.gitignore		.gitignore
.goreleaser.yaml		.goreleaser.yaml
.goreleaser.yml		.goreleaser.yml
.release-please-manifest.json		.release-please-manifest.json
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
nocgo_test.go		nocgo_test.go
release-please-config.json		release-please-config.json
test_convert.go		test_convert.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

zonnx

Features

Architectural Principles

Usage

Installation

Quickstart

Commands

`download`

`import`

`export`

`inspect`

`convert`

Why GGUF?

Development

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

zonnx

Features

Architectural Principles

Usage

Installation

Quickstart

Commands

download

import

export

inspect

convert

Why GGUF?

Development

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`download`

`import`

`export`

`inspect`

`convert`

Packages