# Guides

## Gen AI Builder

The Gen AI Builder is a Python API that automates step 2 of the typical LLM deployment workflow:
it takes a quantized ONNX model and compiles it into a `GenAIContainer` ready for on-device
inference. The guides below cover configuration options, advanced features, and migration from
notebook-based workflows.

- Gen AI Builder Overview
- Configuring the Gen AI Builder
- Advanced Features
- Migrating from Notebook Workflows

Last Published: May 08, 2026
