# Run a LiteRT model on CPU

The XNNPACK delegate uses the XNNPACK library to speed up LiteRT
models efficiently on CPUs. XNNPACK is an open-source library from
Google, which does the following:

- Provides an optimized implementation of neural network operators for
Arm CPUs
- Uses low-level CPU instructions, such as the Arm^®^
Neon^™^ instruction set, to optimize operators for efficient
execution

The XNNPACK delegate can run models in both 32-bit floating-point and
int8 formats. For more information, see [XNNPACK back-end for TensorFlow Lite](https://github.com/tensorflow/tensorflow/blob/master/tensorflow/lite/delegates/xnnpack/README.md).

To run a LiteRT model using the XNNPACK delegate, see Deploy LiteRT as a Native application.

Last Published: May 14, 2026

Previous Topic
 
Supported LiteRT runtimes Next Topic

Run LiteRT Model on GPU