# Benchmark a LiteRT model

The open-source LiteRT provides a tool to benchmark model execution
on hardware using delegates. This tool is available along with other
artifacts installed on the device.

This benchmarking tool measures and calculates statistics for the
following performance metrics:

- Initialization time
- Inference time of the Warm-up state
- Inference time of the Steady state
- Memory usage during initialization
- Overall memory usage

Next steps

- [Prerequisites to run LiteRT sample applications](https://docs.qualcomm.com/doc/80-80022-15B/topic/prerequisites-to-run-litert-sample-applications.html)
- [Benchmark LiteRT on CPU](https://docs.qualcomm.com/doc/80-80022-15B/topic/benchmark-litert-on-cpu.html)
- [Benchmark LiteRT on GPU](https://docs.qualcomm.com/doc/80-80022-15B/topic/benchmark-litert-on-gpu.html)
- [Benchmark Qualcomm AI Runtime using an external delegate](https://docs.qualcomm.com/doc/80-80022-15B/topic/benchmark-qairt-on-external-delegate.html)

Last Published: May 14, 2026

[Previous Topic
Export a Pytorch model to LiteRT](https://docs.qualcomm.com/bundle/publicresource/80-80022-15B/topics/export-pytorch-model-litert.md) [Next Topic
Prerequisites to run LiteRT sample applications](https://docs.qualcomm.com/bundle/publicresource/80-80022-15B/topics/prerequisites-to-run-litert-sample-applications.md)