# QNN HTP

The following tutorial demonstrates running the Llama 2 7B model on the QNN HTP backend using
[genie-t2t-run](https://docs.qualcomm.com/doc/80-63442-10/topic/genie-t2t-run.html#genie-t2t-run).

Note

This section assumes that the QNN HTP context binaries have been obtained via the QNN workflow.

An example backend\_ext\_config.json can be found at
`${QNN_SDK_ROOT}/examples/Genie/configs/htp_backend_ext_config.json`.

For more information on the QNN HTP backend extension configurations options, please refer to
`${QNN_SDK_ROOT}/docs/QNN/general/htp/htp_backend.html`.

Please select your target platform:

- [Android](https://docs.qualcomm.com/doc/80-63442-10/topic/android.html)
- [Windows](https://docs.qualcomm.com/doc/80-63442-10/topic/windows_windows.html)

Last Published: Jun 04, 2026

[Previous Topic
Llama 2 7B](https://docs.qualcomm.com/bundle/publicresource/80-63442-10/topics/llama-2-7b.md) [Next Topic
Android](https://docs.qualcomm.com/bundle/publicresource/80-63442-10/topics/android.md)