# INT4 encodings for weights

We have added support for int4 weight encodings for several ops and plan to support few more in the next releases.
The details of the ops which support int4 are given in the table below.
There are few things to keep in mind when using int4:

1. Weights should be static/ const data.
2. The weight values must satisfy the following relation:

> 
> 
> \[-8 &lt;= INT4\\_Weight\\_Value-Offset &lt;= 7\]

3. Software supports both per-tensor and per-channel quantization for INT4 for these ops

Following is a table listing Ops and Op configurations which will have INT4 benefits

| Op name | Op configurations having power benefits | Op configurations having IPS (latency) benefits. (Note latency benefits arise solely from the lower transfer of data to/fro VTCM due to smaller weights) |
| --- | --- | --- |
| Conv2D (1x1 only) | **out\_channels &gt; 32** and **stride 1x1** and **filter\_height = 1** and **filter\_width = 1** | **out\_channels &gt; 32** and **stride 1x1** and **filter\_height = 1** and **filter\_width = 1** |
| Fully Connected | **out\_channels &gt; 32** | **out\_channels &gt; 32** **(\*See Note above)** |
| MatMul | **out\_channels &gt; 32** | **out\_channels &gt; 32** **(\*See Note above)** |

Last Published: Jun 04, 2026

[Previous Topic
Quantized 16 bit activations (A16) vs FP16 and Activation Fusion: Performance and power differences](https://docs.qualcomm.com/bundle/publicresource/80-63442-10/topics/htp_guidelines_a16_vs_fp16.md) [Next Topic
Other Performance and Energy Guidelines](https://docs.qualcomm.com/bundle/publicresource/80-63442-10/topics/htp_guidelines_other_perf.md)