# 运行 AI/ML 示例应用程序

Source: [https://docs.qualcomm.com/doc/80-70022-50SC/topic/ai-ml-sample-applications.html](https://docs.qualcomm.com/doc/80-70022-50SC/topic/ai-ml-sample-applications.html)

将 Qualcomm AI/ML 功能与 Qualcomm AI Runtime SDK（Qualcomm Neural Processing SDK 和 Qualcomm^®^ AI Engine Direct）以及 LiteRT（以前称为 TFLite）模型结合使用。

GA 1.6 版本的示例应用程序现在使用 **qtimlpostprocess** 插件。以下是为支持此更新所做的关键更改：
- **pipeline 更新**：所有使用 IM SDK pipeline 的 AI 示例应用程序都更改为使用新的 qtimlpostprocess 插件，该插件取代了传统的后处理插件，如 qtimlvdetection、qtimlvclassification 等。
- **经过简化的文件配置**：从配置文件中删除了常量字段。qtimlpostprocess 插件现在可以直接从模型中自动获取常量。
- **标签文件格式**：示例应用程序现在可读取 JSON 类格式的标签文件。
- **设置文件：**新增了以下应用程序的设置文件：
    - **gst-ai-pose-detection**：新增了 `hrnet_settings.json` 文件来支持姿态检测用例的阈值配置。
    - **gst-ai-face-recognition**：
        - 将 `face_landmark.labels` 文件替换为 `facemap_3dmm_settings.json`。
        - 新增了 `face_recognition_settings.json` 文件以启用阈值配置。

- **[前提条件](https://docs.qualcomm.com/doc/80-70022-50SC/topic/download-model-and-label-files.html)**  

在运行 AI/ML 示例应用程序之前，请完成这些前提条件。
- **[图像分类](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-classification.html)**  

**gst-ai-classification** 应用程序可以识别图像中的主体。这些用例是使用 Qualcomm Neural Processing SDK、LiteRT 或 Qualcomm AI Engine Direct 模型实现的。
- **[目标检测](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-object-detection.html)**  

**gst-ai-object-detection** 应用程序可以检测图像和视频中的目标。这些用例展示了使用 Qualcomm Neural Processing SDK runtime 执行 [YOLOv5](https://github.com/ultralytics/yolov5)、[YOLOv8](https://github.com/ultralytics/ultralytics) 和 [YOLO-NAS](https://github.com/Megvii-BaseDetection/YOLOX/blob/main/README.md)，使用 Qualcomm AI Engine Direct 执行 YOLOv8，以及使用 LiteRT 执行 YOLOv5 和 YOLOv8。
- **[姿态检测](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-pose-detection.html)**  

 **gst-ai-pose-detection** 应用程序可以检测图像或视频中对象的身体姿态。该用例处理来自摄像头、文件或 RTSP 源的输入流，并使用 LiteRT 和 Qualcomm AI Engine Direct 模型进行姿态检测。结果可以显示在屏幕上，保存为编码的 MP4 文件，或通过 RTSP 服务器进行流传输。
- **[图像分割](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-segmentation.html)**  

**gst-ai-segmentation** 应用程序可以将图像划分为不同且有意义的部分或段，并根据属性的相似性为每个同质段分配标签。该应用程序使用 Qualcomm Neural Processing SDK runtime、Qualcomm AI Engine Direct runtime 和 LiteRT 进行图像分割。
- **[使用 Python 和容器进行图像分割](https://docs.qualcomm.com/doc/80-70022-50SC/topic/image-segmentation-using-python.html)**  

该应用程序允许使用 Qualcomm Neural Processing SDK 和 Python 绑定来执行图像分割，所有操作都在 Docker 容器内进行。
- **[并行推理](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-parallel-inference.html)**  

**gst-ai-parallel-inference** 应用程序可以对来自不同源（例如摄像头、文件或 RTSP 网络）的输入流执行目标检测、目标分类、姿态检测和图像分割。这些用例实施 LiteRT 模型进行目标检测、图像分割、分类和姿态检测。
- **[多输入/输出目标检测](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-multi-input-output-object-detection.html)**  

**gst-ai-multi-input-output-object-detection** 应用程序可以对来自各种来源（如摄像头、文件或 RTSP 等网络）的多个视频流执行目标检测。
- **[菊花链检测与分类](https://docs.qualcomm.com/doc/80-70022-50SC/topic/daisy-chain-detection-and-classification.html)**  

**gst-ai-daisychain-detection-classification** 应用程序可以使用摄像头、文件源或 RTSP 流执行级联目标检测和分类。用例涉及目标检测和对检测到的目标进行分类。
- **[菊花链检测和姿态估计](https://docs.qualcomm.com/doc/80-70022-50SC/topic/daisy-chain-detection-and-pose-detection.html)**  

**gst-ai-daisychain-detection-pose** 应用程序可以对摄像头、文件源或 RTSP 流执行级联目标检测和姿态检测。这些用例涉及检测目标并估计图像或视频中主体的身体姿态。
- **[视频单目深度估计](https://docs.qualcomm.com/doc/80-70022-50SC/topic/mono-depth-from-video.html)**  

**gst-ai-monodepth** 应用程序可以对实时摄像头流、文件或 RTSP 流推断源的深度。
- **[视频超分辨率](https://docs.qualcomm.com/doc/80-70022-50SC/topic/video-super-resolution.html)**  

**gst-ai-superresolution** 应用程序可以用低分辨率输入生成高分辨率视频帧。
- **[多流推理](https://docs.qualcomm.com/doc/80-70022-50SC/topic/multistream-inference.html)**  

**gst-ai-multistream-inference** 应用程序可对来自摄像头、文件或 RTSP 流的最多 32 个输入流执行 AI 推理（目标检测和分类）。
- **[多流批量推理](https://docs.qualcomm.com/doc/80-70022-50SC/topic/multistream-batch-inference.html)**  

**gst-ai-multistream-batch-inference** 应用程序显示对来自视频文件的最多 24 个输入流的批量 AI 推理（目标检测和分割）。
- **[AI 智能 codec](https://docs.qualcomm.com/doc/80-70022-50SC/topic/ai-smart-codec.html)**  

**gst-ai-smartcodec-example** 应用程序可减少摄像头或文件源的输入的网络带宽和存储资源。
- **[人脸检测](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-face-detection.html)**  

**gst-ai-face-detection** 应用程序从摄像头、文件或 RTSP 流中采集实时视频输入，并使用 Qualcomm AI Engine direct 和 LiteRT 人脸检测模型生成预览，在 HDMI 显示器上叠加显示 AI 模型输出。
- **[人脸识别](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-face-recognition.html)**  

**gst-ai-face-recognition** 应用程序从摄像头或 RTSP 流采集实时视频输入，并对这些输入执行人脸检测、人脸特征点定位和人脸识别。它使用 face\_det\_quantized模型进行人脸检测，使用 `facemap_3dmm_quantized` 模型进行人脸特征点定位，使用 `face_attrib_net_quantized` 模型进行人脸识别标签标注。
- **[音频分类](https://docs.qualcomm.com/doc/80-70022-50SC/topic/audio-classification.html)**  

**gst-ai-audio-classification** 应用程序对来自文件源或麦克风的输入显示其音频分类。将同时显示分类结果和视频预览。
- **[元数据解析](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-metadata-parser.html)**  

**gst-ai-metadata-parser-example** 应用程序从摄像头、文件或 RTSP 源接收实时视频流输入，并将该流传递给 YOLO 模型来执行目标检测和预览。叠加的 AI 模型输出（包括标签和边界框）将显示在 HDMI 显示器上。提取的元数据将记录到控制台，并用于计算帧中的人数。
- **[AI 事件编码器](https://docs.qualcomm.com/doc/80-70022-50SC/topic/gst-ai-event-encoder.html)**  

**gst-ai-event-encoder** 应用程序接收来自摄像头、文件或 RTSP 源的实时视频流输入。当有人进入视频画面时，应用程序会对视频进行预处理，在 AI 硬件上运行推理，并对视频进行编码。当有人离开画面 5 秒后，编码停止；当有人进入画面时，编码重新开始。
- **[AI USB 摄像头](https://docs.qualcomm.com/doc/80-70022-50SC/topic/ai-usb-camera.html)**  

**gst-ai-usb-camera-app** 从连接到 Qualcomm EVK 的 USB 网络摄像头传输视频流。该网络摄像头应可作为 /dev/videoX设备访问。此外，您还可以执行目标检测并预览结果。
- **[硬件基准测试应用程序](https://docs.qualcomm.com/doc/80-70022-50SC/topic/vision_ai_demo_app.html)**  

硬件基准测试应用程序监控一组定义的示例应用程序的设备硬件使用情况，以采集 CPU/GPU/NPU 使用情况和设备温控等指标。这些指标解释了资源使用情况和限制，有助于根据需求调整您的 AI 用例。

**Parent Topic:** [示例应用程序](https://docs.qualcomm.com/doc/80-70022-50SC/topic/example-applications.html)

Last Published: Nov 05, 2025

[Previous Topic
示例应用程序](https://docs.qualcomm.com/bundle/publicresource/80-70022-50SC/topics/example-applications.md) [Next Topic
前提条件](https://docs.qualcomm.com/bundle/publicresource/80-70022-50SC/topics/download-model-and-label-files.md)