RTMDet: Optimized for Qualcomm Devices

RTMDet is a highly efficient model for real-time object detection,capable of predicting both the bounding boxes and classes of objects within an image.It is highly optimized for real-time applications, making it reliable for industrial and commercial use

This is based on the implementation of RTMDet found here. This repository contains pre-exported model files optimized for Qualcomm® devices. You can use the Qualcomm® AI Hub Models library to export with custom configurations. More details on model performance across various devices, can be found here.

Qualcomm AI Hub Models uses Qualcomm AI Hub Workbench to compile, profile, and evaluate this model. Sign up to run these models on a hosted Qualcomm® device.

Getting Started

Due to licensing restrictions, we cannot distribute pre-exported model assets for this model. Use the Qualcomm® AI Hub Models Python library to compile and export the model with your own:

  • Custom weights (e.g., fine-tuned checkpoints)
  • Custom input shapes
  • Target device and runtime configurations

See our repository for RTMDet on GitHub for usage instructions.

Model Details

Model Type: Model_use_case.object_detection

Model Stats:

  • Model checkpoint: RTMDet Medium
  • Input resolution: 640x640
  • Number of parameters: 27.5M
  • Model size (float): 105 MB

Performance Summary

Model Runtime Precision Chipset Inference Time (ms) Peak Memory Range (MB) Primary Compute Unit
RTMDet ONNX float Snapdragon® 8 Elite Gen 5 Mobile 5.95 ms 5 - 189 MB NPU
RTMDet ONNX float Snapdragon® X2 Elite 8.182 ms 53 - 53 MB NPU
RTMDet ONNX float Snapdragon® X Elite 14.192 ms 51 - 51 MB NPU
RTMDet ONNX float Snapdragon® 8 Gen 3 Mobile 10.636 ms 5 - 235 MB NPU
RTMDet ONNX float Qualcomm® QCS8550 (Proxy) 13.7 ms 0 - 57 MB NPU
RTMDet ONNX float Qualcomm® QCS9075 23.789 ms 5 - 12 MB NPU
RTMDet ONNX float Snapdragon® 8 Elite For Galaxy Mobile 8.295 ms 4 - 180 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® 8 Elite Gen 5 Mobile 10.283 ms 3 - 329 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® X2 Elite 11.073 ms 31 - 31 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® X Elite 29.512 ms 29 - 29 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® 8 Gen 3 Mobile 19.723 ms 3 - 381 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Qualcomm® QCS8550 (Proxy) 28.027 ms 0 - 34 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Qualcomm® QCS9075 32.871 ms 2 - 5 MB NPU
RTMDet ONNX w8a16_mixed_fp16 Snapdragon® 8 Elite For Galaxy Mobile 14.22 ms 1 - 303 MB NPU
RTMDet TFLITE float Snapdragon® 8 Elite Gen 5 Mobile 6.67 ms 0 - 201 MB NPU
RTMDet TFLITE float Snapdragon® 8 Gen 3 Mobile 11.727 ms 0 - 272 MB NPU
RTMDet TFLITE float Qualcomm® QCS8275 (Proxy) 83.82 ms 0 - 203 MB NPU
RTMDet TFLITE float Qualcomm® QCS8550 (Proxy) 15.366 ms 0 - 3 MB NPU
RTMDet TFLITE float Qualcomm® SA8775P 22.828 ms 0 - 203 MB NPU
RTMDet TFLITE float Qualcomm® QCS9075 24.355 ms 0 - 62 MB NPU
RTMDet TFLITE float Qualcomm® QCS8450 (Proxy) 37.885 ms 0 - 338 MB NPU
RTMDet TFLITE float Qualcomm® SA7255P 83.82 ms 0 - 203 MB NPU
RTMDet TFLITE float Qualcomm® SA8295P 29.819 ms 0 - 265 MB NPU
RTMDet TFLITE float Snapdragon® 8 Elite For Galaxy Mobile 9.158 ms 0 - 203 MB NPU

License

  • The license for the original implementation of RTMDet can be found here.

References

Community

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support