Get shape for each output from the ONNX and infer layout + YOLO26 changes + output field re-names#207
Get shape for each output from the ONNX and infer layout + YOLO26 changes + output field re-names#207
Conversation
☂️ Python Coverage
Overall Coverage
New FilesNo new covered files... Modified Files
|
Test Results 6 files 6 suites 24m 4s ⏱️ Results for commit 1ef0413. ♻️ This comment has been updated with latest results. |
|
I've been testing pose model and the keypoint layer output shape differs compared to say Yolo v8 pose: Here you see C dimension is 51, having the order be: breaks the current implementation of DetectionParser as it assumes data is flattened as [1, 51, 3520]. Which is the "correct" shape? Are both valid shapes? |
|
I have added depthai-core support for YOLO26 here. I tested the PR against the nnarchives @dtronmans generated with this PR and all looks good. The only thing I would change in the NNarchives is the parser is listed as "YOLOExtendedParser" and I would change it to "YOLO" so ParsingNeuralNetwork automatically switches to depthai-core DetectionParser? |
@aljazkonec1 good point, done |
Purpose
Shape and layout:
Renaming output names:
keypoint_outputswas previously incorrect, should be["kpt_output1", "kpt_output2", "kpt_output3"],output->output_yolo26,mask_output->output_masksYolo26 conversion with one2one head:
ReduceMax in graph:
Specification
Dependencies & Potential Impact
Deployment Plan
Testing & Validation
yolov8n:
yolo26n:
yolo26n-seg:
yolo26n-pose: