cvat2 #23 · 5-class (car/truck/bus/motorcycle/bicycle) · polygon seg → bbox · 2026-06-24
| split | images | bbox | 密度/img |
|---|---|---|---|
| train | 23,725 | 217,170 | 9.2 |
| val | 2,239 | 39,230 | 17.5 |
| test | 27 | 361 | —(小,主評估靠 val) |
per-class GT (val):car 22900, truck 1965, bus 1415, motorcycle 10578, bicycle 2372。
SAM3 subset 16 task 按比例分入 train/val(13/3);Test 內 3 個 500-frame 未標 task(5835/36/37)已排除。
| 項目 | 值 |
|---|---|
| base weight | yolo26n.pt (YOLO26n, 2.38M params) |
| 訓練 imgsz / 推論 imgsz | 640 / 640 |
| epochs / patience | 100 / 30(跑滿 100) |
| batch / device | 64 / 單卡 RTX5090 |
| optimizer | auto → MuSGD (lr0 0.01) |
| aug | mosaic(close@10) + fliplr 0.5(YOLO det 標準) |
| shape→bbox | polygon 外接框 + rectangle,5-class |
| 類別 | AP@50 | AP@50-95 | P | R |
|---|---|---|---|---|
| car | 0.691 | 0.505 | 0.754 | 0.633 |
| truck | 0.440 | 0.322 | 0.629 | 0.392 |
| bus | 0.674 | 0.537 | 0.810 | 0.610 |
| motorcycle | 0.354 | 0.168 | 0.609 | 0.316 |
| bicycle | 0.142 | 0.068 | 0.494 | 0.117 |
| 類別 | AP@50 | AP@50-95 | P | R |
|---|---|---|---|---|
| car | 0.840 | 0.599 | 0.818 | 0.751 |
| truck | 0.593 | 0.455 | 0.510 | 0.429 |
| bus | 0.742 | 0.471 | 0.795 | 0.557 |
| motorcycle | 0.739 | 0.390 | 0.657 | 0.629 |
| bicycle | 0.161 | 0.031 | 0.306 | 0.333 |
car/bus 尚可(AP50 ~0.68),truck/motorcycle 中等、bicycle 崩(R 0.12)。
| conf | P | R | TP | FP | FN |
|---|---|---|---|---|---|
| 0.05 | 0.387 | 0.599 | 23499 | 37274 | 15731 |
| 0.15 | 0.665 | 0.519 | 20372 | 10254 | 18858 |
| 0.25 | 0.797 | 0.468 | 18369 | 4680 | 20861 |
| 0.35 | 0.876 | 0.424 | 16640 | 2350 | 22590 |
| 0.50 | 0.939 | 0.361 | 14165 | 925 | 25065 |
| 0.70 | 0.972 | 0.281 | 11030 | 324 | 28200 |
精度可靠拉 conf 換到(conf 0.5 → P 0.94),但代價是 recall 掉到 0.36。場域用建議 conf≈0.25(P 0.80 / R 0.47 / FP 4680,藍底列)。
| 類別 | FP 數 | GT 數 |
|---|---|---|
| car | 20620 | 22900 |
| truck | 1654 | 1965 |
| bus | 695 | 1415 |
| motorcycle | 11773 | 10578 |
| bicycle | 2532 | 2372 |
car/motorcycle 的 FP 最多(各 ~2万/1.2万)——與密集街景小目標、實例重疊有關。
imgsz 1280 對照(同資料同 backbone,純比尺度),若 recall 明顯回升再考慮 backbone 升級與 bicycle 補標。本版 best.pt 保留為 640 baseline。YOLO26n · 訓練 2.36h · 100 epoch · RTX5090 單卡 · 報告自動生成 2026-06-24