架構:MobileNetV3-L backbone + 2 binary heads (image-level multi-label),跟 fire_smoke v20260410 同 pattern。
資料:cvat2 project 7「door_state_detection」7,471 frames(從 raicvat#12 hatch project 168 tasks 移植)。每張圖標 1 個 door tag with state attr ∈ [open/closed/mixed/unknown]。
| attribute | AP | F1 | P | R | thr | valid |
|---|---|---|---|---|---|---|
| has_open 有門開啟 | 0.9966 | 0.977 | 0.974 | 0.979 | 0.76 | 1640 |
| has_close 有門關閉 | 0.9772 | 0.920 | 0.908 | 0.932 | 0.32 | 1640 |
mobilenetv3_large_100.ra_in1k (4.2M params)nn.Linear(1280, 2) + Dropout(0.3)has_open / has_closeimg → model → [has_open=0.95, has_close=0.02] → 圖中有開門 img → model → [has_open=0.02, has_close=0.98] → 圖中有關門 img → model → [has_open=0.85, has_close=0.91] → mixed(多扇門一開一關) img → model → [has_open=0.10, has_close=0.15] → 不確定(無門/看不清)
door tag + state select(open/closed/mixed/unknown),但訓練時轉成 2 個 binary head 處理 multi-door 場景。
| split | open | closed | mixed | unknown | total |
|---|---|---|---|---|---|
| Train | 518 | 1,815 | 1,742 | 461 | 4,536 |
| Validation | 101 | 451 | 539 | 79 | 1,170 |
| Test | 287 | 399 | 954 | 125 | 1,765 |
生成 2026-04-27| PPE 21-attr report| label guide| safety_rope proposal