door_state_v20260511 — MobileNetV3-L 224 (4-class)

cvat2 #11 image-level tag (state ∈ unknown/open/closed/mixed). 4-class CE + WeightedRandomSampler。

注意:cvat manifest 中 unknown 類別在 train/val/test 全為 0 樣本 — 實際資料只有 3 個 active class (open/closed/mixed)。模型架構保留 4 個 output 但 unknown logit 未訓練。

Test 指標

accuracymacro F1N
0.83660.58151640

Per-class metrics

classPRF1
unknown0.00000.00000.0000
open0.62220.48780.5469
closed0.86460.91230.8878
mixed0.87320.90990.8912

Confusion matrix (rows=true, cols=pred)

unknownopenclosedmixedrow total
unknown00000
open014040107287
closed01636419399
mixed06917868954
col total02254219941640

Sample inference (test, 3 per class with samples)

truth=open → pred=open (61%)
truth=open → pred=open (96%)
truth=open → pred=open (99%)
truth=closed → pred=closed (100%)
truth=closed → pred=closed (100%)
truth=closed → pred=closed (100%)
truth=mixed → pred=mixed (100%)
truth=mixed → pred=mixed (100%)
truth=mixed → pred=mixed (100%)

Training history

epochtrain_lossval_accval_macro_f1
11.55210.70490.5015
20.22310.88910.6558
30.08890.89180.6567
40.04450.87440.6389
50.02220.89460.6486
60.01910.92210.6701
70.01220.92480.6738
80.00900.90830.6674
90.00880.91660.6688
100.00450.92030.6666
110.00620.91930.6747
120.00300.91750.6649
130.00290.91200.6598
140.00400.90470.6619
150.00440.91020.6634
160.00280.93130.6761
170.00140.93130.6761
180.00190.92580.6741
190.00210.93220.6772
200.00310.93310.6830
210.00150.92580.6736
220.00260.93220.6790
230.00300.92670.6723
240.00160.92390.6731
250.00210.93130.6772
260.00270.93220.6778
270.00200.93030.6761
280.00290.93220.6778
290.00540.93030.6761
300.00240.93220.6784

Config

{
  "version": "v20260511",
  "task": "door_state image classification (4-class)",
  "backbone": "mobilenetv3_large_100",
  "params_M": 4.207156,
  "img_size": 224,
  "hyperparams": {
    "batch": 128,
    "epochs": 30,
    "lr": 0.0003,
    "wd": 0.01
  },
  "labels": [
    "unknown",
    "open",
    "closed",
    "mixed"
  ],
  "best_val_macro_f1": 0.6830110617839775,
  "best_epoch": 20,
  "epochs_run": 30,
  "train_time_s": 532.9745559692383,
  "test_metrics": {
    "acc": 0.8365853658536585,
    "macro_f1": 0.5814625773150699,
    "per_class": {
      "unknown": {
        "precision": 0.0,
        "recall": 0.0,
        "f1": 0.0
      },
      "open": {
        "precision": 0.6222222222222222,
        "recall": 0.4878048780487805,
        "f1": 0.546875
      },
      "closed": {
        "precision": 0.8646080760095012,
        "recall": 0.9122807017543859,
        "f1": 0.8878048780487805
      },
      "mixed": {
        "precision": 0.8732394366197183,
        "recall": 0.909853249475891,
        "f1": 0.891170431211499
      }
    },
    "confusion_matrix": [
      [
        0,
        0,
        0,
        0
      ],
      [
        0,
        140,
        40,
        107
      ],
      [
        0,
        16,
        364,
        19
      ],
      [
        0,
        69,
        17,
        868
      ]
    ],
    "n": 1640
  }
}