VSSD Family
model | params (m) | pretrain | finetune | GFLOPs | Top-1 |
---|---|---|---|---|---|
VSSD-M | 14.0 | IN-1k : Sup. : 300 | — : — : — | 2.3 | 82.5 |
VSSD-T | 24.0 | IN-1k : Sup. : 300 | — : — : — | 4.5 | 83.7 |
VSSD-S | 40.0 | IN-1k : Sup. : 300 | — : — : — | 7.4 | 84.1 |
VSSD-B | 89.0 | IN-1k : Sup. : 300 | — : — : — | 16.1 | 84.7 |
model | params (m) | pretrain | finetune | gflops | IN-1k |
---|---|---|---|---|---|
VSSD-M | 14.0 | IN-1k : Sup. : 300 | — : — : — | 2.3 | 82.5/— |
VSSD-T | 24.0 | IN-1k : Sup. : 300 | — : — : — | 4.5 | 83.7/— |
VSSD-S | 40.0 | IN-1k : Sup. : 300 | — : — : — | 7.4 | 84.1/— |
VSSD-B | 89.0 | IN-1k : Sup. : 300 | — : — : — | 16.1 | 84.7/— |
COCO (val)
model | pretrain | head | train | gflops | mAPb | APb50 | APb75 | mAPbs | mAPbm | mAPbl |
---|---|---|---|---|---|---|---|---|---|---|
VSSD-M | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 220.0 | 45.4 | 67.5 | 49.8 | — | — | — |
VSSD-M | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 220.0 | 47.7 | 69.7 | 52.1 | — | — | — |
VSSD-T | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 265.0 | 46.9 | 69.4 | 51.4 | — | — | — |
VSSD-T | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 265.0 | 48.8 | 70.4 | 53.4 | — | — | — |
VSSD-S | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 325.0 | 48.4 | 70.1 | 53.1 | — | — | — |
COCO (val)
model | pretrain | head | train | gflops | mAPm | APm50 | APm75 | mAPms | mAPmm | mAPml |
---|---|---|---|---|---|---|---|---|---|---|
VSSD-M | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 220.0 | 41.3 | 64.5 | 44.6 | — | — | — |
VSSD-M | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 220.0 | 42.8 | 66.5 | 46.0 | — | — | — |
VSSD-T | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 265.0 | 42.6 | 66.4 | 45.9 | — | — | — |
VSSD-T | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 265.0 | 43.6 | 67.6 | 46.9 | — | — | — |
VSSD-S | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 325.0 | 43.5 | 67.2 | 47.1 | — | — | — |