VMINet Family
| model | params (m) | pretrain | head | train | GFLOPs | mAP |
|---|---|---|---|---|---|---|
| VMINet-XS | 7.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 189.0 | 38.9 |
| VMINet-S | 13.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 201.0 | 43.2 |
| VMINet-B | 28.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 276.0 | 44.5 |
| model | params (m) | pretrain | finetune | gflops | IN-1k |
|---|---|---|---|---|---|
| VMINet-Ti | 2.0 | IN-1k : Sup. : 300 | — : — : — | 0.3 | 70.7/— |
| VMINet-XS | 7.0 | IN-1k : Sup. : 300 | — : — : — | 1.4 | 78.6/— |
| VMINet-S | 13.0 | IN-1k : Sup. : 300 | — : — : — | 2.3 | 80.5/— |
| VMINet-B | 28.0 | IN-1k : Sup. : 300 | — : — : — | 4.8 | 82.4/— |
COCO (val)
| model | pretrain | head | train | gflops | mAPb | APb50 | APb75 | mAPbs | mAPbm | mAPbl |
|---|---|---|---|---|---|---|---|---|---|---|
| VMINet-XS | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 189.0 | 38.9 | 61.9 | 42.4 | — | — | — |
| VMINet-S | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 201.0 | 43.2 | 65.3 | 47.3 | — | — | — |
| VMINet-B | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 276.0 | 44.5 | 66.7 | 48.6 | — | — | — |
COCO (val)
| model | pretrain | head | train | gflops | mAPm | APm50 | APm75 | mAPms | mAPmm | mAPml |
|---|---|---|---|---|---|---|---|---|---|---|
| VMINet-XS | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 189.0 | 36.4 | 58.7 | 38.8 | — | — | — |
| VMINet-S | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 201.0 | 39.3 | 62.2 | 42.3 | — | — | — |
| VMINet-B | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 276.0 | 40.5 | 63.7 | 43.7 | — | — | — |