Mask R-CNN
| family | model | params (M) | pretrain | head | train | GFLOPs | mask mAP |
|---|---|---|---|---|---|---|---|
| Swin | Swin-T | 29.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 267.0 | 41.6 |
| EfficientVMamba | EfficientVMamba-S | 11.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 197.0 | 36.7 |
| EfficientVMamba | EfficientVMamba-S | 11.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 197.0 | 38.2 |
| Mamba-Adaptor | Mamba-Adaptor-b1 | 7.8 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 218.0 | 39.5 |
| Mamba-Adaptor | Mamba-Adaptor-b1 | 7.8 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 218.0 | 41.2 |
| CAFormer | CAFormer-S18 | 26.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 508.0 | 43.7 |
| Iwin Transformer | Iwin-T | 30.2 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 268.0 | 38.9 |
| Iwin Transformer | Iwin-T | 30.2 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 268.0 | 40.9 |
| NAT | NAT-T | 28.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 258.0 | 42.6 |
| MogaNet | MogaNet-T | 5.2 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 192.0 | 39.1 |
| FocalNet | FocalNet-B-LRF | 88.7 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 507.0 | 43.5 |
| FocalNet | FocalNet-B-LRF | 88.7 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 507.0 | 44.1 |
| InceptionMamba | InceptionMamba-S | 46.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 301.0 | 42.6 |
| DAMamba | DAMamba-S | 45.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 395.0 | 44.5 |
| DAMamba | DAMamba-S | 45.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 395.0 | 45.1 |
| InternImage | InternImage-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 340.0 | 43.3 |
| InternImage | InternImage-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 340.0 | 44.5 |
| InceptionMamba | InceptionMamba-T | 25.4 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 233.0 | 41.8 |
| UniNeXt | UniNeXt-B | 91.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 460.0 | 43.9 |
| FocalNet | FocalNet-T-SRF | 28.4 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 268.0 | 41.3 |
| FocalNet | FocalNet-T-SRF | 28.4 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 268.0 | 42.6 |
| RMT | RMT-S | 27.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 262.0 | 43.9 |
| RMT | RMT-S | 27.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 262.0 | 44.9 |
| MogaNet | MogaNet-XT | 3.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 185.0 | 37.6 |
| S2AFormer | S2AFormer-mini | 5.02 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 177.0 | 31.7 |
| MogaNet | MogaNet-B | 43.8 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 373.0 | 43.2 |
| MA ViT | MA ViT-B | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 372.0 | 46.1 |
| MA ViT | MA ViT-B | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 372.0 | 47.0 |
| UniConvNet | UniConvNet-T | 30.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 265.0 | 43.3 |
| UniConvNet | UniConvNet-T | 30.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 265.0 | 44.5 |
| UniConvNet | UniConvNet-N2 | 15.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 220.0 | 41.9 |
| UniConvNet | UniConvNet-N2 | 15.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 220.0 | 43.2 |
| MA ViT | MA ViT-S | 27.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 262.0 | 44.7 |
| MA ViT | MA ViT-S | 27.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 262.0 | 45.5 |
| PlainMamba | PlainMamba-L2 | 25.7 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 542.0 | 40.6 |
| UniConvNet | UniConvNet-B | 97.6 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 498.0 | 45.0 |
| UniConvNet | UniConvNet-B | 97.6 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 498.0 | 45.6 |
| PlainMamba | PlainMamba-L3 | 50.5 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 696.0 | 41.2 |
| FocalNet | FocalNet-S-SRF | 50.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 356.0 | 42.7 |
| FocalNet | FocalNet-S-SRF | 50.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 356.0 | 43.6 |
| FocalNet | FocalNet-T-LRF | 28.6 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 268.0 | 41.5 |
| FocalNet | FocalNet-T-LRF | 28.6 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 268.0 | 42.9 |
| VSSD | VSSD-M | 14.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 220.0 | 41.3 |
| VSSD | VSSD-M | 14.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 220.0 | 42.8 |
| LocalVMamba | LocalVMamba-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 414.0 | 43.2 |
| LocalVMamba | LocalVMamba-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 414.0 | 44.1 |
| TransNeXt | TransNeXt-T | 28.2 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 356.0 | 44.6 |
| Mamba-Adaptor | Mamba-Adaptor-b2 | 32.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 259.0 | 43.4 |
| Mamba-Adaptor | Mamba-Adaptor-b2 | 32.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 259.0 | 44.8 |
| SSViT | SSViT-T | 15.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 223.0 | 42.6 |
| TransNeXt | TransNeXt-B | 89.7 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 728.0 | 45.9 |
| FractalMamba++ | FractalMamba++ (T) | 30.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 260.0 | 43.2 |
| FractalMamba++ | FractalMamba++ (T) | 30.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 260.0 | 44.1 |
| SSViT | SSViT-S | 27.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 266.0 | 45.4 |
| SSViT | SSViT-S | 27.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 266.0 | 44.0 |
| VMamba | VMamba-T | 30.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 271.0 | 42.7 |
| VMamba | VMamba-T | 30.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 271.0 | 43.7 |
| ConvNeXt V2 | ConvNeXt V2-B | 89.0 | IN-1k : FCMAE : 1600 | Mask R-CNN | COCO (train) : 36 | 486.0 | 46.6 |
| FocalNet | FocalNet-B-SRF | 88.1 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 496.0 | 43.3 |
| FocalNet | FocalNet-B-SRF | 88.1 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 496.0 | 44.1 |
| RMT | RMT-T | 14.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 218.0 | 42.6 |
| RMT | RMT-B | 54.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 373.0 | 45.5 |
| RMT | RMT-B | 54.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 373.0 | 46.1 |
| ConvFormer | ConvFormer-S18 | 27.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 502.0 | 42.6 |
| ConvNeXt V2 | ConvNeXt V2-H | 660.0 | IN-1k : FCMAE : 1600 | Mask R-CNN | COCO (train) : 36 | 2525.0 | 48.9 |
| Hiera | Hiera-B+ | 70.0 | IN-1k : MAE : 1600 | Mask R-CNN | COCO (train) : 100 | 600.0 | 47.3 |
| CSWin | CSWin-S | 35.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 342.0 | 43.2 |
| CSWin | CSWin-S | 35.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 342.0 | 44.5 |
| NAT | NAT-S | 51.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 330.0 | 43.2 |
| PlainMamba | PlainMamba-L1 | 7.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 388.0 | 39.1 |
| Swin | Swin-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 359.0 | 43.3 |
| UniNeXt | UniNeXt-S | 51.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 333.0 | 43.7 |
| A2Mamba | A2Mamba-S | 31.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 283.0 | 45.3 |
| Swin | Swin-B | 88.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 50 | 600.0 | 44.5 |
| MambaOut | MambaOut-T | 26.5 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 262.0 | 41.0 |
| EfficientVMamba | EfficientVMamba-B | 33.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 252.0 | 40.2 |
| EfficientVMamba | EfficientVMamba-B | 33.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 252.0 | 40.8 |
| MambaOut | MambaOut-B | 84.8 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 495.0 | 43.0 |
| MambaOut | MambaOut-S | 48.5 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 354.0 | 42.7 |
| InternImage | InternImage-B | 97.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 501.0 | 44.0 |
| InternImage | InternImage-B | 97.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 501.0 | 44.8 |
| VSSD | VSSD-T | 24.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 265.0 | 42.6 |
| VSSD | VSSD-T | 24.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 265.0 | 43.6 |
| TransNeXt | TransNeXt-S | 49.7 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 516.0 | 45.5 |
| FocalNet | FocalNet-S-LRF | 50.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 365.0 | 43.1 |
| FocalNet | FocalNet-S-LRF | 50.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 365.0 | 43.8 |
| Swin | Swin-B | 88.0 | IN-22k : Sup. : 90 | Mask R-CNN | COCO (train) : 50 | 700.0 | 45.4 |
| A2Mamba | A2Mamba-B | 51.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 410.0 | 46.8 |
| S2AFormer | S2AFormer-M | 24.87 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 253.0 | 39.3 |
| MogaNet | MogaNet-S | 25.3 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 272.0 | 42.2 |
| Iwin Transformer | Iwin-S | 51.6 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 358.0 | 40.0 |
| Iwin Transformer | Iwin-S | 51.6 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 358.0 | 41.0 |
| Hiera | Hiera-L | 214.0 | IN-1k : MAE : 1600 | Mask R-CNN | COCO (train) : 100 | 1200.0 | 48.6 |
| FractalMamba++ | FractalMamba++ (M) | 11.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 199.0 | 40.3 |
| FractalMamba++ | FractalMamba++ (M) | 11.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 199.0 | 42.2 |
| DAMamba | DAMamba-B | 86.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 520.0 | 44.9 |
| DAMamba | DAMamba-B | 86.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 520.0 | 45.3 |
| CSWin | CSWin-T | 23.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 279.0 | 42.2 |
| CSWin | CSWin-T | 23.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 279.0 | 43.6 |
| Hiera | Hiera-B | 52.0 | IN-1k : MAE : 1600 | Mask R-CNN | COCO (train) : 100 | 600.0 | 46.3 |
| UniConvNet | UniConvNet-N3 | 19.7 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 239.0 | 42.4 |
| UniConvNet | UniConvNet-N3 | 19.7 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 239.0 | 44.2 |
| VMINet | VMINet-B | 28.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 276.0 | 40.5 |
| UniConvNet | UniConvNet-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 336.0 | 43.8 |
| UniConvNet | UniConvNet-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 336.0 | 45.2 |
| S2AFormer | S2AFormer-XS | 6.54 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 185.0 | 35.8 |
| VSSD | VSSD-S | 40.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 325.0 | 43.5 |
| Swin | Swin-L | 197.0 | IN-22k : Sup. : 90 | Mask R-CNN | COCO (train) : 50 | 1100.0 | 46.2 |
| ConvNeXt | ConvNeXt-T | 29.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 262.0 | 41.7 |
| VMamba | VMamba-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 349.0 | 43.7 |
| VMamba | VMamba-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 349.0 | 44.2 |
| S2AFormer | S2AFormer-S | 10.69 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 197.0 | 37.6 |
| VMINet | VMINet-S | 13.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 201.0 | 39.3 |
| EfficientVMamba | EfficientVMamba-T | 6.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 60.0 | 33.2 |
| EfficientVMamba | EfficientVMamba-T | 6.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 60.0 | 35.3 |
| MA ViT | MA ViT-L | 98.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 501.0 | 46.5 |
| MA ViT | MA ViT-L | 98.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 501.0 | 47.2 |
| NAT | NAT-M | 20.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 225.0 | 41.7 |
| A2Mamba | A2Mamba-L | 95.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 552.0 | 46.8 |
| ConvNeXt V2 | ConvNeXt V2-L | 198.0 | IN-1k : FCMAE : 1600 | Mask R-CNN | COCO (train) : 36 | 875.0 | 47.7 |
| SSViT | SSViT-B | 57.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 382.0 | 46.4 |
| SSViT | SSViT-B | 57.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 382.0 | 45.4 |
| DAT | DAT-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 378.0 | 42.5 |
| DAT | DAT-S | 50.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 378.0 | 44.0 |
| DAMamba | DAMamba-T | 26.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 284.0 | 43.4 |
| DAMamba | DAMamba-T | 26.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 284.0 | 44.8 |
| VMINet | VMINet-XS | 7.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 189.0 | 36.4 |
| SSViT | SSViT-L | 100.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 572.0 | 46.0 |
| DAT++ | DAT-S++ | 53.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 430.0 | 44.5 |
| DAT++ | DAT-S++ | 53.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 430.0 | 45.7 |
| DAT++ | DAT-T++ | 24.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 301.0 | 43.7 |
| DAT++ | DAT-T++ | 24.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 301.0 | 45.1 |
| RMT | RMT-L | 95.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 557.0 | 45.9 |
| UniNeXt | UniNeXt-T | 24.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 266.0 | 43.4 |
| GroupMamba | GroupMamba-T | 23.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 279.0 | 42.9 |
| MA ViT | MA ViT-T | 16.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 219.0 | 42.9 |
| VMamba | VMamba-B | 89.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 485.0 | 44.1 |
| LocalVMamba | LocalVMamba-T | 26.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 291.0 | 42.2 |
| LocalVMamba | LocalVMamba-T | 26.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 291.0 | 43.4 |
| CSWin | CSWin-B | 78.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 526.0 | 43.9 |
| CSWin | CSWin-B | 78.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 526.0 | 44.9 |
| InternImage | InternImage-T | 30.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 270.0 | 42.5 |
| InternImage | InternImage-T | 30.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 270.0 | 43.7 |
| DAT | DAT-T | 29.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 272.0 | 40.4 |
| DAT | DAT-T | 29.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 36 | 272.0 | 42.4 |
| InceptionMamba | InceptionMamba-B | 83.0 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 421.0 | 43.1 |
| S2AFormer | S2AFormer-T | 5.8 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 182.0 | 35.4 |
COCO (val), box AP
| family | model | pretrain | train | GFLOPs | AP^b | AP^b_50 | AP^b_75 | AP^b_S | AP^b_M | AP^b_L |
|---|---|---|---|---|---|---|---|---|---|---|
| TransNeXt | TransNeXt-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 356.0 | 49.9 | 71.5 | 54.9 | — | — | — |
| TransNeXt | TransNeXt-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 516.0 | 51.1 | 72.6 | 56.2 | — | — | — |
| TransNeXt | TransNeXt-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 728.0 | 51.7 | 73.2 | 56.9 | — | — | — |
| ConvNeXt | ConvNeXt-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 262.0 | 46.2 | 67.9 | 50.8 | — | — | — |
| Swin | Swin-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 267.0 | 46.0 | 68.1 | 50.3 | — | — | — |
| Swin | Swin-B | IN-1k : Sup. : 300 | COCO (train) : 50 | 700.0 | 50.1 | — | — | — | — | — |
| Swin | Swin-B | IN-22k : Sup. : 90 | COCO (train) : 50 | 700.0 | 51.4 | — | — | — | — | — |
| Swin | Swin-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 359.0 | 48.5 | — | — | — | — | — |
| Swin | Swin-L | IN-22k : Sup. : 90 | COCO (train) : 50 | 1100.0 | 52.4 | — | — | — | — | — |
| ConvNeXt V2 | ConvNeXt V2-H | IN-1k : FCMAE : 1600 | COCO (train) : 36 | 2525.0 | 55.7 | 75.2 | 61.8 | — | — | — |
| ConvNeXt V2 | ConvNeXt V2-L | IN-1k : FCMAE : 1600 | COCO (train) : 36 | 875.0 | 54.4 | 73.9 | 60.4 | — | — | — |
| ConvNeXt V2 | ConvNeXt V2-B | IN-1k : FCMAE : 1600 | COCO (train) : 36 | 486.0 | 52.9 | 72.6 | 58.9 | — | — | — |
| Hiera | Hiera-B | IN-1k : MAE : 1600 | COCO (train) : 100 | 600.0 | 52.2 | — | — | — | — | — |
| Hiera | Hiera-B+ | IN-1k : MAE : 1600 | COCO (train) : 100 | 600.0 | 53.5 | — | — | — | — | — |
| Hiera | Hiera-L | IN-1k : MAE : 1600 | COCO (train) : 100 | 1200.0 | 55.0 | — | — | — | — | — |
| InternImage | InternImage-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 270.0 | 47.2 | 69.0 | 52.1 | — | — | — |
| InternImage | InternImage-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 270.0 | 49.1 | 70.4 | 54.1 | — | — | — |
| InternImage | InternImage-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 340.0 | 47.8 | 69.8 | 52.8 | — | — | — |
| InternImage | InternImage-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 340.0 | 49.7 | 71.1 | 54.5 | — | — | — |
| InternImage | InternImage-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 501.0 | 48.8 | 70.9 | 54.0 | — | — | — |
| InternImage | InternImage-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 501.0 | 50.3 | 71.4 | 55.3 | — | — | — |
| FocalNet | FocalNet-T-LRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 268.0 | 46.1 | 68.2 | 50.6 | — | — | — |
| FocalNet | FocalNet-T-LRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 268.0 | 48.0 | 69.7 | 53.0 | — | — | — |
| FocalNet | FocalNet-T-SRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 268.0 | 45.9 | 68.3 | 50.1 | — | — | — |
| FocalNet | FocalNet-T-SRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 268.0 | 47.6 | 69.5 | 52.0 | — | — | — |
| FocalNet | FocalNet-S-SRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 356.0 | 48.0 | 69.9 | 52.7 | — | — | — |
| FocalNet | FocalNet-S-SRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 356.0 | 48.9 | 70.1 | 53.7 | — | — | — |
| FocalNet | FocalNet-S-LRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 365.0 | 48.3 | 70.5 | 53.1 | — | — | — |
| FocalNet | FocalNet-S-LRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 365.0 | 49.3 | 70.7 | 54.2 | — | — | — |
| FocalNet | FocalNet-B-LRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 507.0 | 49.0 | 70.9 | 53.9 | — | — | — |
| FocalNet | FocalNet-B-LRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 507.0 | 49.8 | 70.9 | 54.6 | — | — | — |
| FocalNet | FocalNet-B-SRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 496.0 | 48.8 | 70.7 | 53.5 | — | — | — |
| FocalNet | FocalNet-B-SRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 496.0 | 49.6 | 70.6 | 54.1 | — | — | — |
| CSWin | CSWin-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 279.0 | 46.7 | 68.6 | 51.3 | — | — | — |
| CSWin | CSWin-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 279.0 | 49.0 | 70.7 | 53.7 | — | — | — |
| CSWin | CSWin-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 342.0 | 47.9 | 70.1 | 52.6 | — | — | — |
| CSWin | CSWin-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 342.0 | 50.0 | 71.3 | 54.7 | — | — | — |
| CSWin | CSWin-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 526.0 | 48.7 | 70.4 | 53.9 | — | — | — |
| CSWin | CSWin-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 526.0 | 50.8 | 72.1 | 55.8 | — | — | — |
| CAFormer | CAFormer-S18 | IN-1k : Sup. : 300 | COCO (train) : 36 | 508.0 | 48.6 | 70.5 | 53.4 | — | — | — |
| ConvFormer | ConvFormer-S18 | IN-1k : Sup. : 300 | COCO (train) : 36 | 502.0 | 47.7 | 69.6 | 52.3 | — | — | — |
| MogaNet | MogaNet-XT | IN-1k : Sup. : 300 | COCO (train) : 12 | 185.0 | 40.7 | 62.3 | 44.4 | — | — | — |
| MogaNet | MogaNet-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 192.0 | 42.6 | 64.0 | 46.4 | — | — | — |
| MogaNet | MogaNet-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 272.0 | 46.7 | 68.0 | 51.3 | — | — | — |
| MogaNet | MogaNet-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 373.0 | 47.9 | 70.0 | 52.7 | — | — | — |
| VMamba | VMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 271.0 | 47.3 | 69.3 | 52.0 | — | — | — |
| VMamba | VMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 271.0 | 48.8 | 70.4 | 53.5 | — | — | — |
| VMamba | VMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 349.0 | 48.7 | 70.0 | 53.4 | — | — | — |
| VMamba | VMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 349.0 | 49.9 | 70.9 | 54.7 | — | — | — |
| VMamba | VMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 485.0 | 49.2 | 71.4 | 54.0 | — | — | — |
| BiFormer | BiFormer-S | IN-1k : Sup. : 300 | COCO (train) : 12 | — | 47.8 | 69.8 | 52.3 | — | — | — |
| BiFormer | BiFormer-B | IN-1k : Sup. : 300 | COCO (train) : 12 | — | 48.6 | 70.5 | 53.8 | — | — | — |
| MambaOut | MambaOut-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 262.0 | 45.1 | 67.3 | 49.6 | — | — | — |
| MambaOut | MambaOut-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 354.0 | 47.4 | 69.1 | 52.4 | — | — | — |
| MambaOut | MambaOut-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 495.0 | 47.4 | 69.3 | 52.2 | — | — | — |
| GroupMamba | GroupMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 279.0 | 47.6 | 69.8 | 52.1 | — | — | — |
| PlainMamba | PlainMamba-L1 | IN-1k : Sup. : 300 | COCO (train) : 12 | 388.0 | 44.1 | 64.8 | 47.9 | — | — | — |
| PlainMamba | PlainMamba-L2 | IN-1k : Sup. : 300 | COCO (train) : 12 | 542.0 | 46.0 | 66.9 | 50.1 | — | — | — |
| PlainMamba | PlainMamba-L3 | IN-1k : Sup. : 300 | COCO (train) : 12 | 696.0 | 46.8 | 68.0 | 51.1 | — | — | — |
| LocalVMamba | LocalVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 291.0 | 46.7 | 68.7 | 50.8 | — | — | — |
| LocalVMamba | LocalVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 291.0 | 48.7 | 70.1 | 53.0 | — | — | — |
| LocalVMamba | LocalVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 414.0 | 48.4 | 69.9 | 52.7 | — | — | — |
| LocalVMamba | LocalVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 414.0 | 49.9 | 70.5 | 54.4 | — | — | — |
| EfficientVMamba | EfficientVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 60.0 | 35.6 | 57.7 | 38.0 | — | — | — |
| EfficientVMamba | EfficientVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 60.0 | 38.3 | 60.3 | 41.6 | — | — | — |
| EfficientVMamba | EfficientVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 197.0 | 39.3 | 61.8 | 42.6 | — | — | — |
| EfficientVMamba | EfficientVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 197.0 | 41.6 | 63.9 | 45.6 | — | — | — |
| EfficientVMamba | EfficientVMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 252.0 | 43.7 | 66.2 | 47.9 | — | — | — |
| EfficientVMamba | EfficientVMamba-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 252.0 | 45.0 | 66.9 | 49.2 | — | — | — |
| DAMamba | DAMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 284.0 | 48.5 | 70.3 | 53.3 | — | — | — |
| DAMamba | DAMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 284.0 | 50.4 | 71.4 | 55.5 | — | — | — |
| DAMamba | DAMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 395.0 | 49.8 | 71.2 | 54.7 | — | — | — |
| DAMamba | DAMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 395.0 | 51.2 | 72.1 | 56.1 | — | — | — |
| DAMamba | DAMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 520.0 | 50.6 | 71.9 | 55.5 | — | — | — |
| DAMamba | DAMamba-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 520.0 | 51.4 | 72.3 | 56.4 | — | — | — |
| VSSD | VSSD-M | IN-1k : Sup. : 300 | COCO (train) : 12 | 220.0 | 45.4 | 67.5 | 49.8 | — | — | — |
| VSSD | VSSD-M | IN-1k : Sup. : 300 | COCO (train) : 36 | 220.0 | 47.7 | 69.7 | 52.1 | — | — | — |
| VSSD | VSSD-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 265.0 | 46.9 | 69.4 | 51.4 | — | — | — |
| VSSD | VSSD-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 265.0 | 48.8 | 70.4 | 53.4 | — | — | — |
| VSSD | VSSD-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 325.0 | 48.4 | 70.1 | 53.1 | — | — | — |
| RMT | RMT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 218.0 | 47.1 | 68.8 | 51.7 | — | — | — |
| RMT | RMT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 262.0 | 49.0 | 70.8 | 53.9 | — | — | — |
| RMT | RMT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 262.0 | 50.7 | 71.9 | 55.6 | — | — | — |
| RMT | RMT-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 373.0 | 51.1 | 72.5 | 56.1 | — | — | — |
| RMT | RMT-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 373.0 | 52.2 | 72.9 | 57.0 | — | — | — |
| RMT | RMT-L | IN-1k : Sup. : 300 | COCO (train) : 12 | 557.0 | 51.6 | 73.1 | 56.5 | — | — | — |
| DAT++ | DAT-T++ | IN-1k : Sup. : 300 | COCO (train) : 12 | 301.0 | 48.7 | 70.9 | 53.7 | 32.8 | 52.4 | 63.5 |
| DAT++ | DAT-T++ | IN-1k : Sup. : 300 | COCO (train) : 36 | 301.0 | 50.5 | 71.9 | 55.7 | 35.0 | 54.3 | 65.3 |
| DAT++ | DAT-S++ | IN-1k : Sup. : 300 | COCO (train) : 12 | 430.0 | 49.8 | 71.9 | 54.6 | 33.8 | 53.9 | 64.4 |
| DAT++ | DAT-S++ | IN-1k : Sup. : 300 | COCO (train) : 36 | 430.0 | 51.2 | 72.6 | 56.3 | 35.8 | 55.4 | 65.6 |
| FAN | FAN-T-Hybrid | IN-1k : Sup. : 300 | COCO (train) : 36 | — | 45.8 | — | — | — | — | — |
| FAN | FAN-S-Hybrid | IN-1k : Sup. : 300 | COCO (train) : 36 | — | 49.1 | — | — | — | — | — |
| DAT | DAT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 272.0 | 44.4 | 67.6 | 48.5 | 28.3 | 47.5 | 58.5 |
| DAT | DAT-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 272.0 | 47.1 | 69.2 | 51.6 | 32.0 | 50.3 | 61.0 |
| DAT | DAT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 378.0 | 47.1 | 69.9 | 51.5 | 30.5 | 50.1 | 62.1 |
| DAT | DAT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 378.0 | 49.0 | 70.9 | 53.8 | 32.7 | 52.6 | 64.0 |
| NAT | NAT-M | IN-1k : Sup. : 300 | COCO (train) : 36 | 225.0 | 46.5 | 68.1 | 51.3 | — | — | — |
| NAT | NAT-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 258.0 | 47.7 | 69.0 | 52.6 | — | — | — |
| NAT | NAT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 330.0 | 48.4 | 69.8 | 53.2 | — | — | — |
| SSViT | SSViT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 223.0 | 47.3 | 69.1 | 51.7 | — | — | — |
| SSViT | SSViT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 266.0 | 51.2 | 72.0 | 56.0 | — | — | — |
| SSViT | SSViT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 266.0 | 49.4 | 70.8 | 54.1 | — | — | — |
| SSViT | SSViT-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 382.0 | 52.6 | 73.2 | 57.7 | — | — | — |
| SSViT | SSViT-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 382.0 | 51.0 | 72.5 | 55.8 | — | — | — |
| SSViT | SSViT-L | IN-1k : Sup. : 300 | COCO (train) : 12 | 572.0 | 51.6 | 72.9 | 56.6 | — | — | — |
| Iwin Transformer | Iwin-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 268.0 | 42.2 | 65.3 | 45.8 | — | — | — |
| Iwin Transformer | Iwin-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 268.0 | 44.7 | 67.2 | 48.8 | — | — | — |
| Iwin Transformer | Iwin-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 358.0 | 43.7 | 67.0 | 47.4 | — | — | — |
| Iwin Transformer | Iwin-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 358.0 | 45.5 | 67.5 | 49.6 | — | — | — |
| MA ViT | MA ViT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 219.0 | 47.6 | 69.5 | 52.5 | — | — | — |
| MA ViT | MA ViT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 262.0 | 50.2 | 71.7 | 55.3 | — | — | — |
| MA ViT | MA ViT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 262.0 | 51.4 | 72.6 | 56.2 | — | — | — |
| MA ViT | MA ViT-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 372.0 | 51.7 | 73.3 | 57.0 | — | — | — |
| MA ViT | MA ViT-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 372.0 | 53.2 | 74.1 | 58.5 | — | — | — |
| MA ViT | MA ViT-L | IN-1k : Sup. : 300 | COCO (train) : 12 | 501.0 | 52.5 | 73.6 | 57.8 | — | — | — |
| MA ViT | MA ViT-L | IN-1k : Sup. : 300 | COCO (train) : 36 | 501.0 | 53.6 | 74.3 | 58.7 | — | — | — |
| UniConvNet | UniConvNet-N2 | IN-1k : Sup. : 300 | COCO (train) : 12 | 220.0 | 46.6 | 68.0 | 51.3 | — | — | — |
| UniConvNet | UniConvNet-N2 | IN-1k : Sup. : 300 | COCO (train) : 36 | 220.0 | 48.4 | 69.7 | 53.2 | — | — | — |
| UniConvNet | UniConvNet-N3 | IN-1k : Sup. : 300 | COCO (train) : 12 | 239.0 | 47.0 | 68.6 | 51.8 | — | — | — |
| UniConvNet | UniConvNet-N3 | IN-1k : Sup. : 300 | COCO (train) : 36 | 239.0 | 49.4 | 70.7 | 54.4 | — | — | — |
| UniConvNet | UniConvNet-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 265.0 | 48.2 | 69.8 | 52.9 | — | — | — |
| UniConvNet | UniConvNet-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 265.0 | 50.1 | 71.0 | 54.8 | — | — | — |
| UniConvNet | UniConvNet-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 336.0 | 48.8 | 70.4 | 53.4 | — | — | — |
| UniConvNet | UniConvNet-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 336.0 | 50.8 | 71.6 | 55.6 | — | — | — |
| UniConvNet | UniConvNet-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 498.0 | 50.0 | 71.7 | 55.3 | — | — | — |
| UniConvNet | UniConvNet-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 498.0 | 51.2 | 72.2 | 56.1 | — | — | — |
| A2Mamba | A2Mamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 283.0 | 51.5 | — | — | — | — | — |
| A2Mamba | A2Mamba-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 410.0 | 52.7 | — | — | — | — | — |
| A2Mamba | A2Mamba-L | IN-1k : Sup. : 300 | COCO (train) : 36 | 552.0 | 53.0 | — | — | — | — | — |
| FractalMamba++ | FractalMamba++ (T) | IN-1k : Sup. : 300 | COCO (train) : 12 | 260.0 | 47.8 | 69.8 | 52.5 | — | — | — |
| FractalMamba++ | FractalMamba++ (T) | IN-1k : Sup. : 300 | COCO (train) : 36 | 260.0 | 49.5 | 71.0 | 54.6 | — | — | — |
| FractalMamba++ | FractalMamba++ (M) | IN-1k : Sup. : 300 | COCO (train) : 12 | 199.0 | 44.1 | 66.2 | 48.0 | — | — | — |
| FractalMamba++ | FractalMamba++ (M) | IN-1k : Sup. : 300 | COCO (train) : 36 | 199.0 | 46.8 | 68.5 | 51.3 | — | — | — |
| InceptionMamba | InceptionMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 233.0 | 46.0 | — | — | — | — | — |
| InceptionMamba | InceptionMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 301.0 | 47.5 | — | — | — | — | — |
| InceptionMamba | InceptionMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 421.0 | 48.1 | — | — | — | — | — |
| S2AFormer | S2AFormer-mini | IN-1k : Sup. : 300 | COCO (train) : 12 | 177.0 | 33.4 | 55.4 | 35.2 | — | — | — |
| S2AFormer | S2AFormer-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 182.0 | 37.6 | 59.8 | 40.6 | — | — | — |
| S2AFormer | S2AFormer-XS | IN-1k : Sup. : 300 | COCO (train) : 12 | 185.0 | 38.4 | 60.2 | 41.5 | — | — | — |
| S2AFormer | S2AFormer-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 197.0 | 41.0 | 62.5 | 45.0 | — | — | — |
| S2AFormer | S2AFormer-M | IN-1k : Sup. : 300 | COCO (train) : 12 | 253.0 | 42.6 | 64.5 | 46.9 | — | — | — |
| VMINet | VMINet-XS | IN-1k : Sup. : 300 | COCO (train) : 12 | 189.0 | 38.9 | 61.9 | 42.4 | — | — | — |
| VMINet | VMINet-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 201.0 | 43.2 | 65.3 | 47.3 | — | — | — |
| VMINet | VMINet-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 276.0 | 44.5 | 66.7 | 48.6 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b1 | IN-1k : Sup. : 300 | COCO (train) : 12 | 218.0 | 43.2 | 65.5 | 47.7 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b1 | IN-1k : Sup. : 300 | COCO (train) : 36 | 218.0 | 45.1 | 67.2 | 49.4 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b2 | IN-1k : Sup. : 300 | COCO (train) : 12 | 259.0 | 47.3 | 69.8 | 52.3 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b2 | IN-1k : Sup. : 300 | COCO (train) : 36 | 259.0 | 49.1 | 71.5 | 54.1 | — | — | — |
| UniNeXt | UniNeXt-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 266.0 | 48.6 | 70.6 | 53.4 | — | — | — |
| UniNeXt | UniNeXt-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 333.0 | 49.0 | 71.3 | 54.1 | — | — | — |
| UniNeXt | UniNeXt-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 460.0 | 49.3 | 71.4 | 54.1 | — | — | — |
COCO (val), mask AP
| family | model | pretrain | train | GFLOPs | AP^m | AP^m_50 | AP^m_75 | AP^m_S | AP^m_M | AP^m_L |
|---|---|---|---|---|---|---|---|---|---|---|
| TransNeXt | TransNeXt-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 356.0 | 44.6 | 68.6 | 48.1 | — | — | — |
| TransNeXt | TransNeXt-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 516.0 | 45.5 | 69.8 | 49.1 | — | — | — |
| TransNeXt | TransNeXt-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 728.0 | 45.9 | 70.5 | 49.7 | — | — | — |
| ConvNeXt | ConvNeXt-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 262.0 | 41.7 | 65.0 | 44.9 | — | — | — |
| Swin | Swin-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 267.0 | 41.6 | 65.1 | 44.9 | — | — | — |
| Swin | Swin-B | IN-1k : Sup. : 300 | COCO (train) : 50 | 600.0 | 44.5 | — | — | — | — | — |
| Swin | Swin-B | IN-22k : Sup. : 90 | COCO (train) : 50 | 700.0 | 45.4 | — | — | — | — | — |
| Swin | Swin-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 359.0 | 43.3 | — | — | — | — | — |
| Swin | Swin-L | IN-22k : Sup. : 90 | COCO (train) : 50 | 1100.0 | 46.2 | — | — | — | — | — |
| ConvNeXt V2 | ConvNeXt V2-H | IN-1k : FCMAE : 1600 | COCO (train) : 36 | 2525.0 | 48.9 | 72.8 | 53.6 | — | — | — |
| ConvNeXt V2 | ConvNeXt V2-L | IN-1k : FCMAE : 1600 | COCO (train) : 36 | 875.0 | 47.7 | 71.4 | 52.3 | — | — | — |
| ConvNeXt V2 | ConvNeXt V2-B | IN-1k : FCMAE : 1600 | COCO (train) : 36 | 486.0 | 46.6 | 70.0 | 51.1 | — | — | — |
| Hiera | Hiera-B | IN-1k : MAE : 1600 | COCO (train) : 100 | 600.0 | 46.3 | — | — | — | — | — |
| Hiera | Hiera-B+ | IN-1k : MAE : 1600 | COCO (train) : 100 | 600.0 | 47.3 | — | — | — | — | — |
| Hiera | Hiera-L | IN-1k : MAE : 1600 | COCO (train) : 100 | 1200.0 | 48.6 | — | — | — | — | — |
| InternImage | InternImage-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 270.0 | 42.5 | 66.1 | 45.8 | — | — | — |
| InternImage | InternImage-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 270.0 | 43.7 | 67.3 | 47.3 | — | — | — |
| InternImage | InternImage-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 340.0 | 43.3 | 67.1 | 46.7 | — | — | — |
| InternImage | InternImage-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 340.0 | 44.5 | 68.5 | 47.8 | — | — | — |
| InternImage | InternImage-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 501.0 | 44.0 | 67.8 | 47.4 | — | — | — |
| InternImage | InternImage-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 501.0 | 44.8 | 68.7 | 48.0 | — | — | — |
| FocalNet | FocalNet-T-LRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 268.0 | 41.5 | 65.1 | 44.5 | — | — | — |
| FocalNet | FocalNet-T-LRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 268.0 | 42.9 | 66.5 | 46.1 | — | — | — |
| FocalNet | FocalNet-T-SRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 268.0 | 41.3 | 65.0 | 44.3 | — | — | — |
| FocalNet | FocalNet-T-SRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 268.0 | 42.6 | 66.5 | 45.6 | — | — | — |
| FocalNet | FocalNet-S-SRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 356.0 | 42.7 | 67.1 | 45.7 | — | — | — |
| FocalNet | FocalNet-S-SRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 356.0 | 43.6 | 67.1 | 47.1 | — | — | — |
| FocalNet | FocalNet-S-LRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 365.0 | 43.1 | 67.4 | 46.2 | — | — | — |
| FocalNet | FocalNet-S-LRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 365.0 | 43.8 | 67.9 | 47.4 | — | — | — |
| FocalNet | FocalNet-B-LRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 507.0 | 43.5 | 67.9 | 46.7 | — | — | — |
| FocalNet | FocalNet-B-LRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 507.0 | 44.1 | 68.2 | 47.2 | — | — | — |
| FocalNet | FocalNet-B-SRF | IN-1k : Sup. : 300 | COCO (train) : 12 | 496.0 | 43.3 | 67.5 | 46.5 | — | — | — |
| FocalNet | FocalNet-B-SRF | IN-1k : Sup. : 300 | COCO (train) : 36 | 496.0 | 44.1 | 68.0 | 47.2 | — | — | — |
| CSWin | CSWin-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 279.0 | 42.2 | 65.6 | 45.4 | — | — | — |
| CSWin | CSWin-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 279.0 | 43.6 | 67.9 | 46.6 | — | — | — |
| CSWin | CSWin-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 342.0 | 43.2 | 67.1 | 46.2 | — | — | — |
| CSWin | CSWin-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 342.0 | 44.5 | 68.4 | 47.7 | — | — | — |
| CSWin | CSWin-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 526.0 | 43.9 | 67.8 | 47.3 | — | — | — |
| CSWin | CSWin-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 526.0 | 44.9 | 69.1 | 48.3 | — | — | — |
| CAFormer | CAFormer-S18 | IN-1k : Sup. : 300 | COCO (train) : 36 | 508.0 | 43.7 | 67.5 | 47.4 | — | — | — |
| ConvFormer | ConvFormer-S18 | IN-1k : Sup. : 300 | COCO (train) : 36 | 502.0 | 42.6 | 66.3 | 45.9 | — | — | — |
| MogaNet | MogaNet-XT | IN-1k : Sup. : 300 | COCO (train) : 12 | 185.0 | 37.6 | 59.6 | 40.2 | — | — | — |
| MogaNet | MogaNet-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 192.0 | 39.1 | 61.3 | 42.0 | — | — | — |
| MogaNet | MogaNet-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 272.0 | 42.2 | 65.4 | 45.5 | — | — | — |
| MogaNet | MogaNet-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 373.0 | 43.2 | 67.0 | 46.6 | — | — | — |
| VMamba | VMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 271.0 | 42.7 | 66.4 | 45.9 | — | — | — |
| VMamba | VMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 271.0 | 43.7 | 67.4 | 47.0 | — | — | — |
| VMamba | VMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 349.0 | 43.7 | 67.3 | 47.0 | — | — | — |
| VMamba | VMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 349.0 | 44.2 | 68.2 | 47.7 | — | — | — |
| VMamba | VMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 485.0 | 44.1 | 68.3 | 47.7 | — | — | — |
| BiFormer | BiFormer-S | IN-1k : Sup. : 300 | COCO (train) : 12 | — | 43.2 | 66.8 | 46.5 | — | — | — |
| BiFormer | BiFormer-B | IN-1k : Sup. : 300 | COCO (train) : 12 | — | 43.7 | 67.6 | 47.1 | — | — | — |
| MambaOut | MambaOut-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 262.0 | 41.0 | 64.1 | 44.1 | — | — | — |
| MambaOut | MambaOut-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 354.0 | 42.7 | 66.1 | 46.2 | — | — | — |
| MambaOut | MambaOut-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 495.0 | 43.0 | 66.4 | 46.3 | — | — | — |
| GroupMamba | GroupMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 279.0 | 42.9 | 66.5 | 46.3 | — | — | — |
| PlainMamba | PlainMamba-L1 | IN-1k : Sup. : 300 | COCO (train) : 12 | 388.0 | 39.1 | 61.6 | 41.9 | — | — | — |
| PlainMamba | PlainMamba-L2 | IN-1k : Sup. : 300 | COCO (train) : 12 | 542.0 | 40.6 | 63.8 | 43.6 | — | — | — |
| PlainMamba | PlainMamba-L3 | IN-1k : Sup. : 300 | COCO (train) : 12 | 696.0 | 41.2 | 64.7 | 43.9 | — | — | — |
| LocalVMamba | LocalVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 291.0 | 42.2 | 65.7 | 45.5 | — | — | — |
| LocalVMamba | LocalVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 291.0 | 43.4 | 67.0 | 46.4 | — | — | — |
| LocalVMamba | LocalVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 414.0 | 43.2 | 66.7 | 46.5 | — | — | — |
| LocalVMamba | LocalVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 414.0 | 44.1 | 67.8 | 47.4 | — | — | — |
| EfficientVMamba | EfficientVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 60.0 | 33.2 | 54.4 | 35.1 | — | — | — |
| EfficientVMamba | EfficientVMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 60.0 | 35.3 | 57.2 | 37.6 | — | — | — |
| EfficientVMamba | EfficientVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 197.0 | 36.7 | 58.9 | 39.2 | — | — | — |
| EfficientVMamba | EfficientVMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 197.0 | 38.2 | 60.8 | 40.7 | — | — | — |
| EfficientVMamba | EfficientVMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 252.0 | 40.2 | 63.3 | 42.9 | — | — | — |
| EfficientVMamba | EfficientVMamba-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 252.0 | 40.8 | 64.1 | 43.7 | — | — | — |
| DAMamba | DAMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 284.0 | 43.4 | 67.2 | 46.7 | — | — | — |
| DAMamba | DAMamba-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 284.0 | 44.8 | 68.6 | 48.6 | — | — | — |
| DAMamba | DAMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 395.0 | 44.5 | 68.4 | 48.2 | — | — | — |
| DAMamba | DAMamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 395.0 | 45.1 | 69.2 | 49.1 | — | — | — |
| DAMamba | DAMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 520.0 | 44.9 | 68.9 | 48.7 | — | — | — |
| DAMamba | DAMamba-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 520.0 | 45.3 | 69.5 | 48.9 | — | — | — |
| VSSD | VSSD-M | IN-1k : Sup. : 300 | COCO (train) : 12 | 220.0 | 41.3 | 64.5 | 44.6 | — | — | — |
| VSSD | VSSD-M | IN-1k : Sup. : 300 | COCO (train) : 36 | 220.0 | 42.8 | 66.5 | 46.0 | — | — | — |
| VSSD | VSSD-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 265.0 | 42.6 | 66.4 | 45.9 | — | — | — |
| VSSD | VSSD-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 265.0 | 43.6 | 67.6 | 46.9 | — | — | — |
| VSSD | VSSD-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 325.0 | 43.5 | 67.2 | 47.1 | — | — | — |
| RMT | RMT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 218.0 | 42.6 | 65.8 | 45.9 | — | — | — |
| RMT | RMT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 262.0 | 43.9 | 67.8 | 47.4 | — | — | — |
| RMT | RMT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 262.0 | 44.9 | 69.1 | 48.4 | — | — | — |
| RMT | RMT-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 373.0 | 45.5 | 69.7 | 49.3 | — | — | — |
| RMT | RMT-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 373.0 | 46.1 | 70.4 | 49.9 | — | — | — |
| RMT | RMT-L | IN-1k : Sup. : 300 | COCO (train) : 12 | 557.0 | 45.9 | 70.3 | 49.8 | — | — | — |
| DAT++ | DAT-T++ | IN-1k : Sup. : 300 | COCO (train) : 12 | 301.0 | 43.7 | 67.9 | 47.3 | 24.5 | 47.4 | 62.4 |
| DAT++ | DAT-T++ | IN-1k : Sup. : 300 | COCO (train) : 36 | 301.0 | 45.1 | 69.2 | 48.7 | 26.7 | 48.5 | 64.0 |
| DAT++ | DAT-S++ | IN-1k : Sup. : 300 | COCO (train) : 12 | 430.0 | 44.5 | 68.7 | 48.2 | 25.0 | 48.0 | 63.3 |
| DAT++ | DAT-S++ | IN-1k : Sup. : 300 | COCO (train) : 36 | 430.0 | 45.7 | 69.9 | 49.7 | 27.6 | 49.2 | 64.3 |
| DAT | DAT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 272.0 | 40.4 | 64.2 | 43.1 | 23.9 | 43.8 | 55.5 |
| DAT | DAT-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 272.0 | 42.4 | 66.1 | 45.5 | 27.2 | 45.8 | 57.1 |
| DAT | DAT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 378.0 | 42.5 | 66.7 | 45.4 | 25.5 | 45.8 | 58.5 |
| DAT | DAT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 378.0 | 44.0 | 68.0 | 47.5 | 27.8 | 47.7 | 59.5 |
| NAT | NAT-M | IN-1k : Sup. : 300 | COCO (train) : 36 | 225.0 | 41.7 | 65.2 | 44.7 | — | — | — |
| NAT | NAT-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 258.0 | 42.6 | 66.1 | 45.9 | — | — | — |
| NAT | NAT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 330.0 | 43.2 | 66.9 | 46.5 | — | — | — |
| SSViT | SSViT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 223.0 | 42.6 | 66.2 | 45.8 | — | — | — |
| SSViT | SSViT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 266.0 | 45.4 | 69.7 | 49.0 | — | — | — |
| SSViT | SSViT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 266.0 | 44.0 | 67.7 | 47.3 | — | — | — |
| SSViT | SSViT-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 382.0 | 46.4 | 70.9 | 50.3 | — | — | — |
| SSViT | SSViT-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 382.0 | 45.4 | 69.7 | 48.9 | — | — | — |
| SSViT | SSViT-L | IN-1k : Sup. : 300 | COCO (train) : 12 | 572.0 | 46.0 | 70.1 | 49.8 | — | — | — |
| Iwin Transformer | Iwin-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 268.0 | 38.9 | 62.1 | 41.6 | — | — | — |
| Iwin Transformer | Iwin-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 268.0 | 40.9 | 64.1 | 43.6 | — | — | — |
| Iwin Transformer | Iwin-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 358.0 | 40.0 | 63.9 | 42.5 | — | — | — |
| Iwin Transformer | Iwin-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 358.0 | 41.0 | 64.3 | 44.0 | — | — | — |
| MA ViT | MA ViT-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 219.0 | 42.9 | 66.5 | 46.4 | — | — | — |
| MA ViT | MA ViT-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 262.0 | 44.7 | 68.7 | 47.9 | — | — | — |
| MA ViT | MA ViT-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 262.0 | 45.5 | 69.8 | 49.2 | — | — | — |
| MA ViT | MA ViT-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 372.0 | 46.1 | 70.6 | 50.1 | — | — | — |
| MA ViT | MA ViT-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 372.0 | 47.0 | 71.5 | 51.1 | — | — | — |
| MA ViT | MA ViT-L | IN-1k : Sup. : 300 | COCO (train) : 12 | 501.0 | 46.5 | 71.0 | 50.6 | — | — | — |
| MA ViT | MA ViT-L | IN-1k : Sup. : 300 | COCO (train) : 36 | 501.0 | 47.2 | 71.5 | 51.4 | — | — | — |
| UniConvNet | UniConvNet-N2 | IN-1k : Sup. : 300 | COCO (train) : 12 | 220.0 | 41.9 | 65.1 | 45.2 | — | — | — |
| UniConvNet | UniConvNet-N2 | IN-1k : Sup. : 300 | COCO (train) : 36 | 220.0 | 43.2 | 66.7 | 46.4 | — | — | — |
| UniConvNet | UniConvNet-N3 | IN-1k : Sup. : 300 | COCO (train) : 12 | 239.0 | 42.4 | 65.6 | 45.7 | — | — | — |
| UniConvNet | UniConvNet-N3 | IN-1k : Sup. : 300 | COCO (train) : 36 | 239.0 | 44.2 | 67.9 | 47.5 | — | — | — |
| UniConvNet | UniConvNet-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 265.0 | 43.3 | 66.6 | 45.7 | — | — | — |
| UniConvNet | UniConvNet-T | IN-1k : Sup. : 300 | COCO (train) : 36 | 265.0 | 44.5 | 68.4 | 48.0 | — | — | — |
| UniConvNet | UniConvNet-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 336.0 | 43.8 | 67.4 | 47.3 | — | — | — |
| UniConvNet | UniConvNet-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 336.0 | 45.2 | 69.3 | 48.9 | — | — | — |
| UniConvNet | UniConvNet-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 498.0 | 45.0 | 69.0 | 48.5 | — | — | — |
| UniConvNet | UniConvNet-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 498.0 | 45.6 | 69.6 | 49.2 | — | — | — |
| A2Mamba | A2Mamba-S | IN-1k : Sup. : 300 | COCO (train) : 36 | 283.0 | 45.3 | — | — | — | — | — |
| A2Mamba | A2Mamba-B | IN-1k : Sup. : 300 | COCO (train) : 36 | 410.0 | 46.8 | — | — | — | — | — |
| A2Mamba | A2Mamba-L | IN-1k : Sup. : 300 | COCO (train) : 36 | 552.0 | 46.8 | — | — | — | — | — |
| FractalMamba++ | FractalMamba++ (T) | IN-1k : Sup. : 300 | COCO (train) : 12 | 260.0 | 43.2 | 66.8 | 46.6 | — | — | — |
| FractalMamba++ | FractalMamba++ (T) | IN-1k : Sup. : 300 | COCO (train) : 36 | 260.0 | 44.1 | 67.9 | 47.5 | — | — | — |
| FractalMamba++ | FractalMamba++ (M) | IN-1k : Sup. : 300 | COCO (train) : 12 | 199.0 | 40.3 | 63.2 | 43.4 | — | — | — |
| FractalMamba++ | FractalMamba++ (M) | IN-1k : Sup. : 300 | COCO (train) : 36 | 199.0 | 42.2 | 65.4 | 45.3 | — | — | — |
| InceptionMamba | InceptionMamba-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 233.0 | 41.8 | — | — | — | — | — |
| InceptionMamba | InceptionMamba-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 301.0 | 42.6 | — | — | — | — | — |
| InceptionMamba | InceptionMamba-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 421.0 | 43.1 | — | — | — | — | — |
| S2AFormer | S2AFormer-mini | IN-1k : Sup. : 300 | COCO (train) : 12 | 177.0 | 31.7 | 52.5 | 33.3 | — | — | — |
| S2AFormer | S2AFormer-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 182.0 | 35.4 | 57.2 | 37.6 | — | — | — |
| S2AFormer | S2AFormer-XS | IN-1k : Sup. : 300 | COCO (train) : 12 | 185.0 | 35.8 | 57.3 | 38.1 | — | — | — |
| S2AFormer | S2AFormer-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 197.0 | 37.6 | 59.7 | 40.3 | — | — | — |
| S2AFormer | S2AFormer-M | IN-1k : Sup. : 300 | COCO (train) : 12 | 253.0 | 39.3 | 62.0 | 41.7 | — | — | — |
| VMINet | VMINet-XS | IN-1k : Sup. : 300 | COCO (train) : 12 | 189.0 | 36.4 | 58.7 | 38.8 | — | — | — |
| VMINet | VMINet-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 201.0 | 39.3 | 62.2 | 42.3 | — | — | — |
| VMINet | VMINet-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 276.0 | 40.5 | 63.7 | 43.7 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b1 | IN-1k : Sup. : 300 | COCO (train) : 12 | 218.0 | 39.5 | 60.1 | 42.7 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b1 | IN-1k : Sup. : 300 | COCO (train) : 36 | 218.0 | 41.2 | 61.9 | 43.8 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b2 | IN-1k : Sup. : 300 | COCO (train) : 12 | 259.0 | 43.4 | 66.9 | 46.4 | — | — | — |
| Mamba-Adaptor | Mamba-Adaptor-b2 | IN-1k : Sup. : 300 | COCO (train) : 36 | 259.0 | 44.8 | 67.3 | 48.3 | — | — | — |
| UniNeXt | UniNeXt-T | IN-1k : Sup. : 300 | COCO (train) : 12 | 266.0 | 43.4 | 67.6 | 46.7 | — | — | — |
| UniNeXt | UniNeXt-S | IN-1k : Sup. : 300 | COCO (train) : 12 | 333.0 | 43.7 | 68.0 | 46.9 | — | — | — |
| UniNeXt | UniNeXt-B | IN-1k : Sup. : 300 | COCO (train) : 12 | 460.0 | 43.9 | 68.3 | 47.3 | — | — | — |