S2AFormer Family
| model | params (m) | pretrain | head | train | GFLOPs | mAP |
|---|---|---|---|---|---|---|
| S2AFormer-mini | 5.02 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 177.0 | 31.7 |
| S2AFormer-T | 5.8 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 182.0 | 35.4 |
| S2AFormer-XS | 6.54 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 185.0 | 35.8 |
| S2AFormer-S | 10.69 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 197.0 | 37.6 |
| S2AFormer-M | 24.87 | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 253.0 | 39.3 |
| model | params (m) | pretrain | finetune | gflops | IN-1k |
|---|---|---|---|---|---|
| S2AFormer-mini | 5.02 | IN-1k : Sup. : 300 | — : — : — | 0.43 | 75.1/— |
| S2AFormer-T | 5.8 | IN-1k : Sup. : 300 | — : — : — | 0.655 | 77.7/— |
| S2AFormer-XS | 6.54 | IN-1k : Sup. : 300 | — : — : — | 0.786 | 78.9/— |
| S2AFormer-S | 10.69 | IN-1k : Sup. : 300 | — : — : — | 1.38 | 80.8/— |
| S2AFormer-M | 24.87 | IN-1k : Sup. : 300 | — : — : — | 4.12 | 82.3/— |
COCO (val)
| model | pretrain | head | train | gflops | mAPb | APb50 | APb75 | mAPbs | mAPbm | mAPbl |
|---|---|---|---|---|---|---|---|---|---|---|
| S2AFormer-mini | IN-1k : Sup. : 300 | RetinaNet | COCO (train) : 12 | 159.0 | 33.4 | 53.2 | 34.9 | 19.9 | 36.3 | 44.5 |
| S2AFormer-mini | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 177.0 | 33.4 | 55.4 | 35.2 | — | — | — |
| S2AFormer-T | IN-1k : Sup. : 300 | RetinaNet | COCO (train) : 12 | 164.0 | 36.7 | 57.0 | 39.1 | 21.1 | 39.7 | 48.6 |
| S2AFormer-T | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 182.0 | 37.6 | 59.8 | 40.6 | — | — | — |
| S2AFormer-XS | IN-1k : Sup. : 300 | RetinaNet | COCO (train) : 12 | 166.0 | 37.9 | 58.6 | 40.3 | 22.9 | 41.5 | 49.8 |
| S2AFormer-XS | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 185.0 | 38.4 | 60.2 | 41.5 | — | — | — |
| S2AFormer-S | IN-1k : Sup. : 300 | RetinaNet | COCO (train) : 12 | 178.0 | 40.0 | 60.9 | 42.7 | 24.4 | 43.6 | 52.9 |
| S2AFormer-S | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 197.0 | 41.0 | 62.5 | 45.0 | — | — | — |
| S2AFormer-M | IN-1k : Sup. : 300 | RetinaNet | COCO (train) : 12 | 234.0 | 41.7 | 62.4 | 44.5 | 25.8 | 44.6 | 55.4 |
| S2AFormer-M | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 253.0 | 42.6 | 64.5 | 46.9 | — | — | — |
COCO (val)
| model | pretrain | head | train | gflops | mAPm | APm50 | APm75 | mAPms | mAPmm | mAPml |
|---|---|---|---|---|---|---|---|---|---|---|
| S2AFormer-mini | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 177.0 | 31.7 | 52.5 | 33.3 | — | — | — |
| S2AFormer-T | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 182.0 | 35.4 | 57.2 | 37.6 | — | — | — |
| S2AFormer-XS | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 185.0 | 35.8 | 57.3 | 38.1 | — | — | — |
| S2AFormer-S | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 197.0 | 37.6 | 59.7 | 40.3 | — | — | — |
| S2AFormer-M | IN-1k : Sup. : 300 | Mask R-CNN | COCO (train) : 12 | 253.0 | 39.3 | 62.0 | 41.7 | — | — | — |
ADE20K (val)
| model | pretrain | head | train | gflops | mIoUms | pAccms | mAccms | mIoUss | pAccss | mAccss |
|---|---|---|---|---|---|---|---|---|---|---|
| S2AFormer-mini | IN-1k : Sup. : 300 | Panoptic FPN | ADE20K (train) : 1000 : 512 | 23.0 | — | — | — | 36.7 | — | — |
| S2AFormer-T | IN-1k : Sup. : 300 | Panoptic FPN | ADE20K (train) : 1000 : 512 | 25.0 | — | — | — | 38.0 | — | — |
| S2AFormer-XS | IN-1k : Sup. : 300 | Panoptic FPN | ADE20K (train) : 1000 : 512 | 25.0 | — | — | — | 39.2 | — | — |
| S2AFormer-S | IN-1k : Sup. : 300 | Panoptic FPN | ADE20K (train) : 1000 : 512 | 28.0 | — | — | — | 40.8 | — | — |
| S2AFormer-M | IN-1k : Sup. : 300 | Panoptic FPN | ADE20K (train) : 1000 : 512 | 43.0 | — | — | — | 43.7 | — | — |