CoCA ViT Family
No Results
| model | params (m) | pretrain | finetune | gflops | IN-1k | IN-ReaL | IN-V2 | IN-A | IN-R | IN-Sketch |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-11M | 11.4 | IN-1k : Sup. : 300 | — : — : — | 2.2 | 82.7/— | 87.8/— | 72.9/— | 34.1/— | 47.8/— | 35.5/— |
| CoCA ViT-21M | 20.6 | IN-1k : Sup. : 300 | — : — : — | 4.1 | 83.6/— | 88.3/— | 73.7/— | 38.9/— | 50.1/— | 39.6/— |
| CoCA ViT-28M | 27.8 | IN-1k : Sup. : 300 | — : — : — | 4.9 | 84.0/— | 88.4/— | 74.0/— | 39.8/— | 51.1/— | 40.2/— |
COCO (val)
| model | pretrain | head | train | gflops | mAPb | APb50 | APb75 | mAPbs | mAPbm | mAPbl |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-21M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 51.8 | 70.5 | 56.1 | — | — | — |
| CoCA ViT-28M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 52.2 | 71.0 | 56.8 | — | — | — |
COCO (val)
| model | pretrain | head | train | gflops | mAPm | APm50 | APm75 | mAPms | mAPmm | mAPml |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-21M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 44.9 | 67.8 | 48.2 | — | — | — |
| CoCA ViT-28M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 45.2 | 68.9 | 48.8 | — | — | — |
ADE20K (val)
| model | pretrain | head | train | gflops | mIoUms | pAccms | mAccms | mIoUss | pAccss | mAccss |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-21M | IN-1k : Sup. : 300 | UPerNet | ADE20K (train) : 128 : 512 | None | 50.8 | — | — | — | — | — |
| CoCA ViT-28M | IN-1k : Sup. : 300 | UPerNet | ADE20K (train) : 128 : 512 | None | 51.3 | — | — | — | — | — |