Heedless Backbones

CoCA ViT Family

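| model | params (M) | pretrain | finetune | GFLOPs | Top-1 |
|---|---|---|---|---|---|
| CoCA ViT-11M | 11.4 | IN-1k : Sup. : 300 | - : - : - | 2.2 | 82.7 |
| CoCA ViT-21M | 20.6 | IN-1k : Sup. : 300 | - : - : - | 4.1 | 83.6 |
| CoCA ViT-28M | 27.8 | IN-1k : Sup. : 300 | - : - : - | 4.9 | 84.0 |
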
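| model | params (M) | pretrain | finetune | GFLOPs | IN-1k | IN-ReaL | IN-V2 | IN-A | IN-R | IN-Sketch |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-11M | 11.4 | IN-1k : Sup. : 300 | - : - : - | 2.2 | 82.7/- | 87.8/- | 72.9/- | 34.1/- | 47.8/- | 35.5/- |
| CoCA ViT-21M | 20.6 | IN-1k : Sup. : 300 | - : - : - | 4.1 | 83.6/- | 88.3/- | 73.7/- | 38.9/- | 50.1/- | 39.6/- |
| CoCA ViT-28M | 27.8 | IN-1k : Sup. : 300 | - : - : - | 4.9 | 84.0/- | 88.4/- | 74.0/- | 39.8/- | 51.1/- | 40.2/- |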

COCO (val)

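| model | pretrain | head | train | GFLOPs | mAP^b | AP^b_50 | AP^b_75 | mAP^b_s | mAP^b_m | mAP^b_l |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-21M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 51.8 | 70.5 | 56.1 | | | |
| CoCA ViT-28M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 52.2 | 71.0 | 56.8 | | | |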

COCO (val)

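| model | pretrain | head | train | GFLOPs | mAP^m | AP^m_50 | AP^m_75 | mAP^m_s | mAP^m_m | mAP^m_l |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-21M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 44.9 | 67.8 | 48.2 | | | |
| CoCA ViT-28M | IN-1k : Sup. : 300 | Cascade Mask R-CNN | COCO (train) : 36 | None | 45.2 | 68.9 | 48.8 | | | |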

ADE20K (val)

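| model | pretrain | head | train | GFLOPs | mIoU (ms) | pAcc (ms) | mAcc (ms) | mIoU (ss) | pAcc (ss) | mAcc (ss) |
|---|---|---|---|---|---|---|---|---|---|---|
| CoCA ViT-21M | IN-1k : Sup. : 300 | UPerNet | ADE20K (train) : 128 : 512 | None | 50.8 | | | | | |
| CoCA ViT-28M | IN-1k : Sup. : 300 | UPerNet | ADE20K (train) : 128 : 512 | None | 51.3 | | | | | |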