Heedless Backbones

DeiT III Family

Select an option
Results
Parameters (M)
Images / Second
Publication Date
Select an option
---------
Object Detection
Instance Segmentation
Classification
Semantic Segmentation
Panoptic Segmentation
Select an option
---------
Cityscapes (val)
Cityscapes (test)
ADE20K (val)
ADE20K (test)
Select an option
mIoUms
pAccms
mAccms
mIoUss
pAccss
mAccss
GFLOPs
Select an option
---------
UPerNet
Mask2Former
Panoptic FPN
Select an option
----------
512x2048
640x2560
Select an option
Results
Parameters (M)
Images / Second
GFLOPs
Publication Date
Select an option
---------
JFT-3B
ImageNet-1k
ImageNet-22k
JFT-300M
Select an option
----------
Supervised
FCMAE
MAE
CL
Select an option
Family
Pretrain Dataset
Pretrain Method
Semantic Segmentation Head
Semantic Segmentation Resolution
Semantic Segmentation Training Epochs
Select an option
----------
Family
Pretrain Method
Semantic Segmentation Head
Semantic Segmentation Resolution
Semantic Segmentation Training Epochs
modelparams (m)pretrainheadtrainGFLOPsmIoUms
ViT-S (DeiT III)22.0IN-1k : Sup. : 800UPerNetADE20K (train) : 128 : 512588.046.8
ViT-B (DeiT III)86.6IN-1k : Sup. : 800UPerNetADE20K (train) : 128 : 5121283.050.2
ViT-B (DeiT III)86.6IN-22k : Sup. : 90UPerNetADE20K (train) : 128 : 5121283.052.8
ViT-L (DeiT III)304.4IN-1k : Sup. : 800UPerNetADE20K (train) : 128 : 5122231.052.0
ViT-L (DeiT III)304.4IN-22k : Sup. : 90UPerNetADE20K (train) : 128 : 5122231.054.7
modelparams (m)pretrainfinetunegflopsIN-1kIN-V2
ViT-S (DeiT III)22.0IN-1k : Sup. : 800IN-1k : 20 : 2244.681.4/—70.5/—
ViT-S (DeiT III)22.0IN-1k : Sup. : 800IN-1k : 20 : 38415.583.4/—73.1/—
ViT-S (DeiT III)22.0IN-22k : Sup. : 90IN-1k : 50 : 2244.683.1/—73.8/—
ViT-B (DeiT III)86.6IN-1k : Sup. : 800IN-1k : 20 : 22417.583.8/—73.6/—
ViT-B (DeiT III)86.6IN-1k : Sup. : 800IN-1k : 20 : 38455.585.0/—74.8/—
ViT-B (DeiT III)86.6IN-22k : Sup. : 90IN-1k : 50 : 22417.685.7/—76.5/—
ViT-B (DeiT III)86.6IN-22k : Sup. : 90IN-1k : 50 : 38455.586.7/—77.9/—
ViT-L (DeiT III)304.4IN-1k : Sup. : 800IN-1k : 20 : 22461.684.9/—75.1/—
ViT-L (DeiT III)304.4IN-1k : Sup. : 800IN-1k : 20 : 384191.285.8/—76.6/—
ViT-L (DeiT III)304.4IN-22k : Sup. : 90IN-1k : 50 : 22461.687.0/—78.6/—
ViT-L (DeiT III)304.4IN-22k : Sup. : 90IN-1k : 50 : 384191.287.7/—79.1/—
ViT-H (DeiT III)632.1IN-1k : Sup. : 800IN-1k : 20 : 224167.485.2/—75.9/—
ViT-H (DeiT III)632.1IN-22k : Sup. : 90IN-1k : 50 : 224167.487.2/—79.2/—

ADE20K (val)

modelpretrainheadtraingflopsmIoUmspAccmsmAccmsmIoUsspAccssmAccss
ViT-S (DeiT III)IN-1k : Sup. : 800UPerNetADE20K (train) : 128 : 512588.046.845.6
ViT-B (DeiT III)IN-1k : Sup. : 800UPerNetADE20K (train) : 128 : 5121283.050.249.3
ViT-B (DeiT III)IN-22k : Sup. : 90UPerNetADE20K (train) : 128 : 5121283.052.851.8
ViT-L (DeiT III)IN-1k : Sup. : 800UPerNetADE20K (train) : 128 : 5122231.052.051.5
ViT-L (DeiT III)IN-22k : Sup. : 90UPerNetADE20K (train) : 128 : 5122231.054.753.8