Heedless Backbones

LocalVim Family

Select an option
Results
Parameters (M)
Images / Second
Publication Date
Select an option
---------
Object Detection
Instance Segmentation
Classification
Semantic Segmentation
Panoptic Segmentation
Select an option
---------
Cityscapes (val)
Cityscapes (test)
ADE20K (val)
ADE20K (test)
PASCAL VOC 2007 (val)
PASCAL VOC 2007 (test)
Select an option
mIoUms
pAccms
mAccms
mIoUss
pAccss
mAccss
GFLOPs
Select an option
---------
UPerNet
Mask2Former
Panoptic FPN
SETR
Select an option
----------
512x2048
640x2560
Select an option
Results
Parameters (M)
Images / Second
GFLOPs
Publication Date
Select an option
---------
MegData73M
JFT-3B
JFT-300M
ImageNet-1k
ImageNet-22k
Select an option
----------
Supervised
Sup. + TL
FCMAE
MAE
CL
Select an option
Family
Pretrain Dataset
Semantic Segmentation Head
Semantic Segmentation Resolution
Semantic Segmentation Training Epochs
Select an option
----------
Family
Pretrain Method
Semantic Segmentation Head
Semantic Segmentation Resolution
Semantic Segmentation Training Epochs
modelparams (m)pretrainheadtrainGFLOPsmIoUms
LocalVim-T8.0IN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 512181.044.4
LocalVim-S28.0IN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 512297.047.5
modelparams (m)pretrainfinetunegflopsIN-1k
LocalVim-T8.0IN-1k : Sup. : 300— : — : —1.576.2/—
LocalVim-S28.0IN-1k : Sup. : 300— : — : —4.881.2/—

COCO (val)

modelpretrainheadtraingflopsmAPbAPb50APb75mAPbsmAPbmmAPbl
LocalVim-TIN-1k : Sup. : 300Cascade Mask R-CNNCOCO (train) : 200403.045.366.249.126.049.561.7

COCO (val)

modelpretrainheadtraingflopsmAPmAPm50APm75mAPmsmAPmmmAPml
LocalVim-TIN-1k : Sup. : 300Cascade Mask R-CNNCOCO (train) : 200403.039.963.042.517.743.060.5

ADE20K (val)

modelpretrainheadtraingflopsmIoUmspAccmsmAccmsmIoUsspAccssmAccss
LocalVim-TIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 512181.044.443.4
LocalVim-SIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 512297.047.546.4