Heedless Backbones

MaxViT Family

model      params (M)  pretrain            head                train              GFLOPs  mAP
MaxViT-T   31.0        IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  475.0   52.1
MaxViT-S   69.0        IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  595.0   53.1
MaxViT-B   120.0       IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  856.0   53.4

model      params (M)  pretrain            finetune          GFLOPs  IN-1k
MaxViT-T   31.0        IN-1k : Sup. : 300  — : — : —         5.6     83.62/—
MaxViT-T   31.0        IN-1k : Sup. : 300  IN-1k : 30 : 384  17.7    85.24/—
MaxViT-T   31.0        IN-1k : Sup. : 300  IN-1k : 30 : 512  33.7    85.72/—
MaxViT-S   69.0        IN-1k : Sup. : 300  — : — : —         11.7    84.45/—
MaxViT-S   69.0        IN-1k : Sup. : 300  IN-1k : 30 : 384  36.1    85.74/—
MaxViT-S   69.0        IN-1k : Sup. : 300  IN-1k : 30 : 512  67.6    86.19/—
MaxViT-B   120.0       IN-1k : Sup. : 300  — : — : —         23.4    84.95/—
MaxViT-B   120.0       IN-1k : Sup. : 300  IN-1k : 30 : 384  74.2    86.34/—
MaxViT-B   120.0       IN-1k : Sup. : 300  IN-1k : 30 : 512  138.5   86.66/—
MaxViT-B   120.0       IN-22k : Sup. : 90  IN-1k : 30 : 384  74.2    88.24/—
MaxViT-B   120.0       IN-22k : Sup. : 90  IN-1k : 30 : 512  138.3   88.38/—
MaxViT-L   212.0       IN-1k : Sup. : 300  — : — : —         43.9    85.17/—
MaxViT-L   212.0       IN-1k : Sup. : 300  IN-1k : 30 : 384  133.1   86.4/—
MaxViT-L   212.0       IN-1k : Sup. : 300  IN-1k : 30 : 512  245.4   86.7/—
MaxViT-L   212.0       IN-22k : Sup. : 90  IN-1k : 30 : 384  128.7   88.32/—
MaxViT-L   212.0       IN-22k : Sup. : 90  IN-1k : 30 : 512  245.2   88.46/—
MaxViT-XL  475.0       IN-22k : Sup. : 90  IN-1k : 30 : 384  293.7   88.51/—
MaxViT-XL  475.0       IN-22k : Sup. : 90  IN-1k : 30 : 512  535.2   88.7/—
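
The pretrain and finetune cells above pack several fields into one string separated by " : ". The sketch below shows one way to split them into named fields. The field order is an assumption inferred from the values (pretrain as dataset : method : epochs, finetune as dataset : epochs : resolution); the page itself does not label them. All names (`Pretrain`, `Finetune`, `parse_pretrain`, `parse_finetune`) are hypothetical helpers.

```python
from typing import List, NamedTuple, Optional


class Pretrain(NamedTuple):
    dataset: str   # e.g. "IN-1k" (assumed first field)
    method: str    # e.g. "Sup." (assumed second field)
    epochs: int    # e.g. 300 (assumed third field)


class Finetune(NamedTuple):
    dataset: str      # e.g. "IN-1k" (assumed first field)
    epochs: int       # e.g. 30 (assumed second field)
    resolution: int   # e.g. 384 (assumed third field)


def split_cell(cell: str) -> List[str]:
    """Split a packed cell such as 'IN-1k : Sup. : 300' into its parts."""
    return [part.strip() for part in cell.split(" : ")]


def parse_pretrain(cell: str) -> Pretrain:
    dataset, method, epochs = split_cell(cell)
    return Pretrain(dataset, method, int(epochs))


def parse_finetune(cell: str) -> Optional[Finetune]:
    parts = split_cell(cell)
    # A cell whose fields are all dashes is treated as "no finetuning".
    if all(part == "—" for part in parts):
        return None
    dataset, epochs, resolution = parts
    return Finetune(dataset, int(epochs), int(resolution))
```

For example, `parse_finetune("IN-1k : 30 : 512")` yields a record whose `resolution` is 512, while `parse_finetune("— : — : —")` yields `None`.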

COCO (val)

model     pretrain            head                train              GFLOPs  mAP^b  AP^b_50  AP^b_75  mAP^b_s  mAP^b_m  mAP^b_l
MaxViT-T  IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  475.0   52.1   71.9     56.8     —        —        —
MaxViT-S  IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  595.0   53.1   72.5     58.1     —        —        —
MaxViT-B  IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  856.0   53.4   72.9     58.1     —        —        —

COCO (val)

model     pretrain            head                train              GFLOPs  mAP^m  AP^m_50  AP^m_75  mAP^m_s  mAP^m_m  mAP^m_l
MaxViT-T  IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  475.0   44.6   69.1     48.4     —        —        —
MaxViT-S  IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  595.0   45.4   69.8     49.5     —        —        —
MaxViT-B  IN-1k : Sup. : 300  Cascade Mask R-CNN  COCO (train) : 36  856.0   45.7   70.3     50.0     —        —        —