Heedless Backbones

FocalNet Family

Select an option
Results
Parameters (M)
Images / Second
Publication Date
Select an option
---------
Object Detection
Instance Segmentation
Classification
Semantic Segmentation
Panoptic Segmentation
Select an option
---------
ImageNet-1k
ImageNet-A
ImageNet-R
ImageNet-Sketch
ImageNet-C
ImageNet-C-bar
ImageNet-V2
ImageNet-ReaL
Select an option
----------
Top-1
Top-5
GFLOPs
Select an option
----------
224x224
384x384
512x512
Select an option
Results
Parameters (M)
Images / Second
GFLOPs
Publication Date
Select an option
---------
JFT-3B
ImageNet-1k
ImageNet-22k
JFT-300M
Select an option
----------
Supervised
FCMAE
MAE
CL
Select an option
Family
Pretrain Dataset
Pretrain Method
Classification Resolution
Select an option
----------
Pretrain Dataset
Pretrain Method
Classification Resolution
modelparams (m)pretrainfinetuneGFLOPsTop-1
FocalNet-T-LRF28.6IN-1k : Sup. : 300— : — : —4.582.3
FocalNet-T-SRF28.4IN-1k : Sup. : 300— : — : —4.582.1
FocalNet-S-SRF50.3IN-1k : Sup. : 300— : — : —8.783.4
FocalNet-S-LRF50.3IN-1k : Sup. : 300— : — : —8.783.5
FocalNet-B-LRF88.7IN-1k : Sup. : 300— : — : —15.483.9
FocalNet-B-SRF88.1IN-1k : Sup. : 300— : — : —15.383.7
FocalNet-B-SRF88.1IN-22k : Sup. : 90IN-1k : 30 : 22415.385.6
FocalNet-B-SRF88.1IN-22k : Sup. : 90IN-1k : 30 : 38444.886.5
FocalNet-L-SRF197.1IN-22k : Sup. : 90IN-1k : 30 : 22434.286.5
FocalNet-L-SRF197.1IN-22k : Sup. : 90IN-1k : 30 : 384100.687.3
modelparams (m)pretrainfinetunegflopsIN-1k
FocalNet-T-LRF28.6IN-1k : Sup. : 300— : — : —4.582.3/—
FocalNet-T-SRF28.4IN-1k : Sup. : 300— : — : —4.582.1/—
FocalNet-S-SRF50.3IN-1k : Sup. : 300— : — : —8.783.4/—
FocalNet-S-LRF50.3IN-1k : Sup. : 300— : — : —8.783.5/—
FocalNet-B-LRF88.7IN-1k : Sup. : 300— : — : —15.483.9/—
FocalNet-B-SRF88.1IN-1k : Sup. : 300— : — : —15.383.7/—
FocalNet-B-SRF88.1IN-22k : Sup. : 90IN-1k : 30 : 22415.385.6/—
FocalNet-B-SRF88.1IN-22k : Sup. : 90IN-1k : 30 : 38444.886.5/—
FocalNet-L-SRF197.1IN-22k : Sup. : 90IN-1k : 30 : 22434.286.5/—
FocalNet-L-SRF197.1IN-22k : Sup. : 90IN-1k : 30 : 384100.687.3/—

COCO (val)

modelpretrainheadtraingflopsmAPbAPb50APb75mAPbsmAPbmmAPbl
FocalNet-T-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12268.046.168.250.6
FocalNet-T-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36268.048.069.753.0
FocalNet-T-LRFIN-1k : Sup. : 300Cascade Mask R-CNNCOCO (train) : 36751.051.570.356.0
FocalNet-T-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12268.045.968.350.1
FocalNet-T-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36268.047.669.552.0
FocalNet-T-SRFIN-1k : Sup. : 300Cascade Mask R-CNNCOCO (train) : 36746.051.570.155.8
FocalNet-S-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12356.048.069.952.7
FocalNet-S-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36356.048.970.153.7
FocalNet-S-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12365.048.370.553.1
FocalNet-S-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36365.049.370.754.2
FocalNet-B-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12507.049.070.953.9
FocalNet-B-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36507.049.870.954.6
FocalNet-B-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12496.048.870.753.5
FocalNet-B-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36496.049.670.654.1

COCO (val)

modelpretrainheadtraingflopsmAPmAPm50APm75mAPmsmAPmmmAPml
FocalNet-T-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12268.041.565.144.5
FocalNet-T-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36268.042.966.546.1
FocalNet-T-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36268.041.365.044.3
FocalNet-T-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36268.042.666.545.6
FocalNet-S-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 1256.042.767.145.7
FocalNet-S-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 3656.043.667.147.1
FocalNet-S-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12365.043.167.446.2
FocalNet-S-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36365.043.867.947.4
FocalNet-B-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12507.043.567.946.7
FocalNet-B-LRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36507.044.168.247.2
FocalNet-B-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 12496.043.367.546.5
FocalNet-B-SRFIN-1k : Sup. : 300Mask R-CNNCOCO (train) : 36496.044.168.047.2

ADE20K (val)

modelpretrainheadtraingflopsmIoUmspAccmsmAccmsmIoUsspAccssmAccss
FocalNet-T-LRFIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 512949.047.846.8
FocalNet-T-SRFIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 512944.047.246.5
FocalNet-S-SRFIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 5121035.050.149.3
FocalNet-S-LRFIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 5121044.050.149.1
FocalNet-B-LRFIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 5121192.051.450.5
FocalNet-B-SRFIN-1k : Sup. : 300UPerNetADE20K (train) : 128 : 5121180.051.150.2