ImageNet-ReaL Dataset
| family | model | params (m) | pretrain | finetune | GFLOPs | Top-1 |
|---|---|---|---|---|---|---|
| ResNet (RSB) | ResNet-18 (RSB) | 11.7 | IN-1k : Sup. : 600 | — : — : — | 1.8 | 79.4 |
| ResNet (RSB) | ResNet-34 (RSB) | 21.8 | IN-1k : Sup. : 600 | — : — : — | 3.7 | 83.4 |
| ResNet (RSB) | ResNet-50 (RSB) | 25.6 | IN-1k : Sup. : 600 | — : — : — | 4.1 | 85.7 |
| ResNet (RSB) | ResNet-101 (RSB) | 44.5 | IN-1k : Sup. : 600 | — : — : — | 7.9 | 86.3 |
| ResNet (RSB) | ResNet-152 (RSB) | 60.2 | IN-1k : Sup. : 600 | — : — : — | 11.6 | 86.4 |
| CoCA ViT | CoCA ViT-11M | 11.4 | IN-1k : Sup. : 300 | — : — : — | 2.2 | 87.8 |
| CoCA ViT | CoCA ViT-21M | 20.6 | IN-1k : Sup. : 300 | — : — : — | 4.1 | 88.3 |
| CoCA ViT | CoCA ViT-28M | 27.8 | IN-1k : Sup. : 300 | — : — : — | 4.9 | 88.4 |
| family | model | params (m) | pretrain | finetune | gflops | top-1 | top-5 |
|---|---|---|---|---|---|---|---|
| ResNet (RSB) | ResNet-18 (RSB) | 11.7 | IN-1k : Sup. : 600 | — : — : — | 1.8 | 79.4 | — |
| ResNet (RSB) | ResNet-34 (RSB) | 21.8 | IN-1k : Sup. : 600 | — : — : — | 3.7 | 83.4 | — |
| ResNet (RSB) | ResNet-50 (RSB) | 25.6 | IN-1k : Sup. : 600 | — : — : — | 4.1 | 85.7 | — |
| ResNet (RSB) | ResNet-101 (RSB) | 44.5 | IN-1k : Sup. : 600 | — : — : — | 7.9 | 86.3 | — |
| ResNet (RSB) | ResNet-152 (RSB) | 60.2 | IN-1k : Sup. : 600 | — : — : — | 11.6 | 86.4 | — |
| CoCA ViT | CoCA ViT-11M | 11.4 | IN-1k : Sup. : 300 | — : — : — | 2.2 | 87.8 | — |
| CoCA ViT | CoCA ViT-21M | 20.6 | IN-1k : Sup. : 300 | — : — : — | 4.1 | 88.3 | — |
| CoCA ViT | CoCA ViT-28M | 27.8 | IN-1k : Sup. : 300 | — : — : — | 4.9 | 88.4 | — |