Supported Models
The table below lists all models currently targeted by NewComputeBench.
| Task | Model | Sizes | Notes |
|---|---|---|---|
| Text classification | | | Encoder-only model included as a sanity-check baseline. |
| Causal language modeling | | 60M, 200M, 400M, 1.1B | Custom family using the Llama-3.1 architecture. Trained with cosmo2-tokenizer on FineWeb-Edu. Checkpoints: AICrossSim collection. |
| Causal language modeling | | 1B, 3B, 8B, 70B | Meta’s Llama-3 family. |
| Image classification | | 86M | google/vit-base-patch16-224 from HuggingFace. |
| Causal language modeling | TBD | TBD | |
| Image generation | TBD | TBD | |
Training Support
Pretraining from scratch
| Model | Supported |
|---|---|
| | ✅ |
| | ✅ |
Fine-tuning
| Model | Supported |
|---|---|
| | ✅ |
| | ⏹️ |
Evaluation
| Task | Model | Supported |
|---|---|---|
| Text classification (GLUE) | | ✅ |
| Causal language modeling | | ✅ |
| lm-eval-harness benchmarks | | ✅ |
Model Behaviour-Level Simulation
Transform-aware pretraining from scratch
| Transform | Model | Supported |
|---|---|---|
| Random Bitflip | | ✅ |
| Optical Compute | | ⏹️ |
| In-Memory Compute | | ⏹️ |
| Spiking Neural Networks | | ⏹️ |
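For readers unfamiliar with the Random Bitflip transform listed above, the following is a minimal, generic sketch of what a random bit flip on a single float32 value can look like. It is an illustration only: the function name `flip_random_bits`, the per-bit probability `p`, and the choice of float32 are assumptions made here, and this is not NewComputeBench's implementation or API.

```python
import random
import struct


def flip_random_bits(value: float, p: float, rng: random.Random) -> float:
    """Flip each of the 32 bits of the float32 encoding of `value`
    independently with probability `p` (hypothetical helper, illustration only)."""
    # Reinterpret the float32 encoding as an unsigned 32-bit integer.
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    for i in range(32):
        if rng.random() < p:
            bits ^= 1 << i  # toggle bit i
    # Reinterpret the perturbed integer as a float32 again.
    (out,) = struct.unpack("<f", struct.pack("<I", bits))
    return out


rng = random.Random(0)
unchanged = flip_random_bits(1.5, 0.0, rng)  # p = 0: no bit is flipped
inverted = flip_random_bits(1.5, 1.0, rng)   # p = 1: every bit is flipped
```

With a small `p`, most values pass through unchanged, while an occasional flip in the exponent or sign bits can change a value drastically. That sensitivity is one reason a bitflip-style transform might be applied during training rather than only at evaluation time.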
Post-transform evaluation
| Transform | Task | Model | Supported |
|---|---|---|---|
| Random Bitflip | lm-eval-harness | | ✅ |
| Optical Compute | lm-eval-harness | | ✅ |
| In-Memory Compute | lm-eval-harness | | ✅ |
| Spiking Neural Networks | lm-eval-harness | | ✅ |