Changelog
31 May 2026
Bitflip-aware LoRA fine-tuning of Llama-3-70B with FSDP2 (Bitflip-Aware LoRA Fine-Tuning with FSDP2 (Llama-3-70B))
Scales the bitflip-aware LoRA recipe from 8B to 70B by replacing the HF Trainer /
Accelerate path with a standalone torchrun + FSDP2 script built on
torchtitan model definitions. Bitflip-only evaluation degrades perplexity from
6.17 to 89.28 on Llama-3-70B. Bitflip-aware LoRA recovery training
(r = 32, lr = 2e-4) reduced training loss from ~4.49 to
~2.5 – 2.7 and was stopped early at 7,900 steps (original target: 21,000),
indicating that the 8B recipe transfers to 70B.
| Item | Link |
|---|---|
| Llama-3-70B bitflip-aware LoRA fine-tuning (FSDP2) | |
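To relate the training-loss figures in this entry to the perplexity figures, the sketch below converts loss to perplexity. It assumes the reported loss is the mean per-token cross-entropy in nats, which the entry does not state; the `perplexity` helper is illustrative and not part of the project's code.

```python
import math


def perplexity(mean_cross_entropy_nats: float) -> float:
    """Perplexity is the exponential of the mean per-token cross-entropy.

    Assumption: the loss is measured in nats (natural logarithm).
    """
    return math.exp(mean_cross_entropy_nats)


# Training-loss values quoted in the entry above.
for label, loss in [
    ("start of recovery training", 4.49),
    ("end of training, lower bound", 2.5),
    ("end of training, upper bound", 2.7),
]:
    print(f"{label}: loss {loss:.2f} -> perplexity {perplexity(loss):.2f}")
```

Under that assumption, the initial loss of ~4.49 corresponds to a perplexity of roughly 89, in line with the bitflip-only evaluation figure, and the final loss range corresponds to roughly 12 to 15. Training loss and evaluation perplexity may be measured on different data, so this is only a rough correspondence.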
4 February 2026
Bitflip-aware LoRA fine-tuning of Llama-3.1-8B (Bitflip-Aware LoRA Fine-Tuning)
LoRA adapters with only 1.2% trainable parameters effectively mitigate random bitflip noise, reducing validation perplexity from 1008.95 to 11.01 (clean baseline: 7.91).
| Item | Link |
|---|---|
| Llama-3.1-8B with random bitflip noise | |
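As a rough illustration of what "random bitflip noise" can mean for floating-point values, the sketch below flips each bit of a float32 value independently with probability `p`. The noise model the project actually uses (which bits, which tensors, what probabilities) is not given in this changelog, so the function `flip_random_bits` and its parameters are hypothetical.

```python
import random
import struct


def flip_random_bits(values, p, seed=0):
    """Return float32 copies of `values` with each bit flipped with probability p.

    Assumption: independent, uniform per-bit flips over all 32 bits.
    This is an illustrative noise model, not the project's implementation.
    """
    rng = random.Random(seed)
    noisy = []
    for v in values:
        # Reinterpret the float32 encoding of v as an unsigned 32-bit integer.
        (bits,) = struct.unpack("<I", struct.pack("<f", v))
        for i in range(32):
            if rng.random() < p:
                bits ^= 1 << i  # flip bit i
        # Reinterpret the (possibly corrupted) bits as a float32 again.
        noisy.append(struct.unpack("<f", struct.pack("<I", bits))[0])
    return noisy


# With a small flip probability most values survive unchanged, but a single
# flip in the exponent bits can change a value by many orders of magnitude.
print(flip_random_bits([0.5, -1.25, 3.0], p=0.01, seed=42))
```

Flips in the exponent bits explain why even a low flip rate can degrade perplexity sharply, which is the kind of error the LoRA adapters are trained to compensate for.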
4 October 2025
Optical Transformer fine-tuning on CLM models (60M – 1.1B) (Scaling Optical Transformers to Causal Language Models)
Full fine-tuning of pretrained causal language models (CLMs) under optical transformer simulation.
| Item | Link |
|---|---|
| Optical Transformer on CLM | |
1 October 2025
Optical Transformer, Spiking Transformer, and PIM on RoBERTa
Initial experiments on RoBERTa with three new compute paradigms.
| Item | Link |
|---|---|
| Optical Transformer on RoBERTa | |
| Spiking Transformer on RoBERTa | |
| Processing in Memory on RoBERTa | |
9 June 2025
Mase-Triton released on PyPI (Mase-Triton)
Our software-emulation and acceleration backend is now publicly available:
pip install mase-triton
See Mase-Triton for full documentation.
15 April 2025
System and model-level training simulation for Small Language Models
Initial release of the scaling framework and bitflip-aware pretraining pipeline.
| Item | Link |
|---|---|
| Environment setup | |
| Pretraining AICrossSim-CLM (60M – 1.1B) and evaluation | |
| Bitflip-aware pretraining and evaluation | |