olmo_tap.experiments.security.training
HydraTransformer Security Finetuning Pipeline.
Loads the base OLMo weights, then injects fresh LoRA adapters for security training on MedMCQA.
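As a rough illustration of what injecting a "fresh" LoRA adapter over frozen base weights means, the sketch below wraps a frozen weight matrix with a low-rank update in pure Python. The class name ``LoRALinear``, the ``rank``/``alpha`` parameters, and the initialization scale are assumptions made for this example, not this module's actual API. It relies only on the standard LoRA convention that the up-projection starts at zero, so a freshly injected adapter leaves the base model's output unchanged.

```python
import random


def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


class LoRALinear:
    """Hypothetical frozen linear layer with a low-rank (LoRA) update.

    Computes W x + (alpha / rank) * B (A x), where W is the frozen base weight.
    """

    def __init__(self, weight, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        out_dim, in_dim = len(weight), len(weight[0])
        self.weight = weight  # base weights: loaded once, never updated
        self.scale = alpha / rank
        # Down-projection A: small random init (the 0.01 scale is an assumption).
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(in_dim)] for _ in range(rank)]
        # Up-projection B: zeros, so the fresh adapter is initially a no-op.
        self.B = [[0.0] * rank for _ in range(out_dim)]

    def forward(self, x):
        base = matvec(self.weight, x)
        delta = matvec(self.B, matvec(self.A, x))
        return [b + self.scale * d for b, d in zip(base, delta)]


layer = LoRALinear([[1.0, 2.0], [3.0, 4.0]], rank=1)
print(layer.forward([1.0, 1.0]))  # equals the base output until B is trained
```

In a setup like this, only ``A`` and ``B`` would receive gradient updates during finetuning; the base weights stay fixed.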
- Intended Usage (run from tap root)::

      # quick test on shard 0
      pixi run python -m experiments.security.training --shard-id 0 --num-epochs 3

      # train all 9 shards
      bash experiments/security/run_all.sh 3
Functions
Compute total training steps from dataset geometry (no data loading needed).
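A minimal sketch of how a step count can be derived from dataset geometry alone, without touching the data. The function name ``estimate_total_steps``, its parameters, and the numbers in the usage line are hypothetical and chosen for illustration; the module's real function may use a different signature and different rounding.

```python
import math


def estimate_total_steps(num_examples, num_shards, batch_size,
                         grad_accum_steps, num_epochs):
    """Estimate optimizer steps for one shard from sizes only (hypothetical helper)."""
    # Examples assigned to a single shard, assuming an even split.
    examples_per_shard = math.ceil(num_examples / num_shards)
    # Micro-batches per epoch; the last partial batch still counts.
    micro_batches = math.ceil(examples_per_shard / batch_size)
    # One optimizer step per group of accumulated micro-batches.
    steps_per_epoch = math.ceil(micro_batches / grad_accum_steps)
    return steps_per_epoch * num_epochs


# Illustrative numbers only, not the real MedMCQA size or training config.
print(estimate_total_steps(9000, 9, 8, 4, 3))  # 96
```

Knowing the total up front is useful, for example, for configuring a learning-rate schedule before the data loader is built.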