olmo_tap.experiments.uncertainty.training

HydraTransformer Double-Head Uncertainty Finetuning Pipeline.

Uses the FrozenHeadHandler to cycle through frozen LLM heads, each loaded with OLMo base + prod security LoRA + robustness LoRA. The uncertainty head is trained to predict a confidence score for the multiple-choice answer.

Intended usage (run from tap root):

    # cycles through randomly selected heads for 100 steps each
    pixi run python -m experiments.uncertainty.training --num-epochs 5 --swap-freq 100
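The head-swapping behaviour implied by `--swap-freq` can be pictured with a small sketch. This is not the module's code: `head_schedule`, `num_heads`, and `seed` are hypothetical names introduced for illustration, and the real FrozenHeadHandler may select heads differently (for example, without repeats).

```python
import random


def head_schedule(total_steps, swap_freq, num_heads, seed=0):
    """Return the (hypothetical) frozen-head index active at each training step."""
    rng = random.Random(seed)
    schedule = []
    current = None
    for step in range(total_steps):
        # Pick a new random head at the start of every swap_freq-step window.
        if step % swap_freq == 0:
            current = rng.randrange(num_heads)
        schedule.append(current)
    return schedule


# 300 steps with a swap every 100 steps gives three windows, one head each.
schedule = head_schedule(total_steps=300, swap_freq=100, num_heads=4)
```

Within each 100-step window the active head stays fixed, which matches the "100 steps each" wording in the usage comment.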

Functions

`compute_total_steps`(num_shards, batch_size, ...)
`main`()
`parse_args`()

olmo_tap.experiments.uncertainty.training.compute_total_steps(num_shards: int, batch_size: int, num_epochs: int) → int
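The function has no docstring on this page, so its exact formula is not documented. The following is only a plausible sketch of what a step count with this signature might compute. It assumes every shard holds a fixed number of examples (`EXAMPLES_PER_SHARD`, a hypothetical constant) and that a partial final batch still counts as a step. The real implementation may differ on both points.

```python
import math

# Hypothetical constant: the real shard size is not stated on this page.
EXAMPLES_PER_SHARD = 1000


def compute_total_steps_sketch(num_shards: int, batch_size: int, num_epochs: int) -> int:
    """Illustrative only: optimizer steps needed to cover all shards num_epochs times."""
    # Steps per epoch, rounding up so a partial final batch still counts as a step.
    steps_per_epoch = math.ceil(num_shards * EXAMPLES_PER_SHARD / batch_size)
    return steps_per_epoch * num_epochs


# 4 shards * 1000 examples / batch of 32 = 125 steps per epoch, times 5 epochs.
total = compute_total_steps_sketch(num_shards=4, batch_size=32, num_epochs=5)
```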

olmo_tap.experiments.uncertainty.training.main()

olmo_tap.experiments.uncertainty.training.parse_args() → Namespace
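Only the `--num-epochs` and `--swap-freq` flags are visible in the usage line above. A minimal sketch of a parser accepting those two flags, built on the standard `argparse` module, could look as follows. The defaults, help strings, and the `argv` parameter are assumptions, and the real `parse_args` likely defines further options.

```python
import argparse


def parse_args_sketch(argv=None) -> argparse.Namespace:
    """Illustrative parser covering only the two flags shown in the usage line."""
    parser = argparse.ArgumentParser(description="Uncertainty head finetuning (sketch)")
    # Default values below are assumptions, not the module's actual defaults.
    parser.add_argument("--num-epochs", type=int, default=1,
                        help="number of passes over the training shards")
    parser.add_argument("--swap-freq", type=int, default=100,
                        help="training steps between frozen-head swaps")
    return parser.parse_args(argv)


# argparse maps --num-epochs to args.num_epochs and --swap-freq to args.swap_freq.
args = parse_args_sketch(["--num-epochs", "5", "--swap-freq", "100"])
```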