olmo_tap.experiments.uncertainty.training¶
HydraTransformer Double-Head Uncertainty Finetuning Pipeline.
Uses the FrozenHeadHandler to cycle through frozen LLM heads, each loaded with OLMo base + prod security LoRA + robustness LoRA. Uncertainty head is trained to predict confidence score of the multiple choice answer.
- Intended Usage (run from tap root)::
# cycles through randomly selected heads for 100 steps each pixi run python -m experiments.uncertainty.training –num-epochs 5 –swap-freq 100
Functions
|
|
|
|