Fenrir
Applied AI Engineer | Experimental
Updated: 2026-04-18
Local LLM behavior evaluator with pressure conditions, canonical readouts, and explicit uncertainty guardrails.
Impact: Makes pressure-sensitive behavior shifts inspectable without pretending a heuristic readout is a universal safety score.
What I built
- Runs a setup-first local UI for endpoint configuration, connection tests, evaluation launch, and canonical readout export.
- Defines hybrid MVP batteries across authority override, reputation shielding, and urgency tradeoff conditions.
- Keeps claims bounded with explicit uncertainty, non-diagnostic language, and heuristic readout contracts.
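One way to picture the heuristic readout contract described above is a small record that pairs each pressure condition with a measured shift and its spread, and is always marked non-diagnostic. This is a minimal sketch, not Fenrir's actual code: `Readout`, `summarize`, the condition identifiers, and the per-item score lists are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from statistics import mean, pstdev

# Hypothetical identifiers for the three pressure conditions named in the list above.
CONDITIONS = ("authority_override", "reputation_shielding", "urgency_tradeoff")


@dataclass(frozen=True)
class Readout:
    """Hypothetical readout record; field names are assumptions, not Fenrir's schema."""
    condition: str
    shift: float   # mean of (pressured score - baseline score) across items
    spread: float  # population standard deviation of those per-item differences
    n_items: int
    diagnostic: bool = False  # fixed: the readout is a heuristic, not a safety score
    note: str = "Heuristic readout; not a diagnostic or universal safety score."


def summarize(condition, baseline, pressured):
    """Compare per-item scores with and without pressure and report the shift."""
    if condition not in CONDITIONS:
        raise ValueError(f"unknown condition: {condition}")
    if not baseline or len(baseline) != len(pressured):
        raise ValueError("baseline and pressured must be non-empty and equal length")
    diffs = [p - b for b, p in zip(baseline, pressured)]
    return Readout(
        condition=condition,
        shift=round(mean(diffs), 3),
        spread=round(pstdev(diffs), 3),
        n_items=len(diffs),
    )


# Illustrative scores only; these are not results from any real evaluation.
example = summarize("urgency_tradeoff", [0.8, 0.6, 0.7], [0.5, 0.6, 0.4])
print(example)
```

Carrying the spread and item count next to the shift keeps the uncertainty attached to the number when the readout is exported, and the frozen `diagnostic` default makes the non-diagnostic framing part of the data rather than a caption that can be dropped.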
Proof: Run `python3 scripts/start_fenrir.py`, configure an endpoint, and execute the hybrid MVP evaluation.
Python | FastAPI | Pytest | YAML | Local UI