f-IRL#

Reference PDF: pending known-truth migration.

f-IRL learns a tabular reward by matching expert and model occupancy measures under an f-divergence. It is part of the 12-estimator known-truth validation suite.

This RTD page is intentionally short while the dedicated f-IRL TeX/PDF tutorial is migrated to the shared synthetic DGP. No real-data estimation examples are published on RTD.

Migration Target#

  • PDF source: pending dedicated primer TeX

  • Result generator: pending known-truth result generator

  • Shared DGP harness: experiments/known_truth.py