Documentation

LeanMachineLearning.SequentialLearning.AlgorithmDensity

Algorithm density #

We define a density function that allows obtaining the law of the history under one algorithm from the law of the history under another algorithm when they are interacting with the same environment. This also requires one algorithm to be absolutely continuous with respect to another, a concept that we also introduce here.

Main definitions #

Main results #

structure Learning.Algorithm.AbsolutelyContinuous {𝓐 : Type u_1} {𝓨 : Type u_2} [MeasurableSpace 𝓐] [MeasurableSpace 𝓨] (alg alg₀ : Algorithm 𝓐 𝓨) :

For every time and history, the distribution over actions according to alg is absolutely continuous with respect to the distribution over actions according to alg₀.

Instances For

    For every time and history, the distribution over actions according to alg is absolutely continuous with respect to the distribution over actions according to alg₀.

    Equations
    • One or more equations did not get rendered due to their size.
    Instances For
      noncomputable def Learning.Algorithm.density {𝓐 : Type u_1} {𝓨 : Type u_2} [MeasurableSpace 𝓐] [MeasurableSpace 𝓨] [MeasurableSpace.CountablyGenerated 𝓐] (alg alg₀ : Algorithm 𝓐 𝓨) (n : ) :
      ((Finset.Iic n)𝓐 × 𝓨)ENNReal

      If the algorithm alg is absolutely continuous with respect to the algorithm alg₀ and they are both interacting with the same environment, then the law of the history at time n under alg is the law of the history at time n under alg₀ with density alg.density alg₀ n.

      Equations
      Instances For
        theorem Learning.Algorithm.measurable_density {𝓐 : Type u_1} {𝓨 : Type u_2} [MeasurableSpace 𝓐] [MeasurableSpace 𝓨] [MeasurableSpace.CountablyGenerated 𝓐] (alg alg₀ : Algorithm 𝓐 𝓨) (n : ) :
        Measurable (alg.density alg₀ n)
        theorem Learning.IsAlgEnvSeq.absolutelyContinuous_map_history {𝓐 : Type u_1} {𝓨 : Type u_2} [MeasurableSpace 𝓐] [MeasurableSpace 𝓨] {Ω : Type u_3} [MeasurableSpace Ω] [StandardBorelSpace 𝓐] [Nonempty 𝓐] [StandardBorelSpace 𝓨] [Nonempty 𝓨] {alg : Algorithm 𝓐 𝓨} {env : Environment 𝓐 𝓨} {A : Ω𝓐} {Y : Ω𝓨} {P : MeasureTheory.Measure Ω} [MeasureTheory.IsFiniteMeasure P] {Ω₀ : Type u_4} [MeasurableSpace Ω₀] {alg₀ : Algorithm 𝓐 𝓨} {A₀ : Ω₀𝓐} {Y₀ : Ω₀𝓨} {P₀ : MeasureTheory.Measure Ω₀} [MeasureTheory.IsProbabilityMeasure P₀] (h : IsAlgEnvSeq A Y alg env P) (h₀ : IsAlgEnvSeq A₀ Y₀ alg₀ env P₀) (hc : alg.AbsolutelyContinuous alg₀) (n : ) :
        theorem Learning.IsAlgEnvSeq.hasLaw_history_withDensity {𝓐 : Type u_1} {𝓨 : Type u_2} [MeasurableSpace 𝓐] [MeasurableSpace 𝓨] {Ω : Type u_3} [MeasurableSpace Ω] [StandardBorelSpace 𝓐] [Nonempty 𝓐] [StandardBorelSpace 𝓨] [Nonempty 𝓨] {alg : Algorithm 𝓐 𝓨} {env : Environment 𝓐 𝓨} {A : Ω𝓐} {Y : Ω𝓨} {P : MeasureTheory.Measure Ω} [MeasureTheory.IsFiniteMeasure P] {Ω₀ : Type u_4} [MeasurableSpace Ω₀] {alg₀ : Algorithm 𝓐 𝓨} {A₀ : Ω₀𝓐} {Y₀ : Ω₀𝓨} {P₀ : MeasureTheory.Measure Ω₀} [MeasureTheory.IsProbabilityMeasure P₀] (h : IsAlgEnvSeq A Y alg env P) (h₀ : IsAlgEnvSeq A₀ Y₀ alg₀ env P₀) (hc : alg.AbsolutelyContinuous alg₀) (n : ) :