`Learning.isStoppingTime_stepsUntil`

This page shows the declaration's own card first, then its dependency graph, then one card per dependency (type dependencies first, followed by the rest of the transitive closure). For a theorem, the graph and the dependency cards follow only the dependencies of its statement: the proof is replaced by `sorry`, because what a theorem states does not depend on how it is proved. For every other kind of declaration, both the type and the body (or value) are followed, since their content is part of what later declarations build on.

`isStoppingTime_stepsUntil`

Lemma `Learning.isStoppingTime_stepsUntil`

No docstring.

theorem

Learning.isStoppingTime_stepsUntil.{u_1, u_2, u_3} {𝓐 : Type u_1}
  {R : Type u_2} {Ω : Type u_3} {m𝓐 : MeasurableSpace 𝓐}
  {mR : MeasurableSpace R} {mΩ : MeasurableSpace Ω} [DecidableEq 𝓐]
  {A : ℕ → Ω → 𝓐} {R' : ℕ → Ω → R} {m : ℕ} [MeasurableSingletonClass 𝓐]
  (hA : ∀ (n : ℕ), Measurable (A n))
  (hR' : ∀ (n : ℕ), Measurable (R' n)) (a : 𝓐) (hm : m ≠ 0) :
  MeasureTheory.IsStoppingTime (IsAlgEnvSeq.filtration hA hR')
    (stepsUntil A a m)

Code

lemma isStoppingTime_stepsUntil [MeasurableSingletonClass 𝓐]
    (hA : ∀ n, Measurable (A n)) (hR' : ∀ n, Measurable (R' n)) (a : 𝓐) (hm : m ≠ 0) :
    IsStoppingTime (IsAlgEnvSeq.filtration hA hR') (stepsUntil A a m)

Type uses (2)

Body uses (3)

Proof

by
  rw [stepsUntil_eq_leastGE _ hm]
  refine StronglyAdapted.isStoppingTime_leastGE _ fun n ↦ ?_
  suffices StronglyMeasurable[IsAlgEnvSeq.filtration hA hR' n] (pullCount A a (n + 1)) by
    fun_prop
  refine Measurable.stronglyMeasurable ?_
  exact adapted_pullCount_add_one hA hR' a n

Dependency graph

Type dependencies (2)

`filtration`

Definition `Learning.IsAlgEnvSeq.filtration`

Filtration generated by the history up to time n.

def

Learning.IsAlgEnvSeq.filtration.{u_1, u_2, u_3} {𝓐 : Type u_1}
  {𝓨 : Type u_2} {Ω : Type u_3} {m𝓐 : MeasurableSpace 𝓐}
  {m𝓨 : MeasurableSpace 𝓨} {mΩ : MeasurableSpace Ω} {A : ℕ → Ω → 𝓐}
  {Y : ℕ → Ω → 𝓨} (hA : ∀ (n : ℕ), Measurable (A n))
  (hY : ∀ (n : ℕ), Measurable (Y n)) : MeasureTheory.Filtration ℕ mΩ

Code

def IsAlgEnvSeq.filtration (hA : ∀ n, Measurable (A n)) (hY : ∀ n, Measurable (Y n)) :
    Filtration ℕ mΩ where
  seq i := MeasurableSpace.comap (history A Y i) inferInstance
  mono' i j hij := by
    simp only
    rw [← measurable_iff_comap_le]
    have : history A Y i = (fun h k ↦ h ⟨k.1, by grind⟩) ∘ history A Y j := rfl
    rw [this]
    exact measurable_comp_comap _ (by fun_prop)
  le' i := by
    rw [← measurable_iff_comap_le]
    exact Learning.measurable_history hA hY i

Body uses (3)

Used by (18)

`stepsUntil`

Definition `Learning.stepsUntil`

Number of steps until action a was pulled exactly m times.

def

Learning.stepsUntil.{u_1, u_3} {𝓐 : Type u_1} {Ω : Type u_3}
  [DecidableEq 𝓐] (A : ℕ → Ω → 𝓐) (a : 𝓐) (m : ℕ) (ω : Ω) : ℕ∞

Code

noncomputable
def stepsUntil (A : ℕ → Ω → 𝓐) (a : 𝓐) (m : ℕ) (ω : Ω) : ℕ∞ :=
  sInf ((↑) '' {s | pullCount A a (s + 1) ω = m})

Body uses (1)

pullCount

Used by (46)
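
Read literally, the definition of `stepsUntil` takes an infimum in `ℕ∞` over the (coerced) set of step indices `s` with `pullCount A a (s + 1) ω = m`. If no such index exists, that set is empty and the infimum is `⊤`, which is why the codomain is `ℕ∞` and not `ℕ`. The following is a minimal sketch of that boundary case only. It assumes a recent Mathlib (imported wholesale via `import Mathlib`) and does not use the `Learning` namespace.

```lean
import Mathlib

-- Sketch, not part of the library: the image of an empty set of step indices
-- is empty, and the infimum of the empty set in the complete lattice `ℕ∞`
-- is its top element `⊤`.
example : sInf (((↑) : ℕ → ℕ∞) '' (∅ : Set ℕ)) = ⊤ := by
  rw [Set.image_empty, sInf_empty]
```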

All dependencies, transitively (4)

`history`

Definition `Learning.history`

History of the algorithm-environment sequence up to time n.

def

Learning.history.{u_1, u_2, u_3} {𝓐 : Type u_1} {𝓨 : Type u_2}
  {Ω : Type u_3} (A : ℕ → Ω → 𝓐) (Y : ℕ → Ω → 𝓨) (n : ℕ) (ω : Ω) :
  ↥(Finset.Iic n) → 𝓐 × 𝓨

Code

def history (A : ℕ → Ω → 𝓐) (Y : ℕ → Ω → 𝓨) (n : ℕ) (ω : Ω) : Iic n → 𝓐 × 𝓨 :=
  fun i ↦ (A i ω, Y i ω)

Used by (72)

`measurable_comp_comap`

Lemma `MeasureTheory.measurable_comp_comap`

No docstring.

theorem

MeasureTheory.measurable_comp_comap.{u_1, u_2, u_3} {α : Type u_1}
  {β : Type u_2} {γ : Type u_3} {mβ : MeasurableSpace β}
  {mγ : MeasurableSpace γ} (f : α → β) {g : β → γ} (hg : Measurable g) :
  Measurable (g ∘ f)

Code

lemma measurable_comp_comap (f : α → β) {g : β → γ} (hg : Measurable g) :
    Measurable[mβ.comap f] (g ∘ f)

Used by (10)

Proof

by
  rw [measurable_iff_comap_le, ← MeasurableSpace.comap_comp]
  exact MeasurableSpace.comap_mono hg.comap_le
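
The statement says that any map of the form `g ∘ f` with `g` measurable is measurable for the σ-algebra pulled back along `f`. As a hedged sketch, the same fact can also be derived by composing `hg` with Mathlib's `comap_measurable`, which states that `f` itself is measurable for `mβ.comap f`. This is an alternative derivation for illustration, not the proof shown above, and it assumes a recent Mathlib in which `comap_measurable` and `Measurable.comp` have their usual signatures.

```lean
import Mathlib

open MeasureTheory

-- Sketch under the assumptions stated above: `f` is measurable from
-- `mβ.comap f` to `mβ`, and composing with a measurable `g` keeps measurability.
example {α β γ : Type*} {mβ : MeasurableSpace β} {mγ : MeasurableSpace γ}
    (f : α → β) {g : β → γ} (hg : Measurable g) :
    Measurable[mβ.comap f] (g ∘ f) :=
  hg.comp (comap_measurable f)
```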

`measurable_history`

Lemma `Learning.measurable_history`

No docstring.

theorem

Learning.measurable_history.{u_1, u_2, u_3} {𝓐 : Type u_1}
  {𝓨 : Type u_2} {Ω : Type u_3} {m𝓐 : MeasurableSpace 𝓐}
  {m𝓨 : MeasurableSpace 𝓨} {mΩ : MeasurableSpace Ω} {A : ℕ → Ω → 𝓐}
  {Y : ℕ → Ω → 𝓨} (hA : ∀ (n : ℕ), Measurable (A n))
  (hY : ∀ (n : ℕ), Measurable (Y n)) (n : ℕ) :
  Measurable (history A Y n)

Code

lemma measurable_history (hA : ∀ n, Measurable (A n))
    (hY : ∀ n, Measurable (Y n)) (n : ℕ) :
    Measurable (history A Y n)

Type uses (1)

history

Used by (10)

Proof

by
  unfold history
  fun_prop

`pullCount`

Definition `Learning.pullCount`

Number of times action a was chosen up to time t (excluding t).

def

Learning.pullCount.{u_1, u_3} {𝓐 : Type u_1} {Ω : Type u_3}
  [DecidableEq 𝓐] (A : ℕ → Ω → 𝓐) (a : 𝓐) (t : ℕ) (ω : Ω) : ℕ

Code

noncomputable
def pullCount (A : ℕ → Ω → 𝓐) (a : 𝓐) (t : ℕ) (ω : Ω) : ℕ :=
  #(filter (fun s ↦ A s ω = a) (range t))

Used by (146)
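
To make the "excluding t" convention concrete, here is a small sketch with a deterministic action sequence. The names `pullCountDet` and `alternating` are hypothetical and introduced only for this illustration: the sample point `ω` is dropped and actions are natural numbers, so the count can be evaluated directly. It assumes a recent Mathlib.

```lean
import Mathlib

/-- Hypothetical deterministic analogue of `pullCount` (not part of the library):
counts the times `s < t` at which the action `A s` equals `a`. -/
def pullCountDet (A : ℕ → ℕ) (a t : ℕ) : ℕ :=
  ((Finset.range t).filter (fun s ↦ A s = a)).card

/-- Hypothetical action sequence alternating between actions 0 and 1. -/
def alternating (s : ℕ) : ℕ := s % 2

-- Nothing has been pulled strictly before time 0.
example : pullCountDet alternating 0 0 = 0 := by decide

-- Action 0 is pulled at times 0 and 2; the pull at time 4 is excluded.
example : pullCountDet alternating 0 4 = 2 := by decide

-- Moving the horizon to 5 includes the pull at time 4.
example : pullCountDet alternating 0 5 = 3 := by decide
```

The shift by one in `pullCount A a (s + 1)` inside `stepsUntil` reflects this convention: the pull made at step `s` is only counted from horizon `s + 1` onward.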
