Learning.isStoppingTime_stepsUntil
This page has the declaration's own card below, then its dependency graph, then a card for each dependency (type dependencies first, then the rest of the transitive closure). For a theorem, the graph and the dependency cards only follow its statement's dependencies (its proof is replaced by sorry, so what it proves doesn't depend on how); for everything else, both the type and the body/value are followed, since their content is part of what later declarations build on.
isStoppingTime_stepsUntil๐
Learning.isStoppingTime_stepsUntilNo docstring.
Learning.isStoppingTime_stepsUntil.{u_1, u_2, u_3} {๐ : Type u_1} {R : Type u_2} {ฮฉ : Type u_3} {m๐ : MeasurableSpace ๐} {mR : MeasurableSpace R} {mฮฉ : MeasurableSpace ฮฉ} [DecidableEq ๐] {A : โ โ ฮฉ โ ๐} {R' : โ โ ฮฉ โ R} {m : โ} [MeasurableSingletonClass ๐] (hA : โ (n : โ), Measurable (A n)) (hR' : โ (n : โ), Measurable (R' n)) (a : ๐) (hm : m โ 0) : MeasureTheory.IsStoppingTime (IsAlgEnvSeq.filtration hA hR') (stepsUntil A a m)Learning.isStoppingTime_stepsUntil.{u_1, u_2, u_3} {๐ : Type u_1} {R : Type u_2} {ฮฉ : Type u_3} {m๐ : MeasurableSpace ๐} {mR : MeasurableSpace R} {mฮฉ : MeasurableSpace ฮฉ} [DecidableEq ๐] {A : โ โ ฮฉ โ ๐} {R' : โ โ ฮฉ โ R} {m : โ} [MeasurableSingletonClass ๐] (hA : โ (n : โ), Measurable (A n)) (hR' : โ (n : โ), Measurable (R' n)) (a : ๐) (hm : m โ 0) : MeasureTheory.IsStoppingTime (IsAlgEnvSeq.filtration hA hR') (stepsUntil A a m)
Code
lemma isStoppingTime_stepsUntil [MeasurableSingletonClass ๐]
(hA : โ n, Measurable (A n)) (hR' : โ n, Measurable (R' n)) (a : ๐) (hm : m โ 0) :
IsStoppingTime (IsAlgEnvSeq.filtration hA hR') (stepsUntil A a m)Type uses (2)
Body uses (3)
Actions: Source ยท Open Issue
Proof
by
rw [stepsUntil_eq_leastGE _ hm]
refine StronglyAdapted.isStoppingTime_leastGE _ fun n โฆ ?_
suffices StronglyMeasurable[IsAlgEnvSeq.filtration hA hR' n] (pullCount A a (n + 1)) by
fun_prop
refine Measurable.stronglyMeasurable ?_
exact adapted_pullCount_add_one hA hR' a nDependency graph
Type dependencies (2)
filtration๐
Learning.IsAlgEnvSeq.filtration
Filtration generated by the history up to time n.
Learning.IsAlgEnvSeq.filtration.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {ฮฉ : Type u_3} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {mฮฉ : MeasurableSpace ฮฉ} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (hA : โ (n : โ), Measurable (A n)) (hY : โ (n : โ), Measurable (Y n)) : MeasureTheory.Filtration โ mฮฉLearning.IsAlgEnvSeq.filtration.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {ฮฉ : Type u_3} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {mฮฉ : MeasurableSpace ฮฉ} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (hA : โ (n : โ), Measurable (A n)) (hY : โ (n : โ), Measurable (Y n)) : MeasureTheory.Filtration โ mฮฉ
Code
def IsAlgEnvSeq.filtration (hA : โ n, Measurable (A n)) (hY : โ n, Measurable (Y n)) :
Filtration โ mฮฉ where
seq i := MeasurableSpace.comap (history A Y i) inferInstance
mono' i j hij := by
simp only
rw [โ measurable_iff_comap_le]
have : history A Y i = (fun h k โฆ h โจk.1, by grindโฉ) โ history A Y j := rfl
rw [this]
exact measurable_comp_comap _ (by fun_prop)
le' i := by
rw [โ measurable_iff_comap_le]
exact Learning.measurable_history hA hY iBody uses (3)
Used by (18)
Actions: Source ยท Open Issue
stepsUntil๐
Learning.stepsUntil
Number of steps until action a was pulled exactly m times.
Learning.stepsUntil.{u_1, u_3} {๐ : Type u_1} {ฮฉ : Type u_3} [DecidableEq ๐] (A : โ โ ฮฉ โ ๐) (a : ๐) (m : โ) (ฯ : ฮฉ) : โโLearning.stepsUntil.{u_1, u_3} {๐ : Type u_1} {ฮฉ : Type u_3} [DecidableEq ๐] (A : โ โ ฮฉ โ ๐) (a : ๐) (m : โ) (ฯ : ฮฉ) : โโ
Code
noncomputable
def stepsUntil (A : โ โ ฮฉ โ ๐) (a : ๐) (m : โ) (ฯ : ฮฉ) : โโ :=
sInf ((โ) '' {s | pullCount A a (s + 1) ฯ = m})Body uses (1)
Used by (46)
Actions: Source ยท Open Issue
All dependencies, transitively (4)
history๐
Learning.history
History of the algorithm-environment sequence up to time n.
Learning.history.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {ฮฉ : Type u_3} (A : โ โ ฮฉ โ ๐) (Y : โ โ ฮฉ โ ๐จ) (n : โ) (ฯ : ฮฉ) : โฅ(Finset.Iic n) โ ๐ ร ๐จLearning.history.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {ฮฉ : Type u_3} (A : โ โ ฮฉ โ ๐) (Y : โ โ ฮฉ โ ๐จ) (n : โ) (ฯ : ฮฉ) : โฅ(Finset.Iic n) โ ๐ ร ๐จ
Code
def history (A : โ โ ฮฉ โ ๐) (Y : โ โ ฮฉ โ ๐จ) (n : โ) (ฯ : ฮฉ) : Iic n โ ๐ ร ๐จ := fun i โฆ (A i ฯ, Y i ฯ)
Actions: Source ยท Open Issue
measurable_comp_comap๐
MeasureTheory.measurable_comp_comapNo docstring.
MeasureTheory.measurable_comp_comap.{u_1, u_2, u_3} {ฮฑ : Type u_1} {ฮฒ : Type u_2} {ฮณ : Type u_3} {mฮฒ : MeasurableSpace ฮฒ} {mฮณ : MeasurableSpace ฮณ} (f : ฮฑ โ ฮฒ) {g : ฮฒ โ ฮณ} (hg : Measurable g) : Measurable (g โ f)MeasureTheory.measurable_comp_comap.{u_1, u_2, u_3} {ฮฑ : Type u_1} {ฮฒ : Type u_2} {ฮณ : Type u_3} {mฮฒ : MeasurableSpace ฮฒ} {mฮณ : MeasurableSpace ฮณ} (f : ฮฑ โ ฮฒ) {g : ฮฒ โ ฮณ} (hg : Measurable g) : Measurable (g โ f)
Code
lemma measurable_comp_comap (f : ฮฑ โ ฮฒ) {g : ฮฒ โ ฮณ} (hg : Measurable g) :
Measurable[mฮฒ.comap f] (g โ f)Used by (10)
Actions: Source ยท Open Issue
Proof
by rw [measurable_iff_comap_le, โ MeasurableSpace.comap_comp] exact MeasurableSpace.comap_mono hg.comap_le
measurable_history๐
Learning.measurable_historyNo docstring.
Learning.measurable_history.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {ฮฉ : Type u_3} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {mฮฉ : MeasurableSpace ฮฉ} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (hA : โ (n : โ), Measurable (A n)) (hY : โ (n : โ), Measurable (Y n)) (n : โ) : Measurable (history A Y n)Learning.measurable_history.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {ฮฉ : Type u_3} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {mฮฉ : MeasurableSpace ฮฉ} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (hA : โ (n : โ), Measurable (A n)) (hY : โ (n : โ), Measurable (Y n)) (n : โ) : Measurable (history A Y n)
Code
lemma measurable_history (hA : โ n, Measurable (A n))
(hY : โ n, Measurable (Y n)) (n : โ) :
Measurable (history A Y n)Type uses (1)
Used by (10)
Actions: Source ยท Open Issue
Proof
by unfold history fun_prop
pullCount๐
Learning.pullCount
Number of times action a was chosen up to time t (excluding t).
Learning.pullCount.{u_1, u_3} {๐ : Type u_1} {ฮฉ : Type u_3} [DecidableEq ๐] (A : โ โ ฮฉ โ ๐) (a : ๐) (t : โ) (ฯ : ฮฉ) : โLearning.pullCount.{u_1, u_3} {๐ : Type u_1} {ฮฉ : Type u_3} [DecidableEq ๐] (A : โ โ ฮฉ โ ๐) (a : ๐) (t : โ) (ฯ : ฮฉ) : โ
Code
noncomputable def pullCount (A : โ โ ฮฉ โ ๐) (a : ๐) (t : โ) (ฯ : ฮฉ) : โ := #(filter (fun s โฆ A s ฯ = a) (range t))
Actions: Source ยท Open Issue