Learning.IsBayesAlgEnvSeq.measurable_regret
This page has the declaration's own card below, then its dependency graph, then a card for each dependency (type dependencies first, then the rest of the transitive closure). For a theorem, the graph and the dependency cards only follow its statement's dependencies (its proof is replaced by sorry, so what it proves doesn't depend on how); for everything else, both the type and the body/value are followed, since their content is part of what later declarations build on.
measurable_regret๐
Learning.IsBayesAlgEnvSeq.measurable_regretNo docstring.
Learning.IsBayesAlgEnvSeq.measurable_regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ฮฉ] [Countable ๐] {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {n : โ} (hE : Measurable E) (hA : โ (t : โ), Measurable (A t)) : Measurable (regret ฮบ E A n)Learning.IsBayesAlgEnvSeq.measurable_regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ฮฉ] [Countable ๐] {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {n : โ} (hE : Measurable E) (hA : โ (t : โ), Measurable (A t)) : Measurable (regret ฮบ E A n)
Code
lemma measurable_regret [Countable ๐] {ฮบ : Kernel (๐ ร ๐) โ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {n : โ}
(hE : Measurable E) (hA : โ t, Measurable (A t)) : Measurable (regret ฮบ E A n)Type uses (1)
Body uses (3)
Actions: Source ยท Open Issue
Proof
by rw [regret_eq_sum_gap'] fun_prop
Dependency graph
Type dependencies (1)
regret๐
Learning.IsBayesAlgEnvSeq.regret
A random variable that gives the regret at time n.
Learning.IsBayesAlgEnvSeq.regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (n : โ) (ฯ : ฮฉ) : โLearning.IsBayesAlgEnvSeq.regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (n : โ) (ฯ : ฮฉ) : โ
Code
noncomputable def regret (ฮบ : Kernel (๐ ร ๐) โ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (n : โ) (ฯ : ฮฉ) : โ := Bandits.regret (ฮบ.sectR (E ฯ)) A n ฯ
Body uses (1)
Used by (6)
Actions: Source ยท Open Issue
All dependencies, transitively (1)
regret๐
Bandits.regret
Regret of a sequence of pulls k : โ โ ๐ at time t for the reward kernel ฮฝ ; Kernel ๐ โ.
Bandits.regret.{u_1, u_2} {๐ : Type u_1} {ฮฉ : Type u_2} {m๐ : MeasurableSpace ๐} (ฮฝ : ProbabilityTheory.Kernel ๐ โ) (A : โ โ ฮฉ โ ๐) (t : โ) (ฯ : ฮฉ) : โBandits.regret.{u_1, u_2} {๐ : Type u_1} {ฮฉ : Type u_2} {m๐ : MeasurableSpace ๐} (ฮฝ : ProbabilityTheory.Kernel ๐ โ) (A : โ โ ฮฉ โ ๐) (t : โ) (ฯ : ฮฉ) : โ
Code
noncomputable def regret (ฮฝ : Kernel ๐ โ) (A : โ โ ฮฉ โ ๐) (t : โ) (ฯ : ฮฉ) : โ := t * (โจ a, (ฮฝ a)[id]) - โ s โ range t, (ฮฝ (A s ฯ))[id]
Used by (11)
Actions: Source ยท Open Issue