Learning.IsBayesAlgEnvSeq.integrable_regret
This page has the declaration's own card below, then its dependency graph, then a card for each dependency (type dependencies first, then the rest of the transitive closure). For a theorem, the graph and the dependency cards only follow its statement's dependencies (its proof is replaced by sorry, so what it proves doesn't depend on how); for everything else, both the type and the body/value are followed, since their content is part of what later declarations build on.
integrable_regret๐
Learning.IsBayesAlgEnvSeq.integrable_regretNo docstring.
Learning.IsBayesAlgEnvSeq.integrable_regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ฮฉ] [Countable ๐] [Nonempty ๐] {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {n : โ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] (hE : Measurable E) (hA : โ (t : โ), Measurable (A t)) {l u : โ} (h : โ (e : ๐) (a : ๐), โซ (x : โ), id x โฮบ (e, a) โ Set.Icc l u) : MeasureTheory.Integrable (regret ฮบ E A n) PLearning.IsBayesAlgEnvSeq.integrable_regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ฮฉ] [Countable ๐] [Nonempty ๐] {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {n : โ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] (hE : Measurable E) (hA : โ (t : โ), Measurable (A t)) {l u : โ} (h : โ (e : ๐) (a : ๐), โซ (x : โ), id x โฮบ (e, a) โ Set.Icc l u) : MeasureTheory.Integrable (regret ฮบ E A n) P
Code
lemma integrable_regret [Countable ๐] [Nonempty ๐] {ฮบ : Kernel (๐ ร ๐) โ} {E : ฮฉ โ ๐}
{A : โ โ ฮฉ โ ๐} {n : โ} {P : Measure ฮฉ} [IsFiniteMeasure P] (hE : Measurable E)
(hA : โ t, Measurable (A t)) {l u : โ} (h : โ e a, (ฮบ (e, a))[id] โ Set.Icc l u) :
Integrable (regret ฮบ E A n) PType uses (1)
Body uses (3)
Actions: Source ยท Open Issue
Proof
by rw [regret_eq_sum_gap'] exact integrable_finsetSum _ (fun _ _ โฆ integrable_gap hE hA h)
Dependency graph
Type dependencies (1)
regret๐
Learning.IsBayesAlgEnvSeq.regret
A random variable that gives the regret at time n.
Learning.IsBayesAlgEnvSeq.regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (n : โ) (ฯ : ฮฉ) : โLearning.IsBayesAlgEnvSeq.regret.{u_1, u_2, u_4} {๐ : Type u_1} {๐ : Type u_2} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) โ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (n : โ) (ฯ : ฮฉ) : โ
Code
noncomputable def regret (ฮบ : Kernel (๐ ร ๐) โ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (n : โ) (ฯ : ฮฉ) : โ := Bandits.regret (ฮบ.sectR (E ฯ)) A n ฯ
Body uses (1)
Used by (6)
Actions: Source ยท Open Issue
All dependencies, transitively (1)
regret๐
Bandits.regret
Regret of a sequence of pulls k : โ โ ๐ at time t for the reward kernel ฮฝ ; Kernel ๐ โ.
Bandits.regret.{u_1, u_2} {๐ : Type u_1} {ฮฉ : Type u_2} {m๐ : MeasurableSpace ๐} (ฮฝ : ProbabilityTheory.Kernel ๐ โ) (A : โ โ ฮฉ โ ๐) (t : โ) (ฯ : ฮฉ) : โBandits.regret.{u_1, u_2} {๐ : Type u_1} {ฮฉ : Type u_2} {m๐ : MeasurableSpace ๐} (ฮฝ : ProbabilityTheory.Kernel ๐ โ) (A : โ โ ฮฉ โ ๐) (t : โ) (ฯ : ฮฉ) : โ
Code
noncomputable def regret (ฮฝ : Kernel ๐ โ) (A : โ โ ฮฉ โ ๐) (t : โ) (ฯ : ฮฉ) : โ := t * (โจ a, (ฮฝ a)[id]) - โ s โ range t, (ฮฝ (A s ฯ))[id]
Used by (11)
Actions: Source ยท Open Issue