Bandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'
This page has the declaration's own card below, then its dependency graph, then a card for each dependency (type dependencies first, then the rest of the transitive closure). For a theorem, the graph and the dependency cards only follow its statement's dependencies (its proof is replaced by sorry, so what it proves doesn't depend on how); for everything else, both the type and the body/value are followed, since their content is part of what later declarations build on.
prob_sum_range_sub_le_le_of_HasSubgaussianMGF'๐
Bandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'No docstring.
Bandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'.{u_1} {๐ : Type u_1} {m๐ : MeasurableSpace ๐} {ฮฝ : ProbabilityTheory.Kernel ๐ โ} [ProbabilityTheory.IsMarkovKernel ฮฝ] {n : โ} {a : ๐} {ฯ2 : NNReal} (hฯ2 : 0 < ฯ2) (h : ProbabilityTheory.HasSubgaussianMGF (fun x => x - โซ (x : โ), id x โฮฝ a) ฯ2 (ฮฝ a)) {ฮด : โ} (hฮด : 0 < ฮด) (hn : 0 < n) : (streamMeasure ฮฝ) {ฯ | โ k โ Finset.range n, (ฯ k a - โซ (x : โ), id x โฮฝ a) โค -โ(2 * โn * โฯ2 * Real.log (1 / ฮด))} โค ENNReal.ofReal ฮดBandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'.{u_1} {๐ : Type u_1} {m๐ : MeasurableSpace ๐} {ฮฝ : ProbabilityTheory.Kernel ๐ โ} [ProbabilityTheory.IsMarkovKernel ฮฝ] {n : โ} {a : ๐} {ฯ2 : NNReal} (hฯ2 : 0 < ฯ2) (h : ProbabilityTheory.HasSubgaussianMGF (fun x => x - โซ (x : โ), id x โฮฝ a) ฯ2 (ฮฝ a)) {ฮด : โ} (hฮด : 0 < ฮด) (hn : 0 < n) : (streamMeasure ฮฝ) {ฯ | โ k โ Finset.range n, (ฯ k a - โซ (x : โ), id x โฮฝ a) โค -โ(2 * โn * โฯ2 * Real.log (1 / ฮด))} โค ENNReal.ofReal ฮด
Code
lemma prob_sum_range_sub_le_le_of_HasSubgaussianMGF' {ฯ2 : โโฅ0} (hฯ2 : 0 < ฯ2)
(h : HasSubgaussianMGF (fun x โฆ x - (ฮฝ a)[id]) ฯ2 (ฮฝ a)) {ฮด : โ} (hฮด : 0 < ฮด) (hn : 0 < n) :
streamMeasure ฮฝ {ฯ | โ k โ range n, (ฯ k a - (ฮฝ a)[id]) โค
-โ(2 * n * ฯ2 * Real.log (1 / ฮด))} โค ENNReal.ofReal ฮดType uses (1)
Body uses (1)
Used by (1)
Actions: Source ยท Open Issue
Proof
calc
_ โค ENNReal.ofReal (Real.exp (-โ(2 * n * ฯ2 * Real.log (1 / ฮด)) ^ 2 / (2 * n * ฯ2))) :=
prob_sum_range_sub_le_le_of_HasSubgaussianMGF h (by positivity) n
_ โค ENNReal.ofReal ฮด := by
gcongr
exact exp_neg_sqrt_sq_div_le hฯ2 hฮด hnDependency graph
Type dependencies (1)
streamMeasure๐
Bandits.streamMeasureMeasure of an infinite stream of rewards from each action.
Bandits.streamMeasure.{u_1, u_2} {๐ : Type u_1} {R : Type u_2} {m๐ : MeasurableSpace ๐} {mR : MeasurableSpace R} (ฮฝ : ProbabilityTheory.Kernel ๐ R) : MeasureTheory.Measure (โ โ ๐ โ R)Bandits.streamMeasure.{u_1, u_2} {๐ : Type u_1} {R : Type u_2} {m๐ : MeasurableSpace ๐} {mR : MeasurableSpace R} (ฮฝ : ProbabilityTheory.Kernel ๐ R) : MeasureTheory.Measure (โ โ ๐ โ R)
Code
noncomputable def streamMeasure (ฮฝ : Kernel ๐ R) : Measure (โ โ ๐ โ R) := Measure.infinitePi fun _ โฆ Measure.infinitePi ฮฝ
Used by (56)
Actions: Source ยท Open Issue