LeanMachineLearning exposition

Bandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'๐Ÿ”—

This page has the declaration's own card below, then its dependency graph, then a card for each dependency (type dependencies first, then the rest of the transitive closure). For a theorem, the graph and the dependency cards only follow its statement's dependencies (its proof is replaced by sorry, so what it proves doesn't depend on how); for everything else, both the type and the body/value are followed, since their content is part of what later declarations build on.

Minimal Lean file

prob_sum_range_sub_le_le_of_HasSubgaussianMGF'๐Ÿ”—

LemmaBandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'

No docstring.

๐Ÿ”—theorem
Bandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'.{u_1} {๐“ : Type u_1} {m๐“ : MeasurableSpace ๐“} {ฮฝ : ProbabilityTheory.Kernel ๐“ โ„} [ProbabilityTheory.IsMarkovKernel ฮฝ] {n : โ„•} {a : ๐“} {ฯƒ2 : NNReal} (hฯƒ2 : 0 < ฯƒ2) (h : ProbabilityTheory.HasSubgaussianMGF (fun x => x - โˆซ (x : โ„), id x โˆ‚ฮฝ a) ฯƒ2 (ฮฝ a)) {ฮด : โ„} (hฮด : 0 < ฮด) (hn : 0 < n) : (streamMeasure ฮฝ) {ฯ‰ | โˆ‘ k โˆˆ Finset.range n, (ฯ‰ k a - โˆซ (x : โ„), id x โˆ‚ฮฝ a) โ‰ค -โˆš(2 * โ†‘n * โ†‘ฯƒ2 * Real.log (1 / ฮด))} โ‰ค ENNReal.ofReal ฮด
Bandits.StreamMeasure.prob_sum_range_sub_le_le_of_HasSubgaussianMGF'.{u_1} {๐“ : Type u_1} {m๐“ : MeasurableSpace ๐“} {ฮฝ : ProbabilityTheory.Kernel ๐“ โ„} [ProbabilityTheory.IsMarkovKernel ฮฝ] {n : โ„•} {a : ๐“} {ฯƒ2 : NNReal} (hฯƒ2 : 0 < ฯƒ2) (h : ProbabilityTheory.HasSubgaussianMGF (fun x => x - โˆซ (x : โ„), id x โˆ‚ฮฝ a) ฯƒ2 (ฮฝ a)) {ฮด : โ„} (hฮด : 0 < ฮด) (hn : 0 < n) : (streamMeasure ฮฝ) {ฯ‰ | โˆ‘ k โˆˆ Finset.range n, (ฯ‰ k a - โˆซ (x : โ„), id x โˆ‚ฮฝ a) โ‰ค -โˆš(2 * โ†‘n * โ†‘ฯƒ2 * Real.log (1 / ฮด))} โ‰ค ENNReal.ofReal ฮด

Code

lemma prob_sum_range_sub_le_le_of_HasSubgaussianMGF' {ฯƒ2 : โ„โ‰ฅ0} (hฯƒ2 : 0 < ฯƒ2)
    (h : HasSubgaussianMGF (fun x โ†ฆ x - (ฮฝ a)[id]) ฯƒ2 (ฮฝ a)) {ฮด : โ„} (hฮด : 0 < ฮด) (hn : 0 < n) :
    streamMeasure ฮฝ {ฯ‰ | โˆ‘ k โˆˆ range n, (ฯ‰ k a - (ฮฝ a)[id]) โ‰ค
      -โˆš(2 * n * ฯƒ2 * Real.log (1 / ฮด))} โ‰ค ENNReal.ofReal ฮด
Type uses (1)
Body uses (1)
Used by (1)

Actions: Source ยท Open Issue

Proof
calc
  _ โ‰ค ENNReal.ofReal (Real.exp (-โˆš(2 * n * ฯƒ2 * Real.log (1 / ฮด)) ^ 2 / (2 * n * ฯƒ2))) :=
    prob_sum_range_sub_le_le_of_HasSubgaussianMGF h (by positivity) n
  _ โ‰ค ENNReal.ofReal ฮด := by
    gcongr
    exact exp_neg_sqrt_sq_div_le hฯƒ2 hฮด hn

Dependency graph

Type dependencies (1)

streamMeasure๐Ÿ”—

DefinitionBandits.streamMeasure

Measure of an infinite stream of rewards from each action.

๐Ÿ”—def
Bandits.streamMeasure.{u_1, u_2} {๐“ : Type u_1} {R : Type u_2} {m๐“ : MeasurableSpace ๐“} {mR : MeasurableSpace R} (ฮฝ : ProbabilityTheory.Kernel ๐“ R) : MeasureTheory.Measure (โ„• โ†’ ๐“ โ†’ R)
Bandits.streamMeasure.{u_1, u_2} {๐“ : Type u_1} {R : Type u_2} {m๐“ : MeasurableSpace ๐“} {mR : MeasurableSpace R} (ฮฝ : ProbabilityTheory.Kernel ๐“ R) : MeasureTheory.Measure (โ„• โ†’ ๐“ โ†’ R)

Code

noncomputable
def streamMeasure (ฮฝ : Kernel ๐“ R) : Measure (โ„• โ†’ ๐“ โ†’ R) :=
  Measure.infinitePi fun _ โ†ฆ Measure.infinitePi ฮฝ
Used by (56)

Actions: Source ยท Open Issue