3.4.ย SequentialLearning.BayesStationaryEnv
Bayesian stationary environments
This file defines the structure IsBayesAlgEnvSeq and provides its basic properties.
Main definitions
-
IsBayesAlgEnvSeq Q ฮบ alg E A Y P: states that there is a measureP : Measure ฮฉsuch that the parameterE : ฮฉ โ ๐has lawQand that the sequences of actionsA : โ โ ฮฉ โ ๐and feedbacksY : โ โ ฮฉ โ ๐จare generated by the algorithmalg : Algorithm ๐ ๐จinteracting with an underlying environment that depends onEandฮบ(stationaryEnv (ฮบ.sectR (E ฯ))). -
bayesTrajMeasure Q ฮบ alg: for any choice of probability measureQ : Measure ๐, Markov kernelฮบ : Kernel (๐ ร ๐) ๐จ, and algorithmalg : Algorithm ๐ ๐จ, provides a probability measureP : Measure (โ โ ๐ ร ๐ ร ๐จ)on a space that carriesE,A, andYsuch thatIsBayesAlgEnvSeq Q ฮบ alg E A Y P. -
bayesTrajMeasurePosterior Q ฮบ alg n: aKernel (Iic n โ ๐ ร ๐จ) ๐that represents the posterior overEgiven the history up to timenunder the priorQand the algorithmalg, assuming that the kernelฮบspecifies howEgives rise to the underlying (stationary) environment. See alsoLeanMachineLearning/SequentialLearning/AlgorithmDensityBayes.lean.
Main results
-
ae_IsAlgEnvSeq h: ifh : IsBayesAlgEnvSeq Q ฮบ alg E A Y P, forQ-almost everye : ๐,IsAlgEnvSeq A' Y' alg (stationaryEnv (ฮบ.sectR e)) (condDistrib (trajectory A Y) E P e)for some sequence of actionsA' : โ โ (โ โ ๐ ร ๐จ) โ ๐and sequence of feedbacksY' : โ โ (โ โ ๐ ร ๐จ) โ ๐จ. Intuitively, if the observable trajectory is generated by an underlying parametere : ๐, the measure that carries theIsBayesAlgEnvSeqstructure reveals a measure that carries anIsAlgEnvSeqstructure under the environmentstationaryEnv (ฮบ.sectR e)and the same algorithm. This allows transferring results from theIsAlgEnvSeqstructure to theIsBayesAlgEnvSeqstructure.
Module LeanMachineLearning.SequentialLearning.BayesStationaryEnv contains 17 exposed declarations.
IsBayesAlgEnvSeq๐
Learning.IsBayesAlgEnvSeq
IsBayesAlgEnvSeq Q ฮบ alg E A Y P states that there is a measure P : Measure ฮฉ such
that the parameter E : ฮฉ โ ๐ has law Q and that the sequences of actions A : โ โ ฮฉ โ ๐
and feedbacks Y : โ โ ฮฉ โ ๐จ are generated by the algorithm alg : Algorithm ๐ ๐จ interacting
with an underlying environment that depends on E and ฮบ (stationaryEnv (ฮบ.sectR (E ฯ))).
Learning.IsBayesAlgEnvSeq.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] (Q : MeasureTheory.Measure ๐) (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) (alg : Algorithm ๐ ๐จ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (Y : โ โ ฮฉ โ ๐จ) (P : MeasureTheory.Measure ฮฉ) [MeasureTheory.IsFiniteMeasure P] : PropLearning.IsBayesAlgEnvSeq.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] (Q : MeasureTheory.Measure ๐) (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) (alg : Algorithm ๐ ๐จ) (E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (Y : โ โ ฮฉ โ ๐จ) (P : MeasureTheory.Measure ฮฉ) [MeasureTheory.IsFiniteMeasure P] : Prop
Code
structure IsBayesAlgEnvSeq
(Q : Measure ๐) (ฮบ : Kernel (๐ ร ๐) ๐จ) (alg : Algorithm ๐ ๐จ)
(E : ฮฉ โ ๐) (A : โ โ ฮฉ โ ๐) (Y : โ โ ฮฉ โ ๐จ)
(P : Measure ฮฉ) [IsFiniteMeasure P] : Prop where
measurable_param : Measurable E := by fun_prop
measurable_action n : Measurable (A n) := by fun_prop
measurable_feedback n : Measurable (Y n) := by fun_prop
hasLaw_env : HasLaw E Q P
hasCondDistrib_action_zero : HasCondDistrib (A 0) E (Kernel.const _ alg.p0) P
hasCondDistrib_feedback_zero : HasCondDistrib (Y 0) (fun ฯ โฆ (E ฯ, A 0 ฯ)) ฮบ P
hasCondDistrib_action n :
HasCondDistrib (A (n + 1)) (fun ฯ โฆ (E ฯ, history A Y n ฯ))
((alg.policy n).prodMkLeft _) P
hasCondDistrib_feedback n :
HasCondDistrib (Y (n + 1)) (fun ฯ โฆ (history A Y n ฯ, E ฯ, A (n + 1) ฯ))
(ฮบ.prodMkLeft _) PActions: Source ยท Open Issue
hasLaw_action_zero๐
Learning.IsBayesAlgEnvSeq.hasLaw_action_zeroNo docstring.
Learning.IsBayesAlgEnvSeq.hasLaw_action_zero.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [MeasureTheory.IsProbabilityMeasure P] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : ProbabilityTheory.HasLaw (A 0) (Algorithm.p0 alg) PLearning.IsBayesAlgEnvSeq.hasLaw_action_zero.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [MeasureTheory.IsProbabilityMeasure P] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : ProbabilityTheory.HasLaw (A 0) (Algorithm.p0 alg) P
Code
lemma hasLaw_action_zero [IsProbabilityMeasure P] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) :
HasLaw (A 0) alg.p0 PType uses (2)
Body uses (1)
Actions: Source ยท Open Issue
Proof
h.hasCondDistrib_action_zero.hasLaw_of_const
hasCondDistrib_action'๐
Learning.IsBayesAlgEnvSeq.hasCondDistrib_action'No docstring.
Learning.IsBayesAlgEnvSeq.hasCondDistrib_action'.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : ProbabilityTheory.HasCondDistrib (A (n + 1)) (history A Y n) (Algorithm.policy alg n) PLearning.IsBayesAlgEnvSeq.hasCondDistrib_action'.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : ProbabilityTheory.HasCondDistrib (A (n + 1)) (history A Y n) (Algorithm.policy alg n) P
Code
lemma hasCondDistrib_action' (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) :
HasCondDistrib (A (n + 1)) (history A Y n) (alg.policy n) PType uses (3)
Body uses (1)
Used by (1)
Actions: Source ยท Open Issue
Proof
(h.hasCondDistrib_action n).comp_right
hasCondDistrib_feedback'๐
Learning.IsBayesAlgEnvSeq.hasCondDistrib_feedback'No docstring.
Learning.IsBayesAlgEnvSeq.hasCondDistrib_feedback'.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [ProbabilityTheory.IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : ProbabilityTheory.HasCondDistrib (Y (n + 1)) (fun ฯ => (E ฯ, A (n + 1) ฯ)) ฮบ PLearning.IsBayesAlgEnvSeq.hasCondDistrib_feedback'.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [ProbabilityTheory.IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : ProbabilityTheory.HasCondDistrib (Y (n + 1)) (fun ฯ => (E ฯ, A (n + 1) ฯ)) ฮบ P
Code
lemma hasCondDistrib_feedback' [IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) :
HasCondDistrib (Y (n + 1)) (fun ฯ โฆ (E ฯ, A (n + 1) ฯ)) ฮบ PType uses (2)
Body uses (1)
Actions: Source ยท Open Issue
Proof
(h.hasCondDistrib_feedback n).comp_right
hasLaw_IT_action_zero๐
Learning.IsBayesAlgEnvSeq.hasLaw_IT_action_zeroNo docstring.
Learning.IsBayesAlgEnvSeq.hasLaw_IT_action_zero.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : โแต (e : ๐) โQ, ProbabilityTheory.HasLaw (IT.action 0) (Algorithm.p0 alg) (๐[trajectory A Y | E; P] e)Learning.IsBayesAlgEnvSeq.hasLaw_IT_action_zero.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : โแต (e : ๐) โQ, ProbabilityTheory.HasLaw (IT.action 0) (Algorithm.p0 alg) (๐[trajectory A Y | E; P] e)
Code
lemma hasLaw_IT_action_zero (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) :
โแต e โQ, HasLaw (IT.action 0) alg.p0 (condDistrib (trajectory A Y) E P e)Type uses (4)
Used by (1)
Actions: Source ยท Open Issue
Proof
by
rw [โ h.hasLaw_env.map_eq]
filter_upwards [condDistrib_comp E
((measurable_trajectory h.measurable_action h.measurable_feedback).aemeasurable)
(IT.measurable_action (๐ := ๐) (๐จ := ๐จ) 0),
h.hasCondDistrib_action_zero.condDistrib_eq] with _ hc hcd
exact โจ(IT.measurable_action 0).aemeasurable, by
rw [โ Kernel.map_apply _ (IT.measurable_action 0), โ hc,
show IT.action 0 โ trajectory A Y = A 0 from rfl, hcd, Kernel.const_apply]โฉ
hasCondDistrib_IT_feedback_zero๐
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedback_zeroNo docstring.
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedback_zero.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] [ProbabilityTheory.IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : โแต (e : ๐) โQ, ProbabilityTheory.HasCondDistrib (IT.feedback 0) (IT.action 0) (ProbabilityTheory.Kernel.sectR ฮบ e) (๐[trajectory A Y | E; P] e)Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedback_zero.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] [ProbabilityTheory.IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : โแต (e : ๐) โQ, ProbabilityTheory.HasCondDistrib (IT.feedback 0) (IT.action 0) (ProbabilityTheory.Kernel.sectR ฮบ e) (๐[trajectory A Y | E; P] e)
Code
lemma hasCondDistrib_IT_feedback_zero [IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) :
โแต e โQ, HasCondDistrib (IT.feedback 0) (IT.action 0) (ฮบ.sectR e)
(condDistrib (trajectory A Y) E P e)Type uses (5)
Used by (1)
Actions: Source ยท Open Issue
Proof
by
rw [โ h.hasLaw_env.map_eq]
exact h.hasCondDistrib_feedback_zero.hasCondDistrib_sectR
(IT.measurable_action 0) (IT.measurable_feedback 0)
(measurable_trajectory h.measurable_action h.measurable_feedback).aemeasurable
hasCondDistrib_IT_action๐
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_actionNo docstring.
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_action.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : โแต (e : ๐) โQ, ProbabilityTheory.HasCondDistrib (IT.action (n + 1)) (IT.hist n) (Algorithm.policy alg n) (๐[trajectory A Y | E; P] e)Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_action.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : โแต (e : ๐) โQ, ProbabilityTheory.HasCondDistrib (IT.action (n + 1)) (IT.hist n) (Algorithm.policy alg n) (๐[trajectory A Y | E; P] e)
Code
lemma hasCondDistrib_IT_action (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) :
โแต e โQ, HasCondDistrib (IT.action (n + 1)) (IT.hist n) (alg.policy n)
(condDistrib (trajectory A Y) E P e)Type uses (5)
Body uses (6)
Used by (1)
Actions: Source ยท Open Issue
Proof
by
rw [โ h.hasLaw_env.map_eq]
filter_upwards [(h.hasCondDistrib_action n).hasCondDistrib_sectR
(IT.measurable_hist n) (IT.measurable_action (n + 1))
(measurable_trajectory h.measurable_action h.measurable_feedback).aemeasurable] with _ he
rwa [Kernel.sectR_prodMkLeft] at he
hasCondDistrib_IT_feedback๐
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedbackNo docstring.
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedback.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] [ProbabilityTheory.IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : โแต (e : ๐) โQ, ProbabilityTheory.HasCondDistrib (IT.feedback (n + 1)) (fun ฯ => (IT.hist n ฯ, IT.action (n + 1) ฯ)) (ProbabilityTheory.Kernel.prodMkLeft (โฅ(Finset.Iic n) โ ๐ ร ๐จ) (ProbabilityTheory.Kernel.sectR ฮบ e)) (๐[trajectory A Y | E; P] e)Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedback.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] [ProbabilityTheory.IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : โแต (e : ๐) โQ, ProbabilityTheory.HasCondDistrib (IT.feedback (n + 1)) (fun ฯ => (IT.hist n ฯ, IT.action (n + 1) ฯ)) (ProbabilityTheory.Kernel.prodMkLeft (โฅ(Finset.Iic n) โ ๐ ร ๐จ) (ProbabilityTheory.Kernel.sectR ฮบ e)) (๐[trajectory A Y | E; P] e)
Code
lemma hasCondDistrib_IT_feedback [IsFiniteKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P)
(n : โ) :
โแต e โQ, HasCondDistrib (IT.feedback (n + 1)) (fun ฯ โฆ (IT.hist n ฯ, IT.action (n + 1) ฯ))
((ฮบ.sectR e).prodMkLeft _) (condDistrib (trajectory A Y) E P e)Type uses (6)
Body uses (6)
Used by (1)
Actions: Source ยท Open Issue
Proof
by
rw [โ h.hasLaw_env.map_eq]
have hc : HasCondDistrib (Y (n + 1))
(fun ฯ โฆ (E ฯ, history A Y n ฯ, A (n + 1) ฯ))
(ฮบ.comap (fun (e, _, a) โฆ (e, a)) (by fun_prop)) P :=
(h.hasCondDistrib_feedback n).measurableEquiv_comp_right (MeasurableEquiv.prodAssoc.symm.trans
((MeasurableEquiv.prodCongr .prodComm (.refl _)).trans .prodAssoc))
exact hc.hasCondDistrib_sectR ((IT.measurable_hist n).prodMk
(IT.measurable_action (n + 1))) (IT.measurable_feedback (n + 1))
(measurable_trajectory h.measurable_action h.measurable_feedback).aemeasurable
hasLaw_IT_hist๐
Learning.IsBayesAlgEnvSeq.hasLaw_IT_histNo docstring.
Learning.IsBayesAlgEnvSeq.hasLaw_IT_hist.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : โแต (e : ๐) โQ, ProbabilityTheory.HasLaw (IT.hist n) (๐[history A Y n | E; P] e) (๐[trajectory A Y | E; P] e)Learning.IsBayesAlgEnvSeq.hasLaw_IT_hist.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) : โแต (e : ๐) โQ, ProbabilityTheory.HasLaw (IT.hist n) (๐[history A Y n | E; P] e) (๐[trajectory A Y | E; P] e)
Code
lemma hasLaw_IT_hist (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) (n : โ) :
โแต e โQ, HasLaw (IT.hist n) (condDistrib (history A Y n) E P e)
(condDistrib (trajectory A Y) E P e)Type uses (5)
Body uses (2)
Actions: Source ยท Open Issue
Proof
by
rw [โ h.hasLaw_env.map_eq, show history A Y n = IT.hist n โ trajectory A Y from rfl]
filter_upwards [condDistrib_comp E
(measurable_trajectory h.measurable_action h.measurable_feedback).aemeasurable
(IT.measurable_hist n)] with _ he
exact โจ(IT.measurable_hist n).aemeasurable, by
rw [โ Kernel.map_apply _ (IT.measurable_hist n), he]โฉ
ae_IsAlgEnvSeq๐
Learning.IsBayesAlgEnvSeq.ae_IsAlgEnvSeqNo docstring.
Learning.IsBayesAlgEnvSeq.ae_IsAlgEnvSeq.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] [ProbabilityTheory.IsMarkovKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : โแต (e : ๐) โQ, IsAlgEnvSeq IT.action IT.feedback alg (stationaryEnv (ProbabilityTheory.Kernel.sectR ฮบ e)) (๐[trajectory A Y | E; P] e)Learning.IsBayesAlgEnvSeq.ae_IsAlgEnvSeq.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} {alg : Algorithm ๐ ๐จ} {E : ฮฉ โ ๐} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsFiniteMeasure P] [StandardBorelSpace ๐] [Nonempty ๐] [StandardBorelSpace ๐จ] [Nonempty ๐จ] [ProbabilityTheory.IsMarkovKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) : โแต (e : ๐) โQ, IsAlgEnvSeq IT.action IT.feedback alg (stationaryEnv (ProbabilityTheory.Kernel.sectR ฮบ e)) (๐[trajectory A Y | E; P] e)
Code
lemma ae_IsAlgEnvSeq [IsMarkovKernel ฮบ] (h : IsBayesAlgEnvSeq Q ฮบ alg E A Y P) :
โแต e โQ, IsAlgEnvSeq IT.action IT.feedback alg (stationaryEnv (ฮบ.sectR e))
(condDistrib (trajectory A Y) E P e)Type uses (7)
Body uses (9)
Used by (3)
Actions: Source ยท Open Issue
Proof
by
filter_upwards [hasLaw_IT_action_zero h, hasCondDistrib_IT_feedback_zero h,
ae_all_iff.2 (hasCondDistrib_IT_action h), ae_all_iff.2 (hasCondDistrib_IT_feedback h)]
with _ ha0 hr0 hA hR
exact โจIT.measurable_action, IT.measurable_feedback, ha0, hr0, hA, hRโฉ
bayesStationaryEnv๐
Learning.bayesStationaryEnv
An environment with observations in ๐ ร ๐จ. The first element e of an observation is
sampled from Q once and remains constant. The second element of an observation is sampled from
ฮบ (e, a), where a is the corresponding action.
Learning.bayesStationaryEnv.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] : Environment ๐ (๐ ร ๐จ)Learning.bayesStationaryEnv.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] : Environment ๐ (๐ ร ๐จ)
Code
noncomputable
def bayesStationaryEnv (Q : Measure ๐) [IsProbabilityMeasure Q] (ฮบ : Kernel (๐ ร ๐) ๐จ)
[IsMarkovKernel ฮบ] : Environment ๐ (๐ ร ๐จ) where
feedback n :=
let g : (Iic n โ ๐ ร ๐ ร ๐จ) ร ๐ โ ๐ ร ๐ := fun (h, a) => ((h โจ0, by simpโฉ).2.1, a)
(Kernel.deterministic (Prod.fst โ g) (by fun_prop)) รโ (ฮบ.comap g (by fun_prop))
ฮฝ0 := (Kernel.const _ Q) โโ ฮบ.swapLeftType uses (1)
Used by (4)
Actions: Source ยท Open Issue
isBayesAlgEnvSeq๐
Learning.IsAlgEnvSeq.isBayesAlgEnvSeqNo docstring.
Learning.IsAlgEnvSeq.isBayesAlgEnvSeq.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} [MeasureTheory.IsProbabilityMeasure Q] {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} [ProbabilityTheory.IsMarkovKernel ฮบ] {alg : Algorithm ๐ ๐จ} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐ ร ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] (h : IsAlgEnvSeq A Y (Algorithm.prodLeft ๐ alg) (bayesStationaryEnv Q ฮบ) P) : IsBayesAlgEnvSeq Q ฮบ alg (fun ฯ => Prod.fst (Y 0 ฯ)) A (fun n ฯ => Prod.snd (Y n ฯ)) PLearning.IsAlgEnvSeq.isBayesAlgEnvSeq.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} {ฮฉ : Type u_4} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [MeasurableSpace ฮฉ] {Q : MeasureTheory.Measure ๐} [MeasureTheory.IsProbabilityMeasure Q] {ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ} [ProbabilityTheory.IsMarkovKernel ฮบ] {alg : Algorithm ๐ ๐จ} {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐ ร ๐จ} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] (h : IsAlgEnvSeq A Y (Algorithm.prodLeft ๐ alg) (bayesStationaryEnv Q ฮบ) P) : IsBayesAlgEnvSeq Q ฮบ alg (fun ฯ => Prod.fst (Y 0 ฯ)) A (fun n ฯ => Prod.snd (Y n ฯ)) P
Code
lemma IsAlgEnvSeq.isBayesAlgEnvSeq
(h : IsAlgEnvSeq A Y (alg.prodLeft ๐) (bayesStationaryEnv Q ฮบ) P) :
IsBayesAlgEnvSeq Q ฮบ alg (fun ฯ โฆ (Y 0 ฯ).1) A (fun n ฯ โฆ (Y n ฯ).2) P where
measurable_paramType uses (5)
Body uses (6)
Used by (1)
Actions: Source ยท Open Issue
Proof
(h.measurable_feedback 0).fst
measurable_action := h.measurable_action
measurable_feedback n := (h.measurable_feedback n).snd
hasLaw_env := by
apply HasCondDistrib.hasLaw_of_const
simpa [bayesStationaryEnv] using h.hasCondDistrib_feedback_zero.fst
hasCondDistrib_action_zero := by
have hc : HasCondDistrib (fun ฯ โฆ (Y 0 ฯ).1) (A 0) (Kernel.const _ Q) P := by
simpa [bayesStationaryEnv] using h.hasCondDistrib_feedback_zero.fst
simpa [h.hasLaw_action_zero.map_eq, Algorithm.prodLeft] using hc.const_map_of_const
hasCondDistrib_feedback_zero :=
h.hasCondDistrib_feedback_zero.of_compProd.measurableEquiv_comp_right MeasurableEquiv.prodComm
hasCondDistrib_action n := by
let f : (Iic n โ ๐ ร ๐ ร ๐จ) โ ๐ ร (Iic n โ ๐ ร ๐จ) :=
fun h โฆ ((h โจ0, by simpโฉ).2.1, fun i โฆ ((h i).1, (h i).2.2))
have hc : HasCondDistrib (A (n + 1)) (history A Y n)
(((alg.policy n).comap Prod.snd (by fun_prop)).comap f (by fun_prop)) P :=
h.hasCondDistrib_action n
exact hc.comp_right (f := f)
hasCondDistrib_feedback n := by
let f : (Iic n โ ๐ ร ๐ ร ๐จ) ร ๐ โ (Iic n โ ๐ ร ๐จ) ร ๐ ร ๐ :=
fun p โฆ ((fun i โฆ ((p.1 i).1, (p.1 i).2.2)), (p.1 โจ0, by simpโฉ).2.1, p.2)
have hc : HasCondDistrib (fun ฯ โฆ (Y (n + 1) ฯ).2)
(fun ฯ โฆ (history A Y n ฯ, A (n + 1) ฯ))
((Kernel.prodMkLeft ((Iic n) โ ๐ ร ๐จ) ฮบ).comap f (by fun_prop)) P := by
simpa [bayesStationaryEnv, Kernel.prodMkLeft, โ Kernel.comap_comp_right, Function.comp_def]
using (h.hasCondDistrib_feedback n).snd
exact hc.comp_right
bayesTrajMeasure๐
Learning.IT.bayesTrajMeasure
A measure P on a measurable space that carries random variables E, A, and Y such that
IsBayesAlgEnvSeq Q ฮบ alg E A Y P.
Learning.IT.bayesTrajMeasure.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) : MeasureTheory.Measure (โ โ ๐ ร ๐ ร ๐จ)Learning.IT.bayesTrajMeasure.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) : MeasureTheory.Measure (โ โ ๐ ร ๐ ร ๐จ)
Code
noncomputable
def bayesTrajMeasure (Q : Measure ๐) [IsProbabilityMeasure Q] (ฮบ : Kernel (๐ ร ๐) ๐จ)
[IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) : Measure (โ โ ๐ ร ๐ ร ๐จ) :=
trajMeasure (alg.prodLeft ๐) (bayesStationaryEnv Q ฮบ)
deriving IsProbabilityMeasureType uses (1)
Body uses (3)
Used by (5)
Actions: Source ยท Open Issue
instIsProbabilityMeasureForallNatProdBayesTrajMeasure๐
Learning.IT.instIsProbabilityMeasureForallNatProdBayesTrajMeasureNo docstring.
Learning.IT.instIsProbabilityMeasureForallNatProdBayesTrajMeasure.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) : MeasureTheory.IsProbabilityMeasure (bayesTrajMeasure Q ฮบ alg)Learning.IT.instIsProbabilityMeasureForallNatProdBayesTrajMeasure.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) : MeasureTheory.IsProbabilityMeasure (bayesTrajMeasure Q ฮบ alg)
Code
deriving IsProbabilityMeasure
Type uses (2)
Used by (4)
Actions: Source ยท Open Issue
Proof
deriving IsProbabilityMeasure
isBayesAlgEnvSeq_bayesTrajMeasure๐
Learning.IT.isBayesAlgEnvSeq_bayesTrajMeasureNo docstring.
Learning.IT.isBayesAlgEnvSeq_bayesTrajMeasure.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) : IsBayesAlgEnvSeq Q ฮบ alg (fun ฯ => Prod.fst (Prod.snd (ฯ 0))) action (fun n ฯ => Prod.snd (Prod.snd (ฯ n))) (bayesTrajMeasure Q ฮบ alg)Learning.IT.isBayesAlgEnvSeq_bayesTrajMeasure.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) : IsBayesAlgEnvSeq Q ฮบ alg (fun ฯ => Prod.fst (Prod.snd (ฯ 0))) action (fun n ฯ => Prod.snd (Prod.snd (ฯ n))) (bayesTrajMeasure Q ฮบ alg)
Code
lemma isBayesAlgEnvSeq_bayesTrajMeasure
(Q : Measure ๐) [IsProbabilityMeasure Q] (ฮบ : Kernel (๐ ร ๐) ๐จ) [IsMarkovKernel ฮบ]
(alg : Algorithm ๐ ๐จ) :
IsBayesAlgEnvSeq Q ฮบ alg (fun ฯ โฆ (ฯ 0).2.1) action (fun n ฯ โฆ (ฯ n).2.2)
(bayesTrajMeasure Q ฮบ alg)Type uses (5)
Body uses (7)
Used by (1)
Actions: Source ยท Open Issue
Proof
(isAlgEnvSeq_trajMeasure _ _).isBayesAlgEnvSeq
bayesTrajMeasurePosterior๐
Learning.IT.bayesTrajMeasurePosterior
A kernel that represents the posterior over E given the history up to time n.
Learning.IT.bayesTrajMeasurePosterior.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [StandardBorelSpace ๐] [Nonempty ๐] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) (n : โ) : ProbabilityTheory.Kernel (โฅ(Finset.Iic n) โ ๐ ร ๐จ) ๐Learning.IT.bayesTrajMeasurePosterior.{u_1, u_2, u_3} {๐ : Type u_1} {๐ : Type u_2} {๐จ : Type u_3} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [StandardBorelSpace ๐] [Nonempty ๐] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) (n : โ) : ProbabilityTheory.Kernel (โฅ(Finset.Iic n) โ ๐ ร ๐จ) ๐
Code
noncomputable
def bayesTrajMeasurePosterior [StandardBorelSpace ๐] [Nonempty ๐]
(Q : Measure ๐) [IsProbabilityMeasure Q] (ฮบ : Kernel (๐ ร ๐) ๐จ) [IsMarkovKernel ฮบ]
(alg : Algorithm ๐ ๐จ) (n : โ) : Kernel (Iic n โ ๐ ร ๐จ) ๐ :=
condDistrib (fun ฯ โฆ (ฯ 0).2.1) (history action (fun n ฯ โฆ (ฯ n).2.2) n)
(bayesTrajMeasure Q ฮบ alg)
deriving IsMarkovKernelType uses (1)
Used by (4)
Actions: Source ยท Open Issue
instIsMarkovKernelForallSubtypeNatMemFinsetIicProdBayesTrajMeasurePosterior๐
Learning.IT.instIsMarkovKernelForallSubtypeNatMemFinsetIicProdBayesTrajMeasurePosteriorNo docstring.
Learning.IT.instIsMarkovKernelForallSubtypeNatMemFinsetIicProdBayesTrajMeasurePosterior.{u_1, u_2, u_3} {๐ : Type u_3} {๐ : Type u_1} {๐จ : Type u_2} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [StandardBorelSpace ๐] [Nonempty ๐] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) (n : โ) : ProbabilityTheory.IsMarkovKernel (bayesTrajMeasurePosterior Q ฮบ alg n)Learning.IT.instIsMarkovKernelForallSubtypeNatMemFinsetIicProdBayesTrajMeasurePosterior.{u_1, u_2, u_3} {๐ : Type u_3} {๐ : Type u_1} {๐จ : Type u_2} [MeasurableSpace ๐] [MeasurableSpace ๐] [MeasurableSpace ๐จ] [StandardBorelSpace ๐] [Nonempty ๐] (Q : MeasureTheory.Measure ๐) [MeasureTheory.IsProbabilityMeasure Q] (ฮบ : ProbabilityTheory.Kernel (๐ ร ๐) ๐จ) [ProbabilityTheory.IsMarkovKernel ฮบ] (alg : Algorithm ๐ ๐จ) (n : โ) : ProbabilityTheory.IsMarkovKernel (bayesTrajMeasurePosterior Q ฮบ alg n)
Code
deriving IsMarkovKernel
Type uses (2)
Actions: Source ยท Open Issue
Proof
deriving IsMarkovKernel
-
Learning.IsBayesAlgEnvSeq -
Learning.IsBayesAlgEnvSeq.hasLaw_action_zero -
Learning.IsBayesAlgEnvSeq.hasCondDistrib_action' -
Learning.IsBayesAlgEnvSeq.hasCondDistrib_feedback' -
Learning.IsBayesAlgEnvSeq.hasLaw_IT_action_zero -
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedback_zero -
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_action -
Learning.IsBayesAlgEnvSeq.hasCondDistrib_IT_feedback -
Learning.IsBayesAlgEnvSeq.hasLaw_IT_hist -
Learning.IsBayesAlgEnvSeq.ae_IsAlgEnvSeq -
Learning.bayesStationaryEnv -
Learning.IsAlgEnvSeq.isBayesAlgEnvSeq -
Learning.IT.bayesTrajMeasure -
Learning.IT.instIsProbabilityMeasureForallNatProdBayesTrajMeasure -
Learning.IT.isBayesAlgEnvSeq_bayesTrajMeasure -
Learning.IT.bayesTrajMeasurePosterior -
Learning.IT.instIsMarkovKernelForallSubtypeNatMemFinsetIicProdBayesTrajMeasurePosterior