3.12.ย SequentialLearning.EvaluationEnv
Function evaluation environments
We define two environments, onlineEvalEnv and evalEnv, where the feedback is given by evaluating
a measurable function at the chosen action. The first one allows the function to change at every
time step, while the second one uses a fixed function at every time step.
Main definitions
-
onlineEvalEnv g hg: A stationary environment where the feedback at timenis given by a deterministic kernel that evaluates the measurable functiong nat the chosen action. -
evalEnv f hf: A stationary environment where the feedback is given by a deterministic kernel that evaluates a fixed measurable functionfat the chosen action.
They both satisfy the typeclasses IsObliviousEnv and IsDeterministicEnv.
Main statements
-
forall_feedback_onlineEvalEnv_ae_eq_eval_action: For almost allฯ, the feedback at timenis equal tog nevaluated at the action taken at timen. -
forall_feedback_evalEnv_ae_eq_eval_action: For almost allฯ, the feedback at timenis equal tofevaluated at the action taken at timen.
Module LeanMachineLearning.SequentialLearning.EvaluationEnv contains 19 exposed declarations.
onlineEvalEnv๐
Learning.onlineEvalEnv
The evaluation environment where the feedback is given by evaluating a fixed measurable function
f at the chosen action.
Learning.onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} (g : โ โ ๐ โ ๐จ) (hg : โ (n : โ), Measurable (g n)) : Environment ๐ ๐จLearning.onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} (g : โ โ ๐ โ ๐จ) (hg : โ (n : โ), Measurable (g n)) : Environment ๐ ๐จ
Code
noncomputable def onlineEvalEnv (g : โ โ ๐ โ ๐จ) (hg : โ n, Measurable (g n)) := obliviousEnv (fun n โฆ Kernel.deterministic (g n) (hg n))
Type uses (1)
Body uses (1)
Used by (11)
Actions: Source ยท Open Issue
instIsObliviousEnvOnlineEvalEnv๐
Learning.instIsObliviousEnvOnlineEvalEnvNo docstring.
Learning.instIsObliviousEnvOnlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} : IsObliviousEnv (onlineEvalEnv g hg)Learning.instIsObliviousEnvOnlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} : IsObliviousEnv (onlineEvalEnv g hg)
Code
instance : IsObliviousEnv (onlineEvalEnv g hg)
Type uses (2)
Body uses (1)
Used by (3)
Actions: Source ยท Open Issue
Proof
โจโจfun n โฆ Kernel.deterministic (g n) (hg n), fun _ โฆ inferInstance, rfl, fun _ โฆ rflโฉโฉ
instIsDeterministicEnvOnlineEvalEnv๐
Learning.instIsDeterministicEnvOnlineEvalEnvNo docstring.
Learning.instIsDeterministicEnvOnlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} : IsDeterministicEnv (onlineEvalEnv g hg)Learning.instIsDeterministicEnvOnlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} : IsDeterministicEnv (onlineEvalEnv g hg)
Code
instance : IsDeterministicEnv (onlineEvalEnv g hg) where exists_f0
Type uses (2)
Body uses (1)
Actions: Source ยท Open Issue
Proof
โจg 0, hg 0, rflโฉ exists_f n := โจfun p โฆ g (n + 1) p.2, by fun_prop, rflโฉ
feedbackCondAction_onlineEvalEnv๐
Learning.feedbackCondAction_onlineEvalEnvNo docstring.
Learning.feedbackCondAction_onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} (n : โ) : feedbackCondAction (onlineEvalEnv g hg) n = ProbabilityTheory.Kernel.deterministic (g n) โฏLearning.feedbackCondAction_onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} (n : โ) : feedbackCondAction (onlineEvalEnv g hg) n = ProbabilityTheory.Kernel.deterministic (g n) โฏ
Code
lemma feedbackCondAction_onlineEvalEnv (n : โ) :
feedbackCondAction (onlineEvalEnv g hg) n = Kernel.deterministic (g n) (hg n)Type uses (3)
Body uses (1)
Actions: Source ยท Open Issue
Proof
by simp [onlineEvalEnv]
feedbackFunZero_onlineEvalEnv๐
Learning.feedbackFunZero_onlineEvalEnvNo docstring.
Learning.feedbackFunZero_onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} [MeasurableSpace.SeparatesPoints ๐จ] : feedbackFunZero (onlineEvalEnv g hg) = g 0Learning.feedbackFunZero_onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} [MeasurableSpace.SeparatesPoints ๐จ] : feedbackFunZero (onlineEvalEnv g hg) = g 0
Code
lemma feedbackFunZero_onlineEvalEnv [MeasurableSpace.SeparatesPoints ๐จ] :
feedbackFunZero (onlineEvalEnv g hg) = g 0Type uses (3)
Body uses (6)
Used by (1)
Actions: Source ยท Open Issue
Proof
by
have h_eq := ฮฝ0_eq_deterministic (onlineEvalEnv g hg)
simpa only [onlineEvalEnv, ฮฝ0_obliviousEnv, Kernel.prodMkLeft_deterministic,
Kernel.deterministic_inj] using h_eq.symm
feedbackFun_onlineEvalEnv๐
Learning.feedbackFun_onlineEvalEnvNo docstring.
Learning.feedbackFun_onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} [MeasurableSpace.SeparatesPoints ๐จ] (n : โ) : feedbackFun (onlineEvalEnv g hg) n = fun p => g (n + 1) (Prod.snd p)Learning.feedbackFun_onlineEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} [MeasurableSpace.SeparatesPoints ๐จ] (n : โ) : feedbackFun (onlineEvalEnv g hg) n = fun p => g (n + 1) (Prod.snd p)
Code
lemma feedbackFun_onlineEvalEnv [MeasurableSpace.SeparatesPoints ๐จ] (n : โ) :
feedbackFun (onlineEvalEnv g hg) n = fun p โฆ g (n + 1) p.2Type uses (3)
Body uses (7)
Used by (1)
Actions: Source ยท Open Issue
Proof
by
have h_eq := feedback_eq_deterministic (onlineEvalEnv g hg) n
simpa only [onlineEvalEnv, feedback_obliviousEnv, Kernel.prodMkLeft_deterministic,
Kernel.deterministic_inj] using h_eq.symm
hascondDistrib_feedback_onlineEvalEnv๐
Learning.hascondDistrib_feedback_onlineEvalEnvNo docstring.
Learning.hascondDistrib_feedback_onlineEvalEnv.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) (n : โ) : ProbabilityTheory.HasCondDistrib (Y n) (A n) (ProbabilityTheory.Kernel.deterministic (g n) โฏ) PLearning.hascondDistrib_feedback_onlineEvalEnv.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) (n : โ) : ProbabilityTheory.HasCondDistrib (Y n) (A n) (ProbabilityTheory.Kernel.deterministic (g n) โฏ) P
Code
lemma hascondDistrib_feedback_onlineEvalEnv
(h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) (n : โ) :
HasCondDistrib (Y n) (A n) (Kernel.deterministic (g n) (hg n)) PType uses (3)
Body uses (4)
Used by (1)
Actions: Source ยท Open Issue
Proof
by simpa using IsObliviousEnv.hasCondDistrib_feedback h n
feedback_onlineEvalEnv_ae_eq_eval_action๐
Learning.feedback_onlineEvalEnv_ae_eq_eval_actionNo docstring.
Learning.feedback_onlineEvalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) (n : โ) : Y n =แต[P] g n โ A nLearning.feedback_onlineEvalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) (n : โ) : Y n =แต[P] g n โ A n
Code
lemma feedback_onlineEvalEnv_ae_eq_eval_action [StandardBorelSpace ๐จ] [Nonempty ๐จ]
(h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) (n : โ) :
Y n =แต[P] g n โ A nType uses (3)
Body uses (4)
Actions: Source ยท Open Issue
Proof
ae_eq_of_condDistrib_eq_deterministic (hg n) (h.measurable_action n).aemeasurable
(h.measurable_feedback n).aemeasurable
(hascondDistrib_feedback_onlineEvalEnv h n).condDistrib_eq
forall_feedback_onlineEvalEnv_ae_eq_eval_action๐
Learning.forall_feedback_onlineEvalEnv_ae_eq_eval_actionNo docstring.
Learning.forall_feedback_onlineEvalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) : โแต (ฯ : ฮฉ) โP, โ (n : โ), Y n ฯ = g n (A n ฯ)Learning.forall_feedback_onlineEvalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {g : โ โ ๐ โ ๐จ} {hg : โ (n : โ), Measurable (g n)} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) : โแต (ฯ : ฮฉ) โP, โ (n : โ), Y n ฯ = g n (A n ฯ)
Code
lemma forall_feedback_onlineEvalEnv_ae_eq_eval_action [StandardBorelSpace ๐จ] [Nonempty ๐จ]
(h : IsAlgEnvSeq A Y alg (onlineEvalEnv g hg) P) :
โแต ฯ โP, โ n, Y n ฯ = g n (A n ฯ)Type uses (3)
Body uses (1)
Used by (1)
Actions: Source ยท Open Issue
Proof
by rw [ae_all_iff] intro n exact feedback_onlineEvalEnv_ae_eq_eval_action h n
evalEnv๐
Learning.evalEnv
The evaluation environment where the feedback is given by evaluating a fixed measurable function
f at the chosen action.
Learning.evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} (f : ๐ โ ๐จ) (hf : Measurable f) : Environment ๐ ๐จLearning.evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} (f : ๐ โ ๐จ) (hf : Measurable f) : Environment ๐ ๐จ
Code
noncomputable def evalEnv (f : ๐ โ ๐จ) (hf : Measurable f) := onlineEvalEnv (fun _ โฆ f) (fun _ โฆ hf)
Type uses (1)
Body uses (1)
Used by (9)
Actions: Source ยท Open Issue
instIsObliviousEnvEvalEnv๐
Learning.instIsObliviousEnvEvalEnvNo docstring.
Learning.instIsObliviousEnvEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} : IsObliviousEnv (evalEnv f hf)Learning.instIsObliviousEnvEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} : IsObliviousEnv (evalEnv f hf)
Code
instance : IsObliviousEnv (evalEnv f hf)
Type uses (2)
Body uses (2)
Actions: Source ยท Open Issue
Proof
by unfold evalEnv; infer_instance
instIsDeterministicEnvEvalEnv๐
Learning.instIsDeterministicEnvEvalEnvNo docstring.
Learning.instIsDeterministicEnvEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} : IsDeterministicEnv (evalEnv f hf)Learning.instIsDeterministicEnvEvalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} : IsDeterministicEnv (evalEnv f hf)
Code
instance : IsDeterministicEnv (evalEnv f hf)
Type uses (2)
Body uses (2)
Used by (2)
Actions: Source ยท Open Issue
Proof
by unfold evalEnv; infer_instance
feedbackCondAction_evalEnv๐
Learning.feedbackCondAction_evalEnvNo docstring.
Learning.feedbackCondAction_evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} (n : โ) : feedbackCondAction (evalEnv f hf) n = ProbabilityTheory.Kernel.deterministic f hfLearning.feedbackCondAction_evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} (n : โ) : feedbackCondAction (evalEnv f hf) n = ProbabilityTheory.Kernel.deterministic f hf
Code
lemma feedbackCondAction_evalEnv (n : โ) :
feedbackCondAction (evalEnv f hf) n = Kernel.deterministic f hfType uses (3)
Body uses (1)
Used by (1)
Actions: Source ยท Open Issue
Proof
by simp [evalEnv]
feedbackFunZero_evalEnv๐
Learning.feedbackFunZero_evalEnvNo docstring.
Learning.feedbackFunZero_evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} [MeasurableSpace.SeparatesPoints ๐จ] : feedbackFunZero (evalEnv f hf) = fLearning.feedbackFunZero_evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} [MeasurableSpace.SeparatesPoints ๐จ] : feedbackFunZero (evalEnv f hf) = f
Code
lemma feedbackFunZero_evalEnv [MeasurableSpace.SeparatesPoints ๐จ] :
feedbackFunZero (evalEnv f hf) = fType uses (3)
Body uses (1)
Actions: Source ยท Open Issue
Proof
by simp [evalEnv]
feedbackFun_evalEnv๐
Learning.feedbackFun_evalEnvNo docstring.
Learning.feedbackFun_evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} [MeasurableSpace.SeparatesPoints ๐จ] (n : โ) : feedbackFun (evalEnv f hf) n = fun p => f (Prod.snd p)Learning.feedbackFun_evalEnv.{u_1, u_2} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} [MeasurableSpace.SeparatesPoints ๐จ] (n : โ) : feedbackFun (evalEnv f hf) n = fun p => f (Prod.snd p)
Code
lemma feedbackFun_evalEnv [MeasurableSpace.SeparatesPoints ๐จ] (n : โ) :
feedbackFun (evalEnv f hf) n = fun p โฆ f p.2Type uses (3)
Body uses (1)
Actions: Source ยท Open Issue
Proof
by simp [evalEnv]
hascondDistrib_feedback_evalEnv๐
Learning.hascondDistrib_feedback_evalEnvNo docstring.
Learning.hascondDistrib_feedback_evalEnv.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) (n : โ) : ProbabilityTheory.HasCondDistrib (Y n) (A n) (ProbabilityTheory.Kernel.deterministic f hf) PLearning.hascondDistrib_feedback_evalEnv.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) (n : โ) : ProbabilityTheory.HasCondDistrib (Y n) (A n) (ProbabilityTheory.Kernel.deterministic f hf) P
Code
lemma hascondDistrib_feedback_evalEnv (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) (n : โ) :
HasCondDistrib (Y n) (A n) (Kernel.deterministic f hf) PType uses (3)
Body uses (4)
Actions: Source ยท Open Issue
Proof
by simpa using IsObliviousEnv.hasCondDistrib_feedback h n
feedback_evalEnv_ae_eq_eval_action๐
Learning.feedback_evalEnv_ae_eq_eval_actionNo docstring.
Learning.feedback_evalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) (n : โ) : Y n =แต[P] f โ A nLearning.feedback_evalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) (n : โ) : Y n =แต[P] f โ A n
Code
lemma feedback_evalEnv_ae_eq_eval_action [StandardBorelSpace ๐จ] [Nonempty ๐จ]
(h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) (n : โ) :
Y n =แต[P] f โ A nType uses (3)
Body uses (1)
Actions: Source ยท Open Issue
Proof
feedback_onlineEvalEnv_ae_eq_eval_action h n
forall_feedback_evalEnv_ae_eq_eval_action๐
Learning.forall_feedback_evalEnv_ae_eq_eval_actionNo docstring.
Learning.forall_feedback_evalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) : โแต (ฯ : ฮฉ) โP, โ (n : โ), Y n ฯ = f (A n ฯ)Learning.forall_feedback_evalEnv_ae_eq_eval_action.{u_1, u_2, u_3} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) : โแต (ฯ : ฮฉ) โP, โ (n : โ), Y n ฯ = f (A n ฯ)
Code
lemma forall_feedback_evalEnv_ae_eq_eval_action [StandardBorelSpace ๐จ] [Nonempty ๐จ]
(h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) :
โแต ฯ โP, โ n, Y n ฯ = f (A n ฯ)Type uses (3)
Body uses (1)
Used by (1)
Actions: Source ยท Open Issue
Proof
forall_feedback_onlineEvalEnv_ae_eq_eval_action h
feedback_evalEnv_ae_eq_eval_action_comp๐
Learning.feedback_evalEnv_ae_eq_eval_action_compNo docstring.
Learning.feedback_evalEnv_ae_eq_eval_action_comp.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {ฮฒ : Type u_4} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) {n : โ} (g : (โฅ(Finset.Iic n) โ ๐จ) โ ฮฒ) : โแต (ฯ : ฮฉ) โP, (g fun i => Y (โi) ฯ) = g fun i => f (A (โi) ฯ)Learning.feedback_evalEnv_ae_eq_eval_action_comp.{u_1, u_2, u_3, u_4} {๐ : Type u_1} {๐จ : Type u_2} {m๐ : MeasurableSpace ๐} {m๐จ : MeasurableSpace ๐จ} {ฮฉ : Type u_3} {mฮฉ : MeasurableSpace ฮฉ} {alg : Algorithm ๐ ๐จ} {f : ๐ โ ๐จ} {hf : Measurable f} {P : MeasureTheory.Measure ฮฉ} [MeasureTheory.IsProbabilityMeasure P] {A : โ โ ฮฉ โ ๐} {Y : โ โ ฮฉ โ ๐จ} {ฮฒ : Type u_4} [StandardBorelSpace ๐จ] [Nonempty ๐จ] (h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) {n : โ} (g : (โฅ(Finset.Iic n) โ ๐จ) โ ฮฒ) : โแต (ฯ : ฮฉ) โP, (g fun i => Y (โi) ฯ) = g fun i => f (A (โi) ฯ)
Code
lemma feedback_evalEnv_ae_eq_eval_action_comp {ฮฒ : Type*} [StandardBorelSpace ๐จ] [Nonempty ๐จ]
(h : IsAlgEnvSeq A Y alg (evalEnv f hf) P) {n : โ} (g : (Iic n โ ๐จ) โ ฮฒ) :
โแต ฯ โP, g (fun i โฆ Y i ฯ) = g (fun i โฆ f (A i ฯ))Type uses (3)
Body uses (1)
Actions: Source ยท Open Issue
Proof
by filter_upwards [forall_feedback_evalEnv_ae_eq_eval_action h] with ฯ hฯ simp_rw [hฯ]
-
Learning.onlineEvalEnv -
Learning.instIsObliviousEnvOnlineEvalEnv -
Learning.instIsDeterministicEnvOnlineEvalEnv -
Learning.feedbackCondAction_onlineEvalEnv -
Learning.feedbackFunZero_onlineEvalEnv -
Learning.feedbackFun_onlineEvalEnv -
Learning.hascondDistrib_feedback_onlineEvalEnv -
Learning.feedback_onlineEvalEnv_ae_eq_eval_action -
Learning.forall_feedback_onlineEvalEnv_ae_eq_eval_action -
Learning.evalEnv -
Learning.instIsObliviousEnvEvalEnv -
Learning.instIsDeterministicEnvEvalEnv -
Learning.feedbackCondAction_evalEnv -
Learning.feedbackFunZero_evalEnv -
Learning.feedbackFun_evalEnv -
Learning.hascondDistrib_feedback_evalEnv -
Learning.feedback_evalEnv_ae_eq_eval_action -
Learning.forall_feedback_evalEnv_ae_eq_eval_action -
Learning.feedback_evalEnv_ae_eq_eval_action_comp