Supplementary material for Uncorrected least-squares temporal difference with lambda-return
Here, we provide a supplementary material for Takayuki Osogami, "Uncorrected least-squares temporal difference with lambda-return," which appears in Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI-20).
READ FULL TEXT