Landelijk Netwerk Mathematische Besliskunde |

Conference 2015

Home

Program LNMB Conference

Invited Speakers LNMB Conference

Program PhD presentations

Abstracts PhD presentations

Registration LNMB Conference

Announcement NGB/LNMB Seminar

Abstracts/Bios NGB/LNMB Seminar

Registration NGB/LNMB Seminar

Registered Participants

Conference Office

How to get there

Return to LNMB Site

Benjamin Van Roy: Learning to Optimize: Delayed Consequences

Abstract: Learning to make effective decisions that may influence observations appearing after subsequent decisions poses challenges beyond those faced when all consequences are immediate. In particular, observations must somehow be attributed to past actions. The area of reinforcement learning addresses this issue alongside the challenges of exploration and generalization. I will discuss reinforcement learning algorithms and results pertaining to them.