reinforcement learning – Mathemafrica

Permalink
Gallery
CRL Task 3: Counterfactual Decision Making

English, Level: intermediate, Uncategorized

CRL Task 3: Counterfactual Decision Making

In the previous blog post we discussed some theory of how to select optimal and possibly optimal interventions in a causal framework. For those interested in the decision science, this blog post may be more inspiring. This next task involves applying counterfactual quantities to boost learning performance. This is clearly very important for an RL agent where its entire learning mechanism is based on interventions in a system. What if intervention isn’t possible? Let’s begin!

This Series

Causal Reinforcement Learning
Preliminaries for CRL
CRL Task 1: Generalised Policy Learning
CRL Task 2: Interventions – When and Where?
CRL Task 3: Counterfactual Decision Making
CRL Task 4: Generalisability and Robustness
Task 5: Learning Causal Models
(Coming soon) Task 6: Causal Imitation Learning
(Coming soon) Wrapping Up: Where To From Here?

Counterfactual Decision Making

A key feature of causal inference is its ability to deal with counterfactual queries. Reinforcement learning, by its nature, deals with interventional quantities in a trial-and-error style of learning.…

By St John Grimbly| July 10th, 2021|English, Level: intermediate, Uncategorized|6 Comments

Permalink
Gallery
CRL Task 2: Interventions – When and Where?

English, Level: intermediate

CRL Task 2: Interventions – When and Where?

In the previous blog post we discussed the gorey details of generalised policy learning – the first task of CRL. We went into some very detailed mathematical description of dynamic treatment regimes and generalised modes of learning for data processing agents. The next task is a bit more conceptual and focuses on the question on how to identfy optimal areas of intervention in a system. This is clearly very important for an RL agent where its entire learning mechanism is based on these very interventions in some system with a feedback mechanism. Let’s begin!

This Series

Causal Reinforcement Learning
Preliminaries for CRL
CRL Task 1: Generalised Policy Learning
CRL Task 2: Interventions – When and Where?
CRL Task 3: Counterfactual Decision Making
CRL Task 4: Generalisability and Robustness
Task 5: Learning Causal Models
(Coming soon) Task 6: Causal Imitation Learning
(Coming soon) Wrapping Up: Where To From Here?

Interventions – When and Where?

…

By St John Grimbly| July 6th, 2021|English, Level: intermediate|4 Comments

Permalink
Gallery
CRL Task 1: Generalised Policy Learning

Background, English, Level: intermediate

CRL Task 1: Generalised Policy Learning

In the previous blog post we developed some ideas and theory needed to discuss a causal approach to reinforcement learning. We formalised notions of multi-armed bandits (MABs), Markov Decision Processes (MDPs), and some causal notions. In this blog post we’ll finally get to developing some causal reinforcement learning ideas. The first of which is dubbed Task 1, for CRL can help solve. This is Generalised Policy Learning. Let’s begin.

This Series

Causal Reinforcement Learning
Preliminaries for CRL
CRL Task 1: Generalised Policy Learning
CRL Task 2: Interventions – When and Where?
CRL Task 3: Counterfactual Decision Making
CRL Task 4: Generalisability and Robustness
Task 5: Learning Causal Models
(Coming soon) Task 6: Causal Imitation Learning
(Coming soon) Wrapping Up: Where To From Here?

Generalised Policy Learning

Reinforcement learning typically involves learning and optimising some policy about how to interact in an environment to maximise some reward signal. Typical reinforcement learning agents are trained in isolation, exploiting copious amounts of computing power and energy resources.…

By St John Grimbly| July 1st, 2021|Background, English, Level: intermediate|4 Comments

Permalink
Gallery
Preliminaries for CRL

Background, English, Level: Simple

Preliminaries for CRL

In the previous blog post we discussed and motivated the need for a causal approach to reinforcement learning. We argued that reinforcement learning naturally falls on the interventional rung of the ladder of causation. In this blog post we’ll develop some ideas necessary for understanding the material covered in this series. This might get quite technical, but don’t worry. There is still always something to take away. Let’s begin.

This Series

Causal Reinforcement Learning
Preliminaries for CRL
CRL Task 1: Generalised Policy Learning
CRL Task 2: Interventions – When and Where?
CRL Task 3: Counterfactual Decision Making
CRL Task 4: Generalisability and Robustness
Task 5: Learning Causal Models
(Coming soon) Task 6: Causal Imitation Learning
(Coming soon) Wrapping Up: Where To From Here?

Preliminaries

As you probably recall from high school, probability and statistics are almost entirely formulated on the idea of drawing random samples from an experiment. One imagines observing realisations of outcomes from some set of possibilities when drawing from an assortment of independent and identically distributed (i.i.d.) events.…

By St John Grimbly| April 6th, 2021|Background, English, Level: Simple|5 Comments

Permalink
Gallery
Causal Reinforcement Learning: A Primer

Background, English, Level: intermediate, Level: Simple

Causal Reinforcement Learning: A Primer

As part of any honours degree at the University of Cape Town, one is obliged to write a thesis ‘droning’ on about some topic. Luckily for me, applied mathematics can pertain to pretty much anything of interest. Lo and behold, my thesis on merging causality and reinforcement learning. This was entitled Climbing the Ladder: A Survey of Counterfactual Methods in Decision Making Processes and was supervised by Dr Jonathan Shock.

In this series of posts I will break down my thesis into digestible blog chucks and go into quite some detail of the emerging field of Causal Reinforcement Learning (CRL) – which is being spearheaded by Elias Bareinboim and Judea Pearl, among others. I will try to present this in such a way as to satisfy those craving some mathematical detail whilst also trying to paint a broader picture as to why this is generally useful and important. Each of these blog posts will be self contained in some way.…

By St John Grimbly| February 3rd, 2021|Background, English, Level: intermediate, Level: Simple|5 Comments