This code accompanies this paper, implementing the main algorithms and reproducing the core results.
The sequence_env
directory contains implementations relevant to the sequence learning task. The trail_env
directory contains implementations for the deep RL task, and corresponding teacher algorithms. Included in both directories are Jupyter notebooks that reproduce the numeric figures in the main text.
If you notice any errors are have any questions, please do get in touch and/or open a pull request!