Title: Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Idea: Modify the mathematical objective so that it takes exploration into account, derive two meta-RL algorithms (E-MAML and E-RL^2) from it, and propose the Krazy World environment as a benchmark for meta-RL.