Learning from large world /state space
In Hierarchal Reinforcement Learning (HRL) agents can break down a complex task into smaller one. Q Learning is an algorithm used in hierarchal learning. However Q learning find it hard to learn from very big state space. It does not work in continuous space as exploring all states is hard. Learning policy is even harder. Key to success is - can you create an abstract world (dataset) that let you learn good policies.