Testing, generative coverage and debugging is hard
To build a reliable product or mission critical system it is important to test these throughly. Advance AI uses Neural Network, Deep Neural Network (DNN) Q Learning is an algorithm used in hierarchal learning. However Q learning find it hard to learn from very big state space. It does not work in continuous space as exploring all states is hard. Learning policy is even harder. Key to success is - can you create an abstract world (dataset) that let you learn good policies.