Deep reinforcement-learning algorithm

Summary
Deep reinforcement-learning algorithm