Hello! I wrote an article on the right way to construct a common synthetic intelligence from scratch utilizing a deep reinforcement studying algorithm referred to as Deep Q-learning:
https://medium.com/@lorenzotinfena/a-formal-introduction-to-deep-reinforcement-learning-db639d8c48b8