Proposta de arquitetura em Hardware para FPGA da técnica Qlearning de aprendizagem por reforço

Q-learning is a off-policy reinforcement learning technique which has as main advantage the possibility of obtaining an optimal policy interacting with an unknown model environment. This work proposes a parallel fixed-point Q-learning algorithm architecture, implemented in FPGA. Fundamental to th...

Description complète

Enregistré dans:
Détails bibliographiques
Auteur principal: Silva, Lucileide Medeiros Dantas da
Autres auteurs: Fernandes, Marcelo Augusto Costa
Format: Dissertação
Langue:por
Publié: Brasil
Sujets:
Accès en ligne:https://repositorio.ufrn.br/jspui/handle/123456789/22395
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!