Proposta de arquitetura em Hardware para FPGA da técnica Qlearning de aprendizagem por reforço
Q-learning is a off-policy reinforcement learning technique which has as main advantage the possibility of obtaining an optimal policy interacting with an unknown model environment. This work proposes a parallel fixed-point Q-learning algorithm architecture, implemented in FPGA. Fundamental to th...
Enregistré dans:
Auteur principal: | |
---|---|
Autres auteurs: | |
Format: | Dissertação |
Langue: | por |
Publié: |
Brasil
|
Sujets: | |
Accès en ligne: | https://repositorio.ufrn.br/jspui/handle/123456789/22395 |
Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|