April 2019 - Darwin的小小AI天地

DDPG: How It Works

Post author:darren1231
Post published: April 21, 2019
Post category: AI Math
Post comments:0 Comments

DDPG also builds on the concepts covered earlier; it combines Actor-Cri...

Continue Reading

Actor Critic: How It Works

Post author:darren1231
Post published: April 15, 2019
Post category: AI Math
Post comments:0 Comments

This method uses two networks to learn actions. One is the Actor network, which is mainly used...

Continue Reading

Policy gradient: How It Works

Post author:darren1231
Post published: April 9, 2019
Post category: AI Math
Post comments:4 Comments

Today we introduce another family of RL methods, Policy gradient, p...

Continue Reading