
It sounds like a simple PID loop would be sufficient to solve this problem. You have a control valve and an error signal. No need for anything more complicated.
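For anyone who has not written one, here is a minimal sketch of such a loop in Python. Nothing in it comes from the original project: the `PID` class, the gains, the 0..1 valve range, the conditional-integration anti-windup, and the toy first-order `simulate` plant are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PID:
    """Textbook discrete PID controller (hypothetical helper, not from the thread)."""
    kp: float
    ki: float
    kd: float
    out_min: float = 0.0   # assumed: valve fully closed
    out_max: float = 1.0   # assumed: valve fully open
    integral: float = 0.0
    prev_error: Optional[float] = None

    def update(self, setpoint: float, measurement: float, dt: float) -> float:
        """Return the valve command for one control step of length dt."""
        error = setpoint - measurement
        candidate_integral = self.integral + error * dt
        # No derivative term on the first call, since there is no previous error.
        derivative = 0.0 if self.prev_error is None else (error - self.prev_error) / dt
        self.prev_error = error

        raw = self.kp * error + self.ki * candidate_integral + self.kd * derivative
        out = min(max(raw, self.out_min), self.out_max)
        # Simple anti-windup (an assumption): only accumulate the integral
        # while the output is not saturated.
        if raw == out:
            self.integral = candidate_integral
        return out


def simulate(pid: PID, setpoint: float, steps: int = 500, dt: float = 0.1) -> float:
    """Drive a made-up first-order plant: level' = valve - 0.5 * level."""
    level = 0.0
    for _ in range(steps):
        valve = pid.update(setpoint, level, dt)
        level += (valve - 0.5 * level) * dt
    return level


if __name__ == "__main__":
    final_level = simulate(PID(kp=2.0, ki=1.0, kd=0.0), setpoint=0.5)
    print(f"final level: {final_level:.4f}")
```

The gains here are picked by hand for the toy plant; a real valve would need tuning against the actual process.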


It is a PID loop, which I guess may not count as actual reinforcement learning.



