I have implemented an RL agent for trading EUR/USD and I don't know what to do next...

So, after months of learning about RL and doing toy implementations, I have coded a DQN with an experience replay buffer and dual nets. The network design is about the most average thing you can come across in the ML scene: a simple deep feed-forward network with ReLU and linear activation functions.
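
To make the setup concrete, here is a minimal sketch of those pieces in plain Python (standard library only, no training loop). It is not my actual code. It assumes "dual nets" means an online network plus a periodically synced target network, and that the ReLU sits in the hidden layer with a linear output layer. The names `ReplayBuffer`, `init_params`, `q_values`, `td_target` and `sync_target`, the layer sizes, and the three-action setup are all hypothetical and only there for illustration.

```python
import copy
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size experience buffer; the oldest transitions are dropped first."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement from the stored transitions.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def init_params(n_in, n_hidden, n_actions, rng):
    """Small random weights for a one-hidden-layer network (sizes are assumptions)."""
    def layer(n_out, n_inputs):
        return [[rng.uniform(-0.1, 0.1) for _ in range(n_inputs)] for _ in range(n_out)]

    return {
        "w1": layer(n_hidden, n_in), "b1": [0.0] * n_hidden,
        "w2": layer(n_actions, n_hidden), "b2": [0.0] * n_actions,
    }


def dense(x, weights, biases):
    # One output per weight row: dot(row, x) + bias.
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def q_values(params, state):
    """Feed-forward pass: ReLU hidden layer, linear output (one Q-value per action)."""
    hidden = [max(0.0, h) for h in dense(state, params["w1"], params["b1"])]
    return dense(hidden, params["w2"], params["b2"])


def td_target(target_params, reward, next_state, done, gamma=0.99):
    """Standard DQN target: r + gamma * max_a Q_target(s', a), or just r at episode end."""
    if done:
        return reward
    return reward + gamma * max(q_values(target_params, next_state))


def sync_target(online_params):
    """Hard update: the target network becomes a frozen copy of the online network."""
    return copy.deepcopy(online_params)
```

A typical use would be to build the online network with `init_params`, create the target with `sync_target`, push transitions into the buffer while acting, and compute `td_target` for each sampled transition as the regression target for the online network, re-syncing the target every fixed number of steps.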


