You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am running into an odd issue where if I change the mass by a small amount, say 0.05 kg, using the pybullet.changeDynamics() method in the _housekeeping() method, the training for one_d_rpm gets stuck at a very small reward and never improves. In some cases, during the rendering/evaluation it just oscillates above and below 1m. I have no idea why this is. It seems that one_d_pid does not have this issue and works well. I'm also slightly confused about the 5% parameter when updating RPMs in relation to the hover RPM. Does anyone have any experience with this?
The text was updated successfully, but these errors were encountered:
Yes. There is a bug in this HoverAviary.py. It only works for the given example. You make small changes like mass, or action or even goal point, it will not work. Particularly, if you change goal point from (0,0,1) to (1,0,1), it will not work. I guess you have to modify reward function.
I don't think it's the policy or the environment (reward function) that's the problem but rather the simulation settings itself. It doesn't make sense that a reward function would suddenly fail with marginal changes in mass. I'm suspecting the PyBullet physics implementation may be wrong. I saw that the thrust2weight ratio was also set as a constant in the urdf file, which doesn't make sense to me either.
I agree with you. I am also facing the same problem. Given the fact that the developers are busy, I think it would be much better to make own environment using ChatGPT.
Hi,
I am running into an odd issue where if I change the mass by a small amount, say 0.05 kg, using the
pybullet.changeDynamics()
method in the_housekeeping()
method, the training forone_d_rpm
gets stuck at a very small reward and never improves. In some cases, during the rendering/evaluation it just oscillates above and below 1m. I have no idea why this is. It seems thatone_d_pid
does not have this issue and works well. I'm also slightly confused about the 5% parameter when updating RPMs in relation to the hover RPM. Does anyone have any experience with this?The text was updated successfully, but these errors were encountered: