Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

model training process setup #121

Open
zzt007 opened this issue Mar 6, 2024 · 6 comments
Open

model training process setup #121

zzt007 opened this issue Mar 6, 2024 · 6 comments

Comments

@zzt007
Copy link

zzt007 commented Mar 6, 2024

Hi , thanks for your great job! I have done all the work mentioned in readme. I also start to train the model, however, the car moves very slowly in rviz and gazebo , and does not run smoothly as in the example ,even needs to wait a minute before changing positon. Is this because of my computer's poor performance?

  1. my computer and environment as follows:
  • cpu : 13700k i7
  • no NVIDIA Graphics Card
  • Windows WSL2 & Ubuntu20.04 ROS noetic
  1. the following two pictures are 1 minute apart
    image
    image

  2. the terminal belike 👇
    image

@reiniscimurs
Copy link
Owner

Hi,

I have deployed this model on i5 and i3 cpus without cuda so the training part where episode is collected is not that resource intensive. If it takes a minute to change a single (or even a couple) step it does not seem normal to me. However, it does seem you are using a virtual machine, and it either might not have enough resources or not configured correctly. So I would suggest checking if any other ros application works there, and if it does not run smoothly, you could know where the problem lies. In any case, this seems more of a hardware issue and I don't think I can help you much there.

@zzt007
Copy link
Author

zzt007 commented Mar 6, 2024

Thank you for your reply. Does that situation mean running the program successfully? I want to debug this program to learn the connection between DRL and ROS&gazebo simulation, and then I could work on my own project.

@reiniscimurs
Copy link
Owner

From what I can see, the software is launched properly.

@zzt007
Copy link
Author

zzt007 commented Mar 7, 2024

Hi there, I want to know how to judge whether the trained model is convergent? This is my first contact with RL training. So what metrics that I need to check? loss curve like DL model? or reward value reaches a stable value?
The following is my terminal information during the training .
-- training start
image

-- epoch increase
image

I find that with the increase of epoch, its average rewards becomes negative . Looking forward to your reply.

@reiniscimurs
Copy link
Owner

Hi,

Better indicator would be curves in tensorboard. Evaluation in the beginning in your case probably places the goals really close to the robot so it randomly "collects" them. As training goes on, the goal distance gets increased and situations become more complex to the robot.

Loss is not a good indicator and you can see more on the topic here:
#89 (comment)

Generally I would look for the convergence of the maxQ reward on tensorboard.

@zzt007
Copy link
Author

zzt007 commented Mar 8, 2024

Many thanks for your kind help and share.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants