Distributed multi-GPU and multi-node learning (PyTorch implementation) #162

Merged Jun 24, 2024 (21 commits)
cea621f
Add PyTorch framework configuration
Toni-SM Jun 19, 2024
b9c7645
Initialize distributed process in trainer base class
Toni-SM Jun 19, 2024
b0deb24
Improve property annotation and docstrings
Toni-SM Jun 20, 2024
42f9f57
Add PyTorch framework config to docs
Toni-SM Jun 20, 2024
e06dcfd
Increase seed according to worker rank in distributed runs
Toni-SM Jun 20, 2024
5abed23
Update agent and trainer configuration to avoid duplicated data in di…
Toni-SM Jun 20, 2024
4f8e3d8
Add method to broadcast and reduce distributed model parameters
Toni-SM Jun 20, 2024
104313d
Setup distributed runs
Toni-SM Jun 20, 2024
36d4a57
Add distributed implementation to PPO agent
Toni-SM Jun 21, 2024
9f265af
Fix torch deprecated warning
Toni-SM Jun 21, 2024
391506d
Reduce and broadcast learning rate across all workers/processes
Toni-SM Jun 21, 2024
f2aca29
Merge branch 'develop' into toni/distributed_torch
Toni-SM Jun 21, 2024
734221e
Update CHANGELOG
Toni-SM Jun 21, 2024
ffd6503
Merge branch 'develop' into toni/distributed_torch
Toni-SM Jun 23, 2024
a336bbd
Implement distributed runs for on-policy agents
Toni-SM Jun 23, 2024
1954ab3
Add distributed implementation to agent features
Toni-SM Jun 23, 2024
b821879
Implement distributed runs for off-policy agents
Toni-SM Jun 24, 2024
de86f51
Update off-policy agents features table in docs
Toni-SM Jun 24, 2024
c739b75
Unify code style for distributed implementation
Toni-SM Jun 24, 2024
c6301fb
Implement distributed runs for multi-agents
Toni-SM Jun 24, 2024
e09d7fd
Update multi-agents features table in docs
Toni-SM Jun 24, 2024
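
Commit e06dcfd ("Increase seed according to worker rank in distributed runs") describes giving each worker its own seed. The snippet below is only a minimal sketch of that idea, not the code from this PR: the helper names `seed_for_rank` and `read_rank` are hypothetical, and offsetting the base seed by adding the rank is an assumption about how the increase is done. The `RANK` environment variable is the one set by `torchrun` for each process.

```python
import os
import random


def seed_for_rank(base_seed: int, rank: int) -> int:
    """Hypothetical helper: offset the base seed by the worker rank.

    Assumption: a plain addition, so that every rank gets a distinct seed
    and rank 0 keeps the user-provided seed unchanged.
    """
    return base_seed + rank


def read_rank(default: int = 0) -> int:
    """Hypothetical helper: read the global rank from the RANK environment
    variable, falling back to `default` for non-distributed runs."""
    return int(os.environ.get("RANK", default))


# Each process runs the same program and derives its own seed.
seed = seed_for_rank(42, read_rank())
random.seed(seed)
print(f"rank={read_rank()} seed={seed}")
```

With distinct seeds, workers that run the same program no longer sample identical noise or identical environment resets, which would otherwise duplicate the collected data across processes.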
Update off-policy agents features table in docs
Toni-SM committed Jun 24, 2024
commit de86f514b9302d0dd3e08f8242d1ce986dd9d0d1
4 changes: 2 additions & 2 deletions docs/source/api/agents/ddpg.rst
@@ -237,8 +237,8 @@ Support for advanced features is described in the next table
   - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`
 * - Distributed
-  - \-
-  - .. centered:: :math:`\square`
+  - Single Program Multi Data (SPMD) multi-GPU
+  - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`

 .. raw:: html
4 changes: 2 additions & 2 deletions docs/source/api/agents/ddqn.rst
@@ -185,8 +185,8 @@ Support for advanced features is described in the next table
   - .. centered:: :math:`\square`
   - .. centered:: :math:`\square`
 * - Distributed
-  - \-
-  - .. centered:: :math:`\square`
+  - Single Program Multi Data (SPMD) multi-GPU
+  - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`

 .. raw:: html
4 changes: 2 additions & 2 deletions docs/source/api/agents/dqn.rst
@@ -185,8 +185,8 @@ Support for advanced features is described in the next table
   - .. centered:: :math:`\square`
   - .. centered:: :math:`\square`
 * - Distributed
-  - \-
-  - .. centered:: :math:`\square`
+  - Single Program Multi Data (SPMD) multi-GPU
+  - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`

 .. raw:: html
4 changes: 2 additions & 2 deletions docs/source/api/agents/sac.rst
@@ -245,8 +245,8 @@ Support for advanced features is described in the next table
   - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`
 * - Distributed
-  - \-
-  - .. centered:: :math:`\square`
+  - Single Program Multi Data (SPMD) multi-GPU
+  - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`

 .. raw:: html
4 changes: 2 additions & 2 deletions docs/source/api/agents/td3.rst
@@ -259,8 +259,8 @@ Support for advanced features is described in the next table
   - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`
 * - Distributed
-  - \-
-  - .. centered:: :math:`\square`
+  - Single Program Multi Data (SPMD) multi-GPU
+  - .. centered:: :math:`\blacksquare`
   - .. centered:: :math:`\square`

 .. raw:: html
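
For readers unfamiliar with the "Single Program Multi Data (SPMD)" entry added to these tables: every process runs the same training script on its own share of the data, and the replicas are kept in sync by averaging gradients before each update. The following is a pure-Python simulation of that pattern, offered as a rough sketch only. It is not the implementation from this PR, and the function names `all_reduce_mean` and `sgd_step` are hypothetical. In PyTorch the corresponding collectives would be `torch.distributed.broadcast` and `torch.distributed.all_reduce`.

```python
from typing import List


def all_reduce_mean(per_worker_grads: List[List[float]]) -> List[float]:
    """Simulate an all-reduce (sum) followed by division by the world size.

    `per_worker_grads[i]` holds the gradient computed locally by worker i.
    """
    world_size = len(per_worker_grads)
    summed = [sum(values) for values in zip(*per_worker_grads)]
    return [s / world_size for s in summed]


def sgd_step(params: List[float], grads: List[float], lr: float) -> List[float]:
    """Plain gradient-descent update applied identically on every worker."""
    return [p - lr * g for p, g in zip(params, grads)]


# All workers start from the same parameters (as after a broadcast from rank 0).
params = [1.0, 1.0]

# Each worker computes gradients on its own data, so the values differ.
local_grads = [[1.0, 2.0], [3.0, 4.0]]

# After averaging, every worker applies the same update and stays in sync.
shared_grads = all_reduce_mean(local_grads)
new_params = sgd_step(params, shared_grads, lr=0.5)
print(shared_grads, new_params)
```

Because every worker applies the same averaged gradient to the same starting parameters, no parameter server is needed. The only communication is the collective reduction.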