Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trying to fix the hotloading incompatibility issue #1380

Closed
wants to merge 1 commit into from

Conversation

levythu
Copy link
Contributor

@levythu levythu commented Sep 5, 2023

Summary:
Having discussion with jgk, we believe this is the right fix:

check this out https://fburl.com/code/v60tnefc

here we require https://fburl.com/code/gf6v4asr

torchrec

https://fburl.com/code/jn8eoyxs is not part of "//torchrec/distributed:distributed"

distributed uses model_parallel
model_parallel uses sharding_plan
sharding_plan includes fp_embeddingbag

BUT it does not include mc_embeddingbag

so the better solve instead of D48973797 would be to add mc_embeddingbag in https://fburl.com/code/td7f2682

then this dependency will roll up into

mc_embeddingbag -> sharding_plan -> model_parallel -> distributed -> https://fburl.com/code/zjxooggp -> https://fburl.com/code/xgpq1y3m

Differential Revision: D48912014

@facebook-github-bot facebook-github-bot added CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported labels Sep 5, 2023
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D48912014

levythu pushed a commit to levythu/torchrec that referenced this pull request Sep 5, 2023
…h#1380)

Summary:

Having discussion with jgk, we believe this is the right fix:

check this out https://fburl.com/code/v60tnefc

here we require https://fburl.com/code/gf6v4asr

torchrec

https://fburl.com/code/jn8eoyxs is not part of "//torchrec/distributed:distributed"

distributed uses model_parallel
model_parallel uses sharding_plan
sharding_plan includes fp_embeddingbag

BUT it does not include mc_embeddingbag

so the better solve instead of D48973797 would be to add mc_embeddingbag in https://fburl.com/code/td7f2682

then this dependency will roll up into

mc_embeddingbag -> sharding_plan -> model_parallel -> distributed -> https://fburl.com/code/zjxooggp -> https://fburl.com/code/xgpq1y3m

Differential Revision: D48912014
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D48912014

levythu pushed a commit to levythu/torchrec that referenced this pull request Sep 6, 2023
…h#1380)

Summary:

Having discussion with jgk, we believe this is the right fix:

check this out https://fburl.com/code/v60tnefc

here we require https://fburl.com/code/gf6v4asr

torchrec

https://fburl.com/code/jn8eoyxs is not part of "//torchrec/distributed:distributed"

distributed uses model_parallel
model_parallel uses sharding_plan
sharding_plan includes fp_embeddingbag

BUT it does not include mc_embeddingbag

so the better solve instead of D48973797 would be to add mc_embeddingbag in https://fburl.com/code/td7f2682

then this dependency will roll up into

mc_embeddingbag -> sharding_plan -> model_parallel -> distributed -> https://fburl.com/code/zjxooggp -> https://fburl.com/code/xgpq1y3m

Differential Revision: D48912014
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D48912014

…h#1380)

Summary:

Having discussion with jgk, we believe this is the right fix:

check this out https://fburl.com/code/v60tnefc

here we require https://fburl.com/code/gf6v4asr

torchrec

https://fburl.com/code/jn8eoyxs is not part of "//torchrec/distributed:distributed"

distributed uses model_parallel
model_parallel uses sharding_plan
sharding_plan includes fp_embeddingbag

BUT it does not include mc_embeddingbag

so the better solve instead of D48973797 would be to add mc_embeddingbag in https://fburl.com/code/td7f2682

then this dependency will roll up into

mc_embeddingbag -> sharding_plan -> model_parallel -> distributed -> https://fburl.com/code/zjxooggp -> https://fburl.com/code/xgpq1y3m

Reviewed By: dstaay-fb

Differential Revision: D48912014
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D48912014

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants