Skip to content

Conversation

@sanrise
Copy link
Contributor

@sanrise sanrise commented Nov 6, 2024

Stack from ghstack (oldest at bottom):

Studying memory access patterns is the primary use cases.

Internal: The data may be used to find the % of operators that may cause alignment related overhead.

Differential Revision: D64413699

cc @robieta @chaekit @guotuofeng @guyang3532 @dzhulgakov @davidberard98 @briancoutinho @sraikund16

…nd addr hints for Tensor values. Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead on custom silicon. Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/) [ghstack-poisoned]
@sanrise sanrise requested a review from sraikund16 as a code owner November 6, 2024 01:33
@pytorch-bot
Copy link

pytorch-bot bot commented Nov 6, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139837

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

❌ 2 New Failures, 1 Unrelated Failure

As of commit b3e850c with merge base ffb9790 (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64413699

@sanrise sanrise self-assigned this Nov 6, 2024
@sanrise sanrise requested a review from briancoutinho November 6, 2024 01:34
@sanrise sanrise added the oncall: profiler profiler-related issues (cpu, gpu, kineto) label Nov 6, 2024
@sanrise
Copy link
Contributor Author

sanrise commented Nov 6, 2024

@pytorchbot label "topic: not user facing"

@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label Nov 6, 2024
…start and end addr hints for Tensor values." Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead on custom silicon. Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/) cc robieta chaekit guotuofeng guyang3532 dzhulgakov davidberard98 briancoutinho sraikund16 [ghstack-poisoned]
sanrise added a commit that referenced this pull request Nov 6, 2024
…nd addr hints for Tensor values. Pull Request resolved: #139837 Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead on custom silicon. ghstack-source-id: 252025337 @exported-using-ghexport Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/)
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64413699

@sanrise sanrise changed the title [pytorch/profielr] Profiler NCCL metadata can now contain start and end addr hints for Tensor values. [pytorch/profiler] Profiler NCCL metadata can now contain start and end addr hints for Tensor values. Nov 6, 2024
@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 6, 2024
@sanrise sanrise changed the base branch from gh/sanrise/2/base to main November 6, 2024 02:01
@sraikund16 sraikund16 removed the oncall: profiler profiler-related issues (cpu, gpu, kineto) label Nov 6, 2024
@sanrise sanrise changed the base branch from main to gh/sanrise/2/base November 6, 2024 21:04
…start and end addr hints for Tensor values." Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead. Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/) cc robieta chaekit guotuofeng guyang3532 dzhulgakov davidberard98 briancoutinho sraikund16 [ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64413699

…start and end addr hints for Tensor values." Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead. Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/) cc robieta chaekit guotuofeng guyang3532 dzhulgakov davidberard98 briancoutinho sraikund16 [ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64413699

@sanrise sanrise changed the title [pytorch/profiler] Profiler NCCL metadata can now contain start and end addr hints for Tensor values. [pytorch/profiler] Profiler NCCL metadata can now contain collective Input and Ouput Tensor addrs Nov 6, 2024
…collective Input and Ouput Tensor addrs" Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead. Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/) cc robieta chaekit guotuofeng guyang3532 dzhulgakov davidberard98 briancoutinho sraikund16 [ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D64413699

…collective Input and Ouput Tensor addrs" Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead. Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/) cc robieta chaekit guotuofeng guyang3532 dzhulgakov davidberard98 briancoutinho sraikund16 [ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

14 similar comments
@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@facebook-github-bot
Copy link
Contributor

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

@pytorchmergebot
Copy link
Collaborator

Merge failed

Reason: This PR has internal changes and must be landed via Phabricator! Please try reimporting/rexporting the PR!

Details for Dev Infra team Raised by workflow job
@sanrise
Copy link
Contributor Author

sanrise commented Nov 14, 2024

This diff was reverted due to a LOG statement, I am grafting a fix and will submit a new PR.

@sanrise sanrise closed this Nov 14, 2024
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
…Input and Ouput Tensor addrs (pytorch#139837) Studying memory access patterns is the primary use cases. Internal: The data may be used to find the % of operators that may cause alignment related overhead. Differential Revision: [D64413699](https://our.internmc.facebook.com/intern/diff/D64413699/) Pull Request resolved: pytorch#139837 Approved by: https://github.com/sraikund16
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
…lective Input and Ouput Tensor addrs (pytorch#139837)" This reverts commit 3e277eb. Reverted pytorch#139837 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](pytorch#139837 (comment)))
@github-actions github-actions bot deleted the gh/sanrise/2/head branch December 14, 2024 02:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci-no-td Do not run TD on this PR ciflow/trunk Trigger trunk jobs on your pull request fb-exported Merged Reverted topic: not user facing topic category

5 participants