Unbreak test models llama CI #6026

Closed
wants to merge 1 commit into from

Conversation
larryliu0820
Contributor

Summary:
Did a bunch of debugging on OSS CI: https://github.com/pytorch/executorch/actions/runs/11241297226/job/31252590975

Was able to confirm that although the failure surfaces in `ConvertToLinear`, the root cause is that the graph is partitioned differently between these two pytorch nightlies: dev20240916 and dev20240917.

The exported graph looks the same, but the partitioner behaves differently, causing the `ConvertToLinear` pass to error out.

We can't simply revert to the dev20240916 nightly because it breaks other CI jobs; see #5987.

The current approach avoids decomposing linear by using the `to_edge_transform_and_lower` API. This avoids going down the rabbit hole of debugging the partitioning & tagging logic.

Differential Revision: D64074891
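The failure mode can be sketched with a toy stand-in (illustrative only, not the actual ExecuTorch pass): a fusion pass like `ConvertToLinear` only succeeds when the partitioner keeps both halves of the decomposed pattern in the same partition, so a nightly that splits the pattern breaks it, while lowering before decomposition sidesteps the problem entirely.

```python
# Toy stand-in for a ConvertToLinear-style fusion pass (illustrative only,
# not the real ExecuTorch implementation). It fuses a matmul immediately
# followed by an add back into a single linear op.

def convert_to_linear(partition):
    """Fuse each adjacent ("matmul", "add") pair into a "linear" node."""
    out, i = [], 0
    while i < len(partition):
        if partition[i] == "matmul" and i + 1 < len(partition) and partition[i + 1] == "add":
            out.append("linear")
            i += 2
        else:
            out.append(partition[i])
            i += 1
    return out

# If the partitioner keeps the decomposed pattern together, fusion works:
print(convert_to_linear(["matmul", "add"]))   # ['linear']
# If a partitioning change splits the pattern across partitions, the pass
# sees only half of it and cannot recover the linear op:
print(convert_to_linear(["matmul"]))          # ['matmul']
# Lowering before decomposition (to_edge_transform_and_lower) means the
# graph still contains "linear", so no re-fusion is needed at all:
print(convert_to_linear(["linear"]))          # ['linear']
```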

pytorch-bot commented Oct 8, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6026

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 04b1dda with merge base b6e6d06:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Oct 8, 2024
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D64074891


larryliu0820 added a commit that referenced this pull request Oct 8, 2024
facebook-github-bot pushed a commit that referenced this pull request Oct 8, 2024 (Reviewed By: Jack-Khuu)
larryliu0820 added a commit that referenced this pull request Oct 8, 2024
facebook-github-bot pushed a commit that referenced this pull request Oct 8, 2024 (Reviewed By: Jack-Khuu)
facebook-github-bot pushed a commit that referenced this pull request Oct 9, 2024 (Reviewed By: digantdesai, Jack-Khuu, tugsbayasgalan)
@facebook-github-bot
Contributor

This pull request has been merged in 72b3bb3.

qma added a commit to qma/executorch that referenced this pull request Oct 15, 2024

Summary:
Since master has migrated aot_compiler to use to_edge_transform_and_lower in a previous change pytorch#6026, XNNPACK quantization options can be enabled by default for the following models:

- Quantized ViT
- Quantized Mobilebert
- Quantized Emformer Predict
- Quantized Emformer Transcribe

Differential Revision: D64081319
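As a rough sketch of what "enabled by default" means for the CI matrix (all names below are hypothetical illustrations, not the actual aot_compiler code): each of the listed models gains a quantized, XNNPACK-delegated variant in addition to its fp32 job.

```python
# Hypothetical sketch of default-on quantized XNNPACK CI variants; the
# model list and helper are illustrative, not the actual aot_compiler code.

MODELS = ["vit", "mobilebert", "emformer_predict", "emformer_transcribe"]

def ci_matrix(models, quantize_by_default=True):
    """Expand each model into (model, quantize, delegate) CI jobs."""
    jobs = []
    for m in models:
        jobs.append((m, False, True))     # fp32, delegated to XNNPACK
        if quantize_by_default:
            jobs.append((m, True, True))  # quantized + delegated
    return jobs

jobs = ci_matrix(MODELS)
print(sum(1 for _, quantized, _ in jobs if quantized))  # 4 quantized jobs run by default
```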
facebook-github-bot pushed a commit that referenced this pull request Oct 17, 2024 (#6242; Reviewed By: digantdesai)
Labels: ciflow/trunk, CLA Signed, fb-exported, Merged
Projects: None yet
4 participants