Add approximate gelu replacement to opt level 2 #10129
Dr. CI (automated comment): 1 new failure as of commit a23d0c1 with merge base 4022ff1. Artifacts and rendered test results: hud.pytorch.org/pr/pytorch/executorch/10129
This pull request was exported from Phabricator. Differential Revision: D72935935
Summary: As titled. GELU is prohibitively expensive to run on DSPs because of the std::erf call in the function. The PT approximate version uses an approximation based on `tanh`, which is faster, for example on the ASR encoder 27M model. The linter also seems to be complaining about the BUCK files, even with just on_call commands. Differential Revision: D72935935
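For reference, the difference between the two variants mentioned in the summary can be sketched in plain Python. This is only an illustration of the math, not the code from this PR: the function names `gelu_exact`, `gelu_tanh`, and `max_abs_error` are hypothetical, and the tanh formula is assumed to be the one PyTorch documents for `torch.nn.functional.gelu(..., approximate="tanh")`.

```python
import math


def gelu_exact(x: float) -> float:
    """Exact GELU: x * Phi(x), which needs an erf evaluation."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Tanh-based approximation of GELU (no erf call).

    Assumed to match the formula PyTorch documents for approximate="tanh".
    """
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


def max_abs_error(lo: float = -6.0, hi: float = 6.0, steps: int = 1201) -> float:
    """Largest absolute gap between the two variants on an even grid."""
    worst = 0.0
    for i in range(steps):
        x = lo + (hi - lo) * i / (steps - 1)
        worst = max(worst, abs(gelu_exact(x) - gelu_tanh(x)))
    return worst


if __name__ == "__main__":
    # Print the gap so the accuracy cost of the replacement is visible.
    print(f"max |exact - tanh| on [-6, 6]: {max_abs_error():.2e}")
```

The sketch only compares numerical output; it says nothing about DSP timing, which depends on how `erf` and `tanh` are implemented on the target.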
aa57c9e to 0b60121
0b60121 to 474a38c
474a38c to 108f90a
108f90a to a23d0c1