Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to ensure gradient passing of subnetwork after using the subnetwork training result as filters parameters for calculation #74

Open
mxteaw opened this issue Jul 26, 2023 · 1 comment

Comments

@mxteaw
Copy link

mxteaw commented Jul 26, 2023

I tried to make the CNN_PP module using pytorch, but since the subnet and yolo connection need to use the results of the subnetwork as parameters of the filters, it seems to causes the gradient propagation to break and the program reports an error, can someone handle the problem?

@mxteaw
Copy link
Author

mxteaw commented Jul 27, 2023

I tried to make the CNN_PP module using pytorch, but since the subnet and yolo connection need to use the results of the subnetwork as parameters of the filters, it seems to causes the gradient propagation to break and the program reports an error, can someone handle the problem?

Since the formula is derivable, now I know that it does not affect the passing of the gradient, but the running program will report the error: "RuntimeError: Only Tensors created explicitly by the user (graph leaves) support the deepcopy protocol at the moment", how should I solve it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant