Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] sometimes' pod always being status of ContainterCreating when set inactivityTimeout #4560

Open
changluyi opened this issue Sep 26, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@changluyi
Copy link
Collaborator

Kube-OVN Version

1.12.25

Kubernetes Version

1.30

Operation-system/Kernel Version

3.10.0-1160.el7.x86_64

Description

kube-ovn-controller get a log like this, the pod gets the allocated IP, and then hang, the handleAddOrUpdatePod function will never exist
image

Steps To Reproduce

I don't know how to reproduce

Current Behavior

pod is always being in status of ContainerCreating

Expected Behavior

pod is created successfully

@changluyi changluyi added the bug Something isn't working label Sep 26, 2024
Copy link

dosubot bot commented Sep 26, 2024

I found related issues that might be helpful:

To continue talking to Dosu, mention @dosu.

@bobz965
Copy link
Collaborator

bobz965 commented Sep 26, 2024

是不是什么锁竞争导致的呀?

@changluyi
Copy link
Collaborator Author

image
目前追溯到是inactivityProbe开了后,这个锁hang住了

@changluyi
Copy link
Collaborator Author

changluyi commented Oct 23, 2024

目前追溯到rpclient的问题,在这两个位置都能hang住,导致rpcMutex没有释放,尝试更新rpc库没有解决,目前提issue给libovsdb社区。
image
image

ovn-org/libovsdb#391

看到这个pr 把echo 写成异步的,似乎能解决,但没弄明白当时为啥没合并
cenkalti/rpc2#38
ovn-org/libovsdb#368

@changluyi changluyi changed the title [BUG] sometimes' pod always being status of ContainterCreating [BUG] sometimes' pod always being status of ContainterCreating when set inactivityTimeout Oct 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants