[Bug]: intel_gpu_top polling once per MILLIsecond (versus intended once-per-second?) #19982
-
Checklist
Describe the problem you are havingI am running a new install. I've noticed after some time, the System Metrics > GPU Usage stat goes to 0% and sticks there. This is for a intel/qsv environment. The cams still work and CPU doesnt spike so it appear things are still "working". But, Ive also been working on some system hard crashes (which led me to this oddity). I am not sure they are connected but alas... I am running proxmox 9 with frigate in a debian (6.12.41+deb13-amd64) VM (and running in docker within that VM). It figured it might just been the newness of trixie and passing through the GPU. But, in debugging, I found that in services.py the call that gets this info (pasted below) was spitting out insane amounts of information when I tried it. LLMs were confused. Finally I looked at the command's help and notices that the -s flag, which the LLMs, and maybe frigate, expected to be in seconds is now in milliseconds. That explains the crazy polling output as frigate (and me copying frigate) were asking for data every 1ms.
Given the LLM assumption I wonder if this was a second flag in the past (as "-s" would imply) and recently got move to ms? And then I was wondering if polling every 1ms unintentionally would overload/crash something causing the metric to flatline. Anyway, not sure what the intent was but thought I'd file this as an fyi because 1ms seems extremely frequent. Steps to reproduce
![]() Version0.16.1-e664cb2 In which browser(s) are you experiencing the issue with?Version 18.6 (20621.3.11.11.3) Frigate config fileffmpeg:
hwaccel_args: preset-intel-qsv-h264 docker-compose file or Docker CLI commandservices:
frigate:
container_name: frigate
privileged: true # this may not be necessary for all setups
restart: unless-stopped
stop_grace_period: 30s # allow enough time to shut down the various services
image: ghcr.io/blakeblackshear/frigate:stable
shm_size: "512mb" # update for your cameras based on calculation above
devices:
- /dev/bus/usb:/dev/bus/usb
- /dev/dri/renderD128:/dev/dri/renderD128
volumes:
- /etc/localtime:/etc/localtime:ro
- /home/emily/frigate/config:/config
- /mnt/frigate:/media/frigate
- type: tmpfs
target: /tmp/cache
tmpfs:
size: 1000000000
ports:
- "8971:8971"
- "5000:5000" # Internal unauthenticated access. Expose carefully.
- "8554:8554" # RTSP feeds
- "8555:8555/tcp" # WebRTC over tcp
- "8555:8555/udp" # WebRTC over udp
- "1984:1984" #go2rtc Relevant Frigate log outputn/a Relevant go2rtc log outputn/a Operating systemHome Assistant OS Install methodDocker Compose Network connectionWired Camera make and modelmultiple Screenshots of the Frigate UI's System metrics pages![]() ![]() Any other information that may be helpfulHardware: Beelink EQi12 Intel® Core 1220P |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
Wow, that would be quite an interesting breaking change to just throw in there, I'd expect they'd have some kind of sanity check as 1ms seems way too frequent to even be a valid number. Your suggestion about this change also leading to the GPU stats being broken could absolutely be the case. I am not sure how to validate this longer-term, to make sure we are not hit by a problem if this behavior changes again. Seems like we should go ahead and try and change this for now though targeting 0.16.2 |
Beta Was this translation helpful? Give feedback.
-
Thanks, great find! |
Beta Was this translation helpful? Give feedback.
No worries. We've put up a fix here: #19984