Conversation

abrarsheikh
Contributor

@abrarsheikh abrarsheikh commented Oct 12, 2025

This PR fixes critical gaps and inaccuracies in the Ray Serve autoscaling docs. It documents the previously undocumented `aggregation_function` parameter and nine autoscaling environment variables, and adds a "How autoscaling metrics work" section with diagrams explaining the four-stage metrics pipeline. It also corrects the `record_autoscaling_stats()` timeout value (10s, not 30s), clarifies the two-stage downscaling behavior, and fixes the explanation of the `max_replicas` default value. Finally, it adds deprecation notices for two transitions: simple mode to aggregate mode, and handle-based to replica-based metrics collection.
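To illustrate why the choice of `aggregation_function` matters, here is a minimal sketch in plain Python. It is not Ray Serve's implementation: the helper names (`aggregate_window`, `desired_replicas`), the set of supported functions, and the sample values are all assumptions made for illustration. It only shows how collapsing a window of load samples with mean, max, or min can lead to different replica counts for the same data.

```python
import math
from statistics import mean

# Hypothetical mapping of aggregation names to functions.
# This is an illustrative assumption, not Ray Serve's internal table.
AGGREGATORS = {"mean": mean, "max": max, "min": min}


def aggregate_window(samples, aggregation_function="mean"):
    """Collapse a window of load samples into a single number."""
    if not samples:
        return 0.0
    return float(AGGREGATORS[aggregation_function](samples))


def desired_replicas(samples, target_per_replica, min_replicas, max_replicas,
                     aggregation_function="mean"):
    """Hypothetical sizing rule: aggregated load / per-replica target, clamped."""
    load = aggregate_window(samples, aggregation_function)
    raw = math.ceil(load / target_per_replica)
    return max(min_replicas, min(max_replicas, raw))


# Made-up window of total ongoing requests observed at three points in time.
window = [4, 10, 6]
for name in ("mean", "max", "min"):
    print(name, desired_replicas(window, target_per_replica=2,
                                 min_replicas=1, max_replicas=10,
                                 aggregation_function=name))
```

In this sketch, max reacts to the single spike in the window, min tracks the quietest sample, and mean sits in between, which is the kind of trade-off the new docs section is meant to make explicit.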

@abrarsheikh abrarsheikh added the "go" label (add ONLY when ready to merge, run all tests) Oct 12, 2025
Signed-off-by: abrar <[email protected]>
@abrarsheikh abrarsheikh marked this pull request as ready for review October 12, 2025 23:24
@abrarsheikh abrarsheikh requested review from a team as code owners October 12, 2025 23:24
@ray-gardener ray-gardener bot added the "serve" (Ray Serve Related Issue) and "docs" (An issue or change related to documentation) labels Oct 13, 2025