Clickhouse Cloud production readiness guide #4706
Blargian left a comment
Sentence casing please :-)
Left some comments for a few places which seem a bit vague unless linked to further resources.
## Environment Strategy {#environment-strategy}

Establish separate environments to safely test changes before impacting production workloads. Most production incidents trace back to untested queries or configuration changes deployed directly to production systems.
Maybe it's obvious already, but I think we should specify that a separate "environment" will be a separate Cloud service.
**Testing**: Test queries in staging before production deployment. Queries that work on small datasets often cause memory exhaustion, excessive CPU usage, or slow execution at production scale. Validate configuration changes including user permissions, quotas, and service settings in staging; configuration errors discovered in production create immediate operational incidents.

**Sizing**: Size your staging service to approximate production load characteristics. Testing on significantly smaller infrastructure may not reveal resource contention or scaling issues. Use production-representative datasets through periodic data refreshes or synthetic data generation.
Can we cross-link to the scaling docs here? The reader might not know how to size their staging service.
## Next Steps {#next-steps}

After implementing the integrations and procedures in this guide, focus on optimization and operational maturity. Review query patterns and optimize table structures for performance. Implement network isolation, audit logging, and compliance controls required for your industry. Establish runbooks for common scenarios, implement automated testing for schema changes, and build internal knowledge bases for your team.
Where can I learn more about how to implement network isolation, audit logging, and compliance controls required for my industry?
Co-authored-by: Shaun Struwig <[email protected]>
Summary
Clickhouse Cloud production readiness guide
Checklist