executor, util/memory: keep global analyze memory usage non-negative #65503

wjhuang2016 · 2026-01-09T03:20:01Z

What problem does this PR solve?

Issue Number: close #65502

Problem Summary:
Global analyze memory "in-use" can temporarily become negative during Analyze v2 because of the worker cleanup order. This can produce confusing metrics and violates invariants in internal-check builds.

What changed and how does it work?

Adjust AnalyzeColumnsExecV2 worker cleanup so buffered Consume is applied before buffered Release.
Add an internal assertion to ensure LabelForGlobalAnalyzeMemory (in-use) is never negative.
Add a regression test for Analyze v2 memory usage invariant.
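
To illustrate why the order matters, here is a minimal sketch using a hypothetical `toyTracker` (not TiDB's `memory.Tracker`; all names below are made up for illustration). It applies the same buffered totals in both orders and reports the lowest in-use value seen along the way.

```go
package main

import "fmt"

// toyTracker is a hypothetical stand-in for a memory tracker. It is NOT
// TiDB's memory.Tracker. It remembers the lowest in-use value observed.
type toyTracker struct {
	inUse int64
	min   int64
}

// Consume adds n bytes to the in-use counter.
func (t *toyTracker) Consume(n int64) { t.add(n) }

// Release subtracts n bytes from the in-use counter.
func (t *toyTracker) Release(n int64) { t.add(-n) }

func (t *toyTracker) add(delta int64) {
	t.inUse += delta
	if t.inUse < t.min {
		t.min = t.inUse
	}
}

// flush applies a worker's buffered totals in the chosen order and returns
// the lowest in-use value seen while doing so.
func flush(consumeFirst bool, buffered int64) int64 {
	t := &toyTracker{}
	if consumeFirst {
		t.Consume(buffered)
		t.Release(buffered)
	} else {
		t.Release(buffered)
		t.Consume(buffered)
	}
	return t.min
}

func main() {
	fmt.Println(flush(false, 1024)) // Release first: prints -1024
	fmt.Println(flush(true, 1024))  // Consume first: prints 0
}
```

Both orders end at zero, so only an observer that samples the counter between the two calls (a metric scrape or an assertion) would notice the difference.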

Check List

Tests

Unit test
Integration test
Manual test (add detailed scripts or steps below)
No need to test
- I checked and no code files have been changed.

Side effects

Performance regression: Consumes more CPU
Performance regression: Consumes more Memory
Breaking backward compatibility

Documentation

Release note

Please refer to Release Notes Language Style Guide to write a quality release note.

None

codecov · 2026-01-09T03:51:26Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 66.3495%. Comparing base (3455e86) to head (3c99c89).
⚠️ Report is 94 commits behind head on master.

Additional details and impacted files

@@               Coverage Diff                @@
##             master     #65503        +/-   ##
================================================
- Coverage   70.8020%   66.3495%   -4.4526%     
================================================
  Files          1901       1958        +57     
  Lines        518502     538869     +20367     
================================================
- Hits         367110     357537      -9573     
- Misses       126860     158634     +31774     
+ Partials      24532      22698      -1834

Flag	Coverage Δ
integration	`41.5173% <100.0000%> (-6.6495%)`	⬇️

Components	Coverage Δ
dumpling	`52.8700% <ø> (ø)`
parser	`∅ <ø> (∅)`
br	`36.4013% <ø> (-21.8285%)`	⬇️

D3Hunter · 2026-01-09T03:59:04Z

pkg/executor/analyze_col_v2.go

-	defer e.memTracker.Consume(bufferedMemSize)
-	defer e.memTracker.Release(bufferedReleaseSize)


In this version we always consume 0 and release 0, because the arguments of a deferred call are evaluated when the defer statement executes, not when the deferred function runs.
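
A small standalone sketch of this Go behavior follows. The variable names mirror the diff, but `note` and `deferredArgs` are hypothetical helpers, not code from the PR.

```go
package main

import "fmt"

// deferredArgs returns what two deferred calls actually receive when the
// variables are only updated after the defer statements have executed.
func deferredArgs() []string {
	var calls []string
	// note is a hypothetical recorder standing in for Consume/Release.
	note := func(name string, n int64) {
		calls = append(calls, fmt.Sprintf("%s(%d)", name, n))
	}
	func() {
		var bufferedMemSize, bufferedReleaseSize int64
		defer note("Consume", bufferedMemSize)     // argument fixed at 0 here
		defer note("Release", bufferedReleaseSize) // argument fixed at 0 here
		bufferedMemSize = 4096
		bufferedReleaseSize = 4096
	}()
	return calls
}

func main() {
	fmt.Println(deferredArgs()) // prints [Release(0) Consume(0)]
}
```

The output also shows that deferred calls run in last-in, first-out order, so the `Release` registered second runs before the `Consume` registered first.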

wjhuang2016 · 2026-01-09T04:50:34Z

Addressed the review comment from @D3Hunter: the worker cleanup now uses a deferred closure so bufferedMemSize/bufferedReleaseSize are evaluated at defer execution time (not at defer registration time), and we execute Consume before Release to avoid transient negative in-use values.
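
The shape of that change can be sketched as below. This is an illustration under assumptions, not the actual patch: `note` and `cleanupCalls` are hypothetical, and the byte counts are arbitrary.

```go
package main

import "fmt"

// cleanupCalls returns the calls made by a single deferred closure that
// reads the buffered totals when the enclosing function exits.
func cleanupCalls() []string {
	var calls []string
	// note is a hypothetical recorder standing in for Consume/Release.
	note := func(name string, n int64) {
		calls = append(calls, fmt.Sprintf("%s(%d)", name, n))
	}
	func() {
		var bufferedMemSize, bufferedReleaseSize int64
		defer func() {
			// The closure captures the variables, so it sees their final
			// values. Consume runs first so in-use cannot dip below zero.
			note("Consume", bufferedMemSize)
			note("Release", bufferedReleaseSize)
		}()
		bufferedMemSize = 4096
		bufferedReleaseSize = 1024
	}()
	return calls
}

func main() {
	fmt.Println(cleanupCalls()) // prints [Consume(4096) Release(1024)]
}
```

Putting both calls in one closure makes the order explicit in the source instead of depending on the reverse order in which separate defers run.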

wjhuang2016 · 2026-01-09T04:50:43Z

/retest-required

wjhuang2016 · 2026-01-09T09:18:21Z

/retest

ti-chi-bot · 2026-01-09T09:21:30Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: winoros

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [winoros]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ti-chi-bot · 2026-01-09T09:21:31Z

[LGTM Timeline notifier]

Timeline:

2026-01-09 09:21:30.720288112 +0000 UTC m=+3734.782153021: ☑️ agreed by winoros.

winoros · 2026-01-09T09:26:00Z

/hold for discussing changes around the test case

pkg/executor/test/analyzetest/memorycontrol/memory_control_test.go

winoros · 2026-01-09T09:45:19Z

pkg/executor/test/analyzetest/memorycontrol/memory_control_test.go

+	tk.MustExec("set @@tidb_build_sampling_stats_concurrency=1")
+	tk.MustExec("use test")
+	tk.MustExec("drop table if exists t_mem_usage")
+	tk.MustExec("create table t_mem_usage(a text collate utf8mb4_general_ci)")


We can either:

- Change the type to varchar, or
- Explicitly set the variable tidb_analyze_skip_column_types to ensure that the text type is not skipped when building stats.

tiprow · 2026-01-09T12:48:07Z

@wjhuang2016: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
tidb_parser_test	`3c99c89`	link	true	`/test tidb_parser_test`
fast_test_tiprow	`3c99c89`	link	true	`/test fast_test_tiprow`

winoros · 2026-01-09T12:51:38Z

/unhold

ti-chi-bot · 2026-01-09T13:39:02Z

@wjhuang2016: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
idc-jenkins-ci-tidb/mysql-test	`3c99c89`	link	true	`/test mysql-test`
pull-unit-test-next-gen	`3c99c89`	link	true	`/test pull-unit-test-next-gen`
idc-jenkins-ci-tidb/check_dev_2	`3c99c89`	link	true	`/test check-dev2`
idc-jenkins-ci-tidb/unit-test	`3c99c89`	link	true	`/test unit-test`

executor, util/memory: keep global analyze memory usage non-negative

71e7621

ti-chi-bot bot added the release-note-none (denotes a PR that doesn't merit a release note), do-not-merge/needs-triage-completed, and size/M (denotes a PR that changes 30-99 lines, ignoring generated files) labels Jan 9, 2026

bazel: update BUILD files for analyze memory assert

085562d

wjhuang2016 mentioned this pull request Jan 9, 2026

analyze: global analyze memory in-use metric can become negative (Analyze v2) #65502

Open

D3Hunter reviewed Jan 9, 2026

winoros approved these changes Jan 9, 2026

ti-chi-bot bot added the needs-1-more-lgtm Indicates a PR needs 1 more LGTM. label Jan 9, 2026

ti-chi-bot bot added the approved label Jan 9, 2026

ti-chi-bot bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 9, 2026

winoros reviewed Jan 9, 2026

test: prevent skipping large samples in analyze v2 memory test

3c99c89

ti-chi-bot bot removed the do-not-merge/needs-triage-completed label Jan 9, 2026

ti-chi-bot bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 9, 2026
