MDEV-36226 Stall and crash when page cleaner fails to generate free pages during Async flush #3885

Open
wants to merge 1 commit into base: 10.6

Conversation

mariadb-DebarunBanerjee (Contributor)

  • The Jira issue number for this PR is: MDEV-36226

Description

During a regular iteration, the page cleaner flushes from the flush list with some flush target and then generates free pages from the LRU tail. When asynchronous flush is triggered, i.e. when 7/8th of the LSN margin in the redo log is filled, the flush target for the flush list is set to innodb_io_capacity_max. If the page cleaner could flush the entire target, the flush bandwidth for the LRU flush is currently set to zero. If the LRU tail has dirty pages, the page cleaner then ends up freeing no pages in that iteration. This scenario can repeat across multiple iterations until the async flush target is reached. During this time the database is starved of free pages, resulting in an apparent stall and, in some cases, a dict_sys latch fatal error.

Fix: In the page cleaner iteration, before the LRU flush, ensure we provide a large enough flush limit so that freeing pages is not blocked by dirty pages in the LRU tail. Also log the IO and flush state if the doublewrite flush wait is long. A rough sketch of the intended logic follows.
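
As a minimal sketch only, not the actual patch: the loop shape loosely follows the page cleaner in storage/innobase/buf/buf0flu.cc, while page_cleaner_iteration() is a made-up name and buf_flush_list()/buf_flush_LRU() are shown with simplified signatures.

  #include <algorithm>

  /* Sketch only: simplified stand-ins for InnoDB's flush-list and
     LRU flush entry points; the real signatures differ. */
  static void page_cleaner_iteration(ulint flush_list_target)
  {
    /* During async flush the flush-list target is raised to
       innodb_io_capacity_max; previously, meeting it in full left
       zero bandwidth for the LRU flush below. */
    const ulint flushed= buf_flush_list(flush_list_target);

    /* Fix idea: always reserve a minimum LRU budget so that freeing
       pages from the LRU tail is not blocked by dirty pages there,
       even when the flush-list flush consumed its whole target.
       The srv_io_capacity / 2 floor is illustrative. */
    ulint lru_budget= srv_LRU_scan_depth;
    if (flushed >= flush_list_target)
      lru_budget= std::max<ulint>(lru_budget, srv_io_capacity / 2);
    buf_flush_LRU(lru_budget);
  }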

Impact: It could result in increased IO due to LRU flushing in specific cases.

Release Notes

None

How can this PR be tested?

Regular InnoDB tests should cover the path. Performance and stress tests should be run to judge the possible impact.

Reproducing the base issue would require a large buffer pool, a long run, and synchronization between foreground and InnoDB background threads.

Basing the PR against the correct MariaDB version

  • This is a new feature or a refactoring, and the PR is based against the main branch.
  • This is a bug fix, and the PR is based against the earliest maintained branch in which the bug can be reproduced.

PR quality check

  • I checked the CODING_STANDARDS.md file and my PR conforms to this where appropriate.
  • For any trivial modifications to the PR, I am ok with the reviewer making the changes themselves.


@dr-m (Contributor) left a comment

The change to the logic looks reasonable to me, but in the diagnostic output I’d avoid excessive numbers of calls to operator<<() and use the common logging functions sql_print_warning() or sql_print_information().
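
For instance, a chain of operator<<() calls can usually be collapsed into one printf-style call. A sketch only; the message text and the pending_reads/pending_writes counters are made up here, not taken from the patch:

  /* Hypothetical example of the suggested style; pending_reads and
     pending_writes stand for whatever size_t state is being logged. */
  sql_print_warning("InnoDB: Long doublewrite flush wait: "
                    "%zu pending reads, %zu pending writes",
                    pending_reads, pending_writes);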

Comment on lines 2697 to 2704
sql_print_information("Innodb: LSN flush parameters\n"
"-------------------\n"
"System LSN : %" PRIu64 "\n"
"Checkpoint LSN: %" PRIu64 "\n"
"Flush ASync LSN: %" PRIu64 "\n"
"Flush Sync LSN: %" PRIu64 "\n"
"-------------------",
lsn, clsn, buf_flush_async_lsn, buf_flush_sync_lsn);

Usually all InnoDB messages are prefixed by InnoDB: (note the case). Do we need this many rows for the output? You need to write uint64_t{buf_flush_async_lsn} or similar to avoid compilation errors:

error: cannot pass object of non-trivial type 'Atomic_relaxed<lsn_t>' (aka 'Atomic_relaxed<unsigned long long>') through variadic function; call will abort at runtime [-Wnon-pod-varargs]
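
A sketch of a message following those suggestions, with the reviewer's uint64_t{} conversion so the atomics can pass through the C-style varargs call; the exact wording is illustrative:

  /* Atomic_relaxed<lsn_t> cannot travel through varargs; convert to
     a trivial integer type first, e.g. with uint64_t{...}. */
  sql_print_information("InnoDB: log LSN %" PRIu64
                        ", checkpoint LSN %" PRIu64
                        ", flush async LSN %" PRIu64
                        ", flush sync LSN %" PRIu64,
                        lsn, clsn,
                        uint64_t{buf_flush_async_lsn},
                        uint64_t{buf_flush_sync_lsn});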

ulint lru_size= UT_LIST_GET_LEN(LRU);
ulint dirty_size= UT_LIST_GET_LEN(flush_list);
ulint free_size= UT_LIST_GET_LEN(free);
ulint dirty_pct= lru_size ? dirty_size * 100 / (lru_size + free_size) : 0;

dirty_pct seems to be redundant information that can be calculated from the rest. It could also be totally misleading, because we were reading these fields without proper mutex or flush_list_mutex protection.
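
A sketch of how the raw counts could be gathered consistently instead, dropping dirty_pct and assuming the 10.6 buf_pool.mutex / buf_pool.flush_list_mutex names:

  /* Read each list length under the mutex that protects it; a log
     reader can derive any percentage from the raw counts. */
  mysql_mutex_lock(&buf_pool.mutex);
  const ulint lru_size= UT_LIST_GET_LEN(buf_pool.LRU);
  const ulint free_size= UT_LIST_GET_LEN(buf_pool.free);
  mysql_mutex_unlock(&buf_pool.mutex);

  mysql_mutex_lock(&buf_pool.flush_list_mutex);
  const ulint dirty_size= UT_LIST_GET_LEN(buf_pool.flush_list);
  mysql_mutex_unlock(&buf_pool.flush_list_mutex);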

Commit message: MDEV-36226 Stall and crash when page cleaner fails to generate free pages during Async flush

During regular iteration the page cleaner flushes from the flush list
with some flush target and then generates free pages from the LRU
tail. When asynchronous flush is triggered, i.e. when 7/8th of the
LSN margin in the redo log is filled, the flush target for the flush
list is set to innodb_io_capacity_max. If it could flush the entire
target, the flush bandwidth for the LRU flush is currently set to
zero. If the LRU tail has dirty pages, the page cleaner ends up
freeing no pages in that iteration. The scenario could repeat across
multiple iterations until the async flush target is reached. During
this time the DB system is starved of free pages, resulting in an
apparent stall and, in some cases, a dict_sys latch fatal error.

Fix: In the page cleaner iteration, before the LRU flush, ensure we
provide a large enough flush limit so that freeing pages is not
blocked by dirty pages in the LRU tail. Log the IO and flush state if
the doublewrite flush wait is long.

Reviewed by: Marko Mäkelä