Verify flushed data are recovered upon reopen in crash test #12787

hx235 · 2024-06-21T08:28:05Z

Context/Summary:

This is to solve #12152. We persist the largest flushed seqno before crash just like how we persist the ExpectedState. And we verify the db lates seqno after recovery is no smaller than this flushed seqno.

Test:

Manually observe that the persisted sequence after flush completion is used to verify db's latest sequence
python3 tools/db_crashtest.py --simple blackbox --interval=30
CI

facebook-github-bot · 2024-06-21T08:28:29Z

@hx235 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-06-21T21:06:43Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-06-21T21:10:02Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-06-22T18:00:02Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-06-22T18:00:39Z

@hx235 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-06-24T23:20:00Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-06-26T23:57:33Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-07-22T22:34:50Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-07-22T22:35:03Z

@hx235 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-12-19T05:16:50Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

db_stress_tool/db_stress_listener.h

db_stress_tool/db_stress_shared_state.h

db_stress_tool/db_stress_test_base.cc

db_stress_tool/expected_state.cc

db_stress_tool/expected_state.h

db_stress_tool/db_stress_test_base.cc

archang19

This looks generally right to me.

I want to read through #12152 again to understand it better.

Some questions:

From #12152 (comment)

We should update the expected minimum value of that counter for (1) explicit flushes, and (2) OnFlushCompleted() events

Will a separate PR handle case 1? I only see case 2 being handled

From #12152 (comment)

WAL-disabled, atomic_flush-disabled may be worth testing at some future point but that is still an open topic to discuss. My thoughts are here - #11841 (comment).

Another case that may be worth testing at some point is WAL-disabled, atomic_flush-enabled, and manual flush triggered on a subset of column families (what you suggested we are already doing).

What is the current thinking on how many of these WAL {enabled, disabled}, atomic flush {enabled, disabled} combinations we will try to test?

facebook-github-bot · 2024-12-20T04:26:05Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-12-20T04:53:53Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-12-20T04:55:53Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

db_stress_tool/db_stress_shared_state.h

archang19

LGTM. I think we should be fine as long as we run the crash test for enough time before merging.

facebook-github-bot · 2024-12-20T22:44:58Z

@hx235 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-12-24T23:51:32Z

@hx235 has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2024-12-24T23:52:20Z

@hx235 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-12-25T00:30:38Z

@hx235 merged this pull request in e3024e7.

facebook-github-bot added the CLA Signed label Jun 21, 2024

hx235 force-pushed the verify_flushed_data_recovery branch from 6e5f737 to a4314e7 Compare June 21, 2024 21:06

hx235 force-pushed the verify_flushed_data_recovery branch from a4314e7 to 4be510e Compare June 22, 2024 17:59

hx235 requested a review from ajkr June 22, 2024 21:13

hx235 force-pushed the verify_flushed_data_recovery branch from 800b609 to 1dddd4a Compare July 22, 2024 22:34

hx235 force-pushed the verify_flushed_data_recovery branch from 1dddd4a to 9f98a86 Compare December 19, 2024 05:16

archang19 reviewed Dec 19, 2024

View reviewed changes

hx235 force-pushed the verify_flushed_data_recovery branch from 9f98a86 to 18d84d1 Compare December 20, 2024 04:26

hx235 force-pushed the verify_flushed_data_recovery branch from 18d84d1 to c26ae4d Compare December 20, 2024 04:53

hx235 force-pushed the verify_flushed_data_recovery branch from c26ae4d to a8d9041 Compare December 20, 2024 04:55

hx235 commented Dec 20, 2024

View reviewed changes

db_stress_tool/db_stress_shared_state.h Show resolved Hide resolved

archang19 approved these changes Dec 20, 2024

View reviewed changes

verify

fd22753

hx235 force-pushed the verify_flushed_data_recovery branch from a8d9041 to fd22753 Compare December 24, 2024 23:51

facebook-github-bot closed this in e3024e7 Dec 25, 2024

facebook-github-bot added the Merged label Dec 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verify flushed data are recovered upon reopen in crash test #12787

Verify flushed data are recovered upon reopen in crash test #12787

hx235 commented Jun 21, 2024 •

edited

Loading

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 22, 2024

facebook-github-bot commented Jun 22, 2024

facebook-github-bot commented Jun 24, 2024

facebook-github-bot commented Jun 26, 2024

facebook-github-bot commented Jul 22, 2024

facebook-github-bot commented Jul 22, 2024

facebook-github-bot commented Dec 19, 2024

archang19 left a comment •

edited

Loading

facebook-github-bot commented Dec 20, 2024

facebook-github-bot commented Dec 20, 2024

facebook-github-bot commented Dec 20, 2024

archang19 left a comment

facebook-github-bot commented Dec 20, 2024

facebook-github-bot commented Dec 24, 2024

facebook-github-bot commented Dec 24, 2024

facebook-github-bot commented Dec 25, 2024

Verify flushed data are recovered upon reopen in crash test #12787

Verify flushed data are recovered upon reopen in crash test #12787

Conversation

hx235 commented Jun 21, 2024 • edited Loading

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 22, 2024

facebook-github-bot commented Jun 22, 2024

facebook-github-bot commented Jun 24, 2024

facebook-github-bot commented Jun 26, 2024

facebook-github-bot commented Jul 22, 2024

facebook-github-bot commented Jul 22, 2024

facebook-github-bot commented Dec 19, 2024

archang19 left a comment • edited Loading

Choose a reason for hiding this comment

facebook-github-bot commented Dec 20, 2024

facebook-github-bot commented Dec 20, 2024

facebook-github-bot commented Dec 20, 2024

archang19 left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Dec 20, 2024

facebook-github-bot commented Dec 24, 2024

facebook-github-bot commented Dec 24, 2024

facebook-github-bot commented Dec 25, 2024

hx235 commented Jun 21, 2024 •

edited

Loading

archang19 left a comment •

edited

Loading