Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Chinese manual compaction doc #62

Merged
merged 24 commits into from
Jan 24, 2024

Conversation

ninsmiracle
Copy link
Contributor

update Chinese manual_compaction_doc

@acelyc111
Copy link
Member

Please add a space between Chinese and letter/digitals, see https://github.com/vinta/pangu.js

* Manual Compact总开关:
* `manual_compact.disabled`(从v1.9.0版本开始支持):如果为true,则关闭Manual Compact功能,并且取消正在执行中的Manual Compact动作。如果不设置,默认为false。
* `manual_compact.max_concurrent_running_count`(从v1.11.3版本开始支持):指定最大并发数。实际上,可执行的最大并发数由`该env参数`和`服务端MANUAL_COMPACT_THRAD_POOL的线程数`共同决定,取两者的较小值。
* 扩展 RocksDB,在 Manifest 中记录上一次执行 Manual Compact 的时间,并提供 `GetLastManualCompactFinishTime()` 方法来获取该时间。
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the implemantation before 2.1, since 2.1 (include 2.1), the metadata is stored in "meta" column family.
See apache/incubator-pegasus#535, apache/incubator-pegasus#556

You should describe both the two methods in different versions.

_docs/zh/administration/manual-compact.md Outdated Show resolved Hide resolved
_docs/zh/administration/manual-compact.md Outdated Show resolved Hide resolved
_docs/zh/administration/manual-compact.md Outdated Show resolved Hide resolved
_docs/zh/administration/manual-compact.md Outdated Show resolved Hide resolved

## 补充说明

manual compaction 常与 bulk load 功能配合使用,作为批量导入大量数据后统一优化读取的手段。在需要进行 bulk load 操作的表中,我们常将 **Usage Scenario** 参数设置为 bulk_load 模式,以便减小增加大量数据带来的性能损耗。
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Better to write the docs objectively, avoid to say "We often do ...", because this is an open source project, who is we?


manual compaction 常与 bulk load 功能配合使用,作为批量导入大量数据后统一优化读取的手段。在需要进行 bulk load 操作的表中,我们常将 **Usage Scenario** 参数设置为 bulk_load 模式,以便减小增加大量数据带来的性能损耗。

- manual-compaction 的开销要比引擎层 compaction 低,因为我们可以通过参数主动控制并发度。
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How do you infer this result? The RocksDB background compaction jobs could be tuned as well.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm going to add a limiting scenario to this result.

_docs/zh/administration/manual-compact.md Outdated Show resolved Hide resolved
_docs/zh/administration/manual-compact.md Outdated Show resolved Hide resolved

## 补充说明

manual compaction 常与 bulk load 功能配合使用,作为批量导入大量数据后统一优化读取的手段。在需要进行 bulk load 操作的表中,我们常将 **Usage Scenario** 参数设置为 bulk_load 模式,以便减小增加大量数据带来的性能损耗。
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
manual compaction 常与 bulk load 功能配合使用,作为批量导入大量数据后统一优化读取的手段。在需要进行 bulk load 操作的表中,我们常将 **Usage Scenario** 参数设置为 bulk_load 模式,以便减小增加大量数据带来的性能损耗
Manual compaction 常与 bulk load 功能配合使用,作为批量导入大量数据后的一种优化读写性能的方式。在需要进行 bulk load 操作的表中,可将 [Usage Scenario](/administration/usage-scenario) 设置为`bulk_load`模式,以便减小导入大量数据带来的性能损耗

ninsmiracle and others added 15 commits January 23, 2024 19:12
@acelyc111 acelyc111 merged commit defe785 into apache:master Jan 24, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants