
Conversation

@dlmarion dlmarion commented Oct 2, 2023

No description provided.

@dlmarion dlmarion self-assigned this Oct 2, 2023
dlmarion commented Oct 2, 2023

There is still more work to be done here, but I wanted to get some feedback on:

  1. The Thrift messaging / RPC changes
  2. The changes to Compactor (which needs to be renamed to TaskWorker) around the TASK_RUNNER_WORKER_TYPE property.

The basic gist of the design here is:

  1. A TaskWorker process (currently known as Compactor) is started with the property TASK_RUNNER_WORKER_TYPE, which has the values COMPACTION, LOG_SORTING, and SPLIT_POINT_CALCULATION. That instance of the TaskWorker only performs jobs of the configured type.
  2. The TaskWorker gets the next job from the TaskManager (currently known as the CompactionCoordinator) by making a call via the Tasks Thrift API to the Manager.
  3. The TaskManager returns the next highest priority task of the given type (COMPACTION, LOG_SORTING, SPLIT_POINT_CALCULATION).
  4. The TaskWorker executes the Task and reports status back to the TaskManager.
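
To make that loop concrete, here is a minimal sketch of the poll-execute-report cycle described above. All names (TaskWorkerLoop, TaskManagerClient, Task) are hypothetical stand-ins for illustration, not the actual classes in this PR:

```java
// Hypothetical sketch of the TaskWorker main loop described above.
// None of these types are the real PR classes; they only illustrate the flow.
public class TaskWorkerLoop {

  enum TaskType { COMPACTION, LOG_SORTING, SPLIT_POINT_CALCULATION }

  interface Task {
    String id();
    void execute() throws Exception;
  }

  /** Stand-in for the Thrift client that talks to the TaskManager. */
  interface TaskManagerClient {
    Task getNextTask(TaskType type) throws InterruptedException; // blocks until a task is available
    void reportStatus(String taskId, boolean success);
  }

  private final TaskType workerType; // from TASK_RUNNER_WORKER_TYPE
  private final TaskManagerClient manager;

  TaskWorkerLoop(TaskType workerType, TaskManagerClient manager) {
    this.workerType = workerType;
    this.manager = manager;
  }

  void run() throws InterruptedException {
    while (!Thread.currentThread().isInterrupted()) {
      // 1. Ask the TaskManager for the next highest-priority task of our type.
      Task task = manager.getNextTask(workerType);
      boolean success = false;
      try {
        // 2. Execute the task locally.
        task.execute();
        success = true;
      } catch (Exception e) {
        // real code would log and possibly back off here
      } finally {
        // 3. Report the outcome back to the TaskManager.
        manager.reportStatus(task.id(), success);
      }
    }
  }
}
```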

If I had to guess, I'm probably about 80-85% done at this point.

```diff
 public void start() {
   long interval = this.context.getConfiguration()
-      .getTimeInMillis(Property.COMPACTION_COORDINATOR_DEAD_COMPACTOR_CHECK_INTERVAL);
+      .getTimeInMillis(Property.TASK_MANAGER_DEAD_COMPACTOR_CHECK_INTERVAL);
```

Nothing to do in this PR, but I think in general we may need to reconsider some of the property prefixes. Opened #3803 for this. For this property, maybe it should have another prefix that is not related to where it's running, but is more related to the compaction functionality. Not sure though; it would be best to consider all properties at once for this instead of looking at this one in isolation.

```thrift
/*
 * Called by the Monitor to get progress information
 */
Task getRunningTasks(
```

This comment is solely for discussion, not recommending any changes. Thinking it would be best to push as much serialization into Thrift as possible for things that are common. Took a stab at this below, but not sure if it's workable.

```thrift
// renamed from WorkerType
enum TaskType {
  COMPACTION,
  LOG_SORTING,
  SPLIT_POINT_CALCULATION
}

enum TaskState {
  RUNNING,
  COMPLETE,
  FAILED
}

// used to report the status of a task
struct TaskStatus {
  1: string taskId
  2: i64 startTime        // time a worker started running the task
  3: TaskType taskType
  4: TaskState taskState
  5: binary data          // status data specific to the taskType and taskState
}

list<TaskStatus> getRunningTasks(
```
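
For what it's worth, here is one speculative Java-side sketch of how a task type could pack its details into the opaque data field of that struct; CompactionStatusData and its encoding are invented purely for illustration:

```java
import java.nio.charset.StandardCharsets;

// Hypothetical: common TaskStatus fields live in Thrift, while each task type
// packs its own details into the opaque 'data' field. All names invented here.
public class CompactionStatusData {

  private final long entriesRead;
  private final long entriesWritten;

  public CompactionStatusData(long entriesRead, long entriesWritten) {
    this.entriesRead = entriesRead;
    this.entriesWritten = entriesWritten;
  }

  /** Encode type-specific progress into the bytes carried by TaskStatus.data. */
  public byte[] serialize() {
    return (entriesRead + "," + entriesWritten).getBytes(StandardCharsets.UTF_8);
  }

  /** Decode the bytes a TaskManager or Monitor pulled out of TaskStatus.data. */
  public static CompactionStatusData deserialize(byte[] data) {
    String[] parts = new String(data, StandardCharsets.UTF_8).split(",");
    return new CompactionStatusData(Long.parseLong(parts[0]), Long.parseLong(parts[1]));
  }
}
```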

@dlmarion (Contributor Author)

I have no issue renaming WorkerType to something like TaskRunnerType. I'm open to discussing any changes.

@dlmarion dlmarion marked this pull request as ready for review October 3, 2023 17:37
@keith-turner
Contributor

@dlmarion if you have some time to chat, I would like to discuss some questions I have. Wondering about the following.

Where do things run? Maybe instead of having a task runner executable, the task runner is more of an internal code library that user-facing executable components instantiate. For example, thinking through what the user-facing accumulo commands could look like and what they could do:

```
accumulo compactor  -- runs compactions, instantiates a task runner in its impl to make this happen
accumulo tserver    -- hosts tablets and does log sorts, it runs a task runner inside of a tablet server process to do log sorts
accumulo manager    -- runs fate, assigns tablets, does split calculations... non-primary manager processes can run a task runner process to do split calculations
```
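
One way to read that suggestion is the task runner becoming a small embeddable library that each of those processes wires up for its own task type. A speculative sketch, where EmbeddedTaskRunner and TaskSource are invented names, not anything from this PR:

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical: a task runner as an embeddable library rather than a standalone
// executable. Each server process (compactor, tserver, manager) would construct
// one for the task type it handles. Names are illustrative only.
public class EmbeddedTaskRunner implements AutoCloseable {

  /** Stand-in for "where tasks come from" (e.g. a Thrift call to the manager). */
  public interface TaskSource {
    Runnable nextTask() throws InterruptedException;
  }

  private final ExecutorService pool;
  private final TaskSource source;

  public EmbeddedTaskRunner(TaskSource source, int threads) {
    this.source = source;
    this.pool = Executors.newFixedThreadPool(threads);
    for (int i = 0; i < threads; i++) {
      pool.submit(this::workLoop);
    }
  }

  private void workLoop() {
    try {
      while (!Thread.currentThread().isInterrupted()) {
        source.nextTask().run(); // fetch one task, run it to completion
      }
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
    }
  }

  @Override
  public void close() {
    pool.shutdownNow();
  }
}

// Usage inside a hypothetical tserver: new EmbeddedTaskRunner(logSortSource, 2)
// would let the tablet server do log sorts without a separate worker process.
```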

What do tasks return? Maybe nothing; could we structure all tasks such that there is no return of data to the manager? A task runner gets a task from the manager, runs it, and when done gets another task. It never reports completion or status to the manager.

  • compaction tasks run the compaction and commit it to the metadata table as part of the task
  • compaction tasks do not report stats back to the manager, but only to the metrics system. Could the monitor get the data it needs? Maybe the monitor contacts task runners directly if it wants info?
  • log sort tasks sort the logs and create the appropriate dirs in HDFS when done
  • split tasks could update the metadata table with the needed information instead of reporting back

If tasks do not return anything, then that possibly simplifies the Thrift API and the manager. We would not need to worry about keeping info in memory in the manager, keeping that info consistent, and avoiding using too much memory.
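
As an illustration of the "no return to the manager" idea, a hypothetical split task could commit its own result to the metadata table, so the only RPC left is fetching the next task. SplitTask, MetadataWriter, and findSplitPoints below are all invented names:

```java
import java.util.List;

// Hypothetical: a task that commits its own result instead of returning data
// to the manager. All names here are illustrative, not code from this PR.
public class SplitTask {

  /** Stand-in for writing mutations to the metadata table. */
  public interface MetadataWriter {
    void recordSplitPoints(String tableId, List<byte[]> splitPoints);
  }

  private final String tableId;
  private final MetadataWriter metadata;

  public SplitTask(String tableId, MetadataWriter metadata) {
    this.tableId = tableId;
    this.metadata = metadata;
  }

  public void execute() {
    List<byte[]> splits = findSplitPoints();
    // Commit the result directly to the metadata table; nothing is sent back
    // to the manager, which only ever hands out tasks.
    metadata.recordSplitPoints(tableId, splits);
  }

  private List<byte[]> findSplitPoints() {
    // placeholder for the actual split-point calculation
    return List.of();
  }
}
```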

dlmarion commented Oct 6, 2023

> @dlmarion if you have some time to chat, I would like to discuss some questions I have. Wondering about the following.

Yeah, let's find a time.

> Where do things run? Maybe instead of having a task runner executable, the task runner is more of an internal code library that user-facing executable components instantiate. For example, thinking through what the user-facing accumulo commands could look like and what they could do:

> ```
> accumulo compactor  -- runs compactions, instantiates a task runner in its impl to make this happen
> accumulo tserver    -- hosts tablets and does log sorts, it runs a task runner inside of a tablet server process to do log sorts
> accumulo manager    -- runs fate, assigns tablets, does split calculations... non-primary manager processes can run a task runner process to do split calculations
> ```

With on-demand tablets it's possible that a user only has enough TabletServers running to host the root and metadata tables - that would be my only concern with doing log sorting in the TabletServer. I do like the idea of non-primary managers doing some of this work (which, BTW, I had working a while back in #3262). If we could leverage the non-primary managers, then this PR could likely be closed, leaving the compactors the way they are today. I think we should explore that idea sooner rather than later. There was another item mentioned in #3796 - compaction selection functionality. If we performed compaction selection and split calculations in the non-primary manager and left compactors the way they are today, then we would just need a server component to run log sorting. IIRC from #3262, you can have multiple non-primary managers.

> What do tasks return? Maybe nothing; could we structure all tasks such that there is no return of data to the manager? A task runner gets a task from the manager, runs it, and when done gets another task. It never reports completion or status to the manager.

> • compaction tasks run the compaction and commit it to the metadata table as part of the task
> • compaction tasks do not report stats back to the manager, but only to the metrics system. Could the monitor get the data it needs? Maybe the monitor contacts task runners directly if it wants info?
> • log sort tasks sort the logs and create the appropriate dirs in HDFS when done
> • split tasks could update the metadata table with the needed information instead of reporting back

> If tasks do not return anything, then that possibly simplifies the Thrift API and the manager. We would not need to worry about keeping info in memory in the manager, keeping that info consistent, and avoiding using too much memory.

keith-turner commented Oct 6, 2023

> If we could leverage the non-primary managers, then this PR could likely be closed, leaving the compactors the way they are today.

I think this PR is still useful, but maybe scoped down. Thinking of this at a really high level and seeing the following. What I am trying to puzzle out is where actual code links into these high-level concepts. Also, is the following the complete high-level picture?

Generalizing distributed work in Accumulo

  1. Work must be found; currently TGW + TabletManagementIterator for log sort and compaction
  2. Work must be partitioned and prioritized (multiple in-memory bounded priority queues that can replace lower-priority entries)
  3. Workers need to find/request work (the Thrift task RPCs in this PR)
  4. Work needs to be done (may need to emit status and/or metrics about the work)
  5. Results of work need to be committed (this is highly dependent on the work)
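
Those five steps could map onto code roughly like the following speculative sketch; none of these interfaces exist in the PR, they only illustrate the shape:

```java
import java.util.Iterator;

// Hypothetical interfaces mirroring the five steps above; purely illustrative.
public interface DistributedWork {

  interface WorkItem {
    String id();
    short priority();
    void execute() throws Exception;     // step 4: do the work
    void commit() throws Exception;      // step 5: commit the result (work-specific)
  }

  /** Step 1: find work (e.g. by scanning tablet metadata). */
  interface WorkFinder {
    Iterator<WorkItem> findWork();
  }

  /** Step 2: partition + prioritize (bounded queue that can drop low-priority items). */
  interface WorkQueue {
    void offer(WorkItem item);           // may displace a lower-priority entry when full
    WorkItem takeHighestPriority(String partition) throws InterruptedException;
  }

  /** Step 3: workers request work (the Thrift task RPCs in this PR). */
  interface WorkClient {
    WorkItem requestWork(String workerType) throws InterruptedException;
  }
}
```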

dlmarion commented Oct 6, 2023

> If we could leverage the non-primary managers, then this PR could likely be closed, leaving the compactors the way they are today.

> I think this PR is still useful, but maybe scoped down. Thinking of this at a really high level and seeing the following. What I am trying to puzzle out is where actual code links into these high-level concepts. Also, is the following the complete high-level picture?
>
> Generalizing distributed work in Accumulo
>
> 1. Work must be found; currently TGW + TabletManagementIterator for log sort and compaction
> 2. Work must be partitioned and prioritized (multiple in-memory bounded priority queues that can replace lower-priority entries)
> 3. Workers need to find/request work (the Thrift task RPCs in this PR)
> 4. Work needs to be done (may need to emit status and/or metrics about the work)
> 5. Results of work need to be committed (this is highly dependent on the work)

There is also the issue of checking for dead workers so that their tasks can be queued up again. We have this today in the Manager with the DeadCompactionDetector.
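
For reference, a minimal sketch of that kind of check, in the spirit of the existing DeadCompactionDetector; LiveWorkerView and TaskQueue are hypothetical names:

```java
import java.util.Map;
import java.util.Set;

// Hypothetical dead-worker check: if the worker assigned to a running task is
// no longer alive, requeue the task. Illustrative only; the real code today is
// the Manager's DeadCompactionDetector.
public class DeadWorkerCheck implements Runnable {

  public interface LiveWorkerView {
    Set<String> liveWorkers();                  // e.g. derived from ZooKeeper locks
  }

  public interface TaskQueue {
    Map<String, String> runningTasksByWorker(); // taskId -> workerId
    void requeue(String taskId);
  }

  private final LiveWorkerView workers;
  private final TaskQueue tasks;

  public DeadWorkerCheck(LiveWorkerView workers, TaskQueue tasks) {
    this.workers = workers;
    this.tasks = tasks;
  }

  @Override
  public void run() { // scheduled periodically, e.g. at the dead check interval
    Set<String> alive = workers.liveWorkers();
    tasks.runningTasksByWorker().forEach((taskId, workerId) -> {
      if (!alive.contains(workerId)) {
        tasks.requeue(taskId); // the worker died; make the task available again
      }
    });
  }
}
```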

@ctubbsii ctubbsii added this to the 4.0.0 milestone Jul 12, 2024
@dlmarion dlmarion changed the base branch from elasticity to main August 26, 2024 12:17


Development

Successfully merging this pull request may close these issues.

Convert Compactor/Coordinator into generic distributed job execution service
