
Make better use of available cpu on larger VMs #57

Merged 2 commits from cpu-parallelism into develop on Nov 18, 2024
Conversation

istreeter (Collaborator)

These changes allow the loader to better utilize all the CPU available on a larger instance.

**1. CPU-intensive parsing/transforming is now parallelized.** Parallelism is configured by a new config parameter `cpuParallelismFraction`. The actual parallelism is chosen dynamically based on the number of available CPUs, so the default value should be appropriate for VMs of all sizes.

**2. We now open a new Snowflake ingest client per channel.** Note that the Snowflake SDK recommends re-using a single Client per VM and opening multiple Channels on that Client, so here we are going against the recommendation. We justify it because it gives the loader better visibility of when the client's Future completes, signifying a completed write to Snowflake.

**3. Upload parallelism is chosen dynamically.** Larger VMs benefit from higher upload parallelism in order to keep up with the faster rate of batches produced by the CPU-intensive tasks. Parallelism is configured by a new parameter `uploadParallelismFactor`, which is multiplied by the number of available CPUs. The default value should be appropriate for VMs of all sizes.

These new settings have been tested on pods ranging from 0.6 to 8 available CPUs.
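The sizing logic described above can be sketched as follows. This is a hypothetical illustration, not the loader's actual code: the function names, the example fraction/factor values, and the rounding behavior are all assumptions.

```scala
object ParallelismSketch {
  // Hypothetical: effective CPU parallelism = availableCpu * cpuParallelismFraction,
  // rounded up and never below 1, so tiny pods still make progress.
  def chooseCpuParallelism(availableCpu: Double, cpuParallelismFraction: Double): Int =
    math.max(1, math.ceil(availableCpu * cpuParallelismFraction).toInt)

  // Hypothetical: upload parallelism = availableCpu * uploadParallelismFactor,
  // also rounded up and never below 1, so larger VMs get more concurrent uploads.
  def chooseUploadParallelism(availableCpu: Double, uploadParallelismFactor: Double): Int =
    math.max(1, math.ceil(availableCpu * uploadParallelismFactor).toInt)

  def main(args: Array[String]): Unit = {
    // A pod with 0.6 CPU still gets parallelism of at least 1,
    // while an 8-CPU VM scales up proportionally.
    println(chooseCpuParallelism(0.6, 0.75))
    println(chooseCpuParallelism(8.0, 0.75))
    println(chooseUploadParallelism(8.0, 2.0))
  }
}
```

Deriving both values from `Runtime.getRuntime.availableProcessors()` (or the container's CPU quota) at startup is what makes one default work across VM sizes.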


@benjben left a comment:


Looks great 👍

# -- name to use for the snowflake channel.
# -- Prefix to use for the snowflake channels.
# -- The full name will be suffixed with a number, e.g. `snowplow-1`
# -- The prefix be unique per loader VM

Suggested change
# -- The prefix be unique per loader VM
# -- The prefix must be unique per loader VM

maybe?
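The comment thread above describes the channel-naming scheme: a configured prefix suffixed with a number, e.g. `snowplow-1`, where the prefix must be unique per loader VM. A minimal sketch of that scheme, with the object and function names assumed for illustration:

```scala
object ChannelNaming {
  // Hypothetical sketch: each channel name is the configured prefix plus an index.
  // The prefix itself must be unique per loader VM so channels never collide
  // across VMs, while the index distinguishes channels within one VM.
  def channelName(prefix: String, index: Int): String = s"$prefix-$index"
}
```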

@@ -75,10 +77,17 @@
# - Events are emitted to Snowflake for a maximum of this duration, even if the `maxBytes` size has not been reached
"maxDelay": "1 second"

# - How many batches can we send simultaneously over the network to Snowflake.
"uploadConcurrency": 1
# - Controls ow many batches can we send simultaneously over the network to Snowflake.
Suggested change
# - Controls ow many batches can we send simultaneously over the network to Snowflake.
# - Controls how many batches can we send simultaneously over the network to Snowflake.

@@ -152,8 +144,8 @@ object Processing {
}

/** Parse raw bytes into Event using analytics sdk */

Suggested change
/** Parse raw bytes into Event using analytics sdk */

@istreeter istreeter merged commit c462da4 into develop Nov 18, 2024
2 checks passed
@istreeter istreeter deleted the cpu-parallelism branch November 18, 2024 16:55
istreeter added a commit that referenced this pull request Nov 26, 2024