Flytekit: Rename map_task to map, replace min_successes and min_success_ratio with tolerance, rename max_parallelism to concurrency #3107

ChihTsungLu · 2025-02-04T12:55:04Z

Tracking issue

Related to flyteorg/flyte#6139

Why are the changes needed?

The current Flytekit has several areas that could be improved for a better developer experience:

The map_task name is unnecessarily verbose when imported via the recommended import flytekit as fl
The failure tolerance parameters (min_successes and min_success_ratio) are powerful but overly verbose
The max_parallelism parameter naming in workflow and LaunchPlan needs to be aligned with map_task's concurrency parameter

What changes were proposed in this pull request?

Rename map_task to map
- While this conflicts with Python's built-in map, it's acceptable since we recommend using import flytekit as fl
- All changes will maintain backwards compatibility
Simplify failure tolerance parameters
- Deprecate min_successes and min_success_ratio
- Introduce new tolerance parameter that accepts both float and int types
- Maintain backwards compatibility with existing parameters
Standardize parallelism parameter
- Deprecate max_parallelism argument in workflow and LaunchPlan
- Introduce new concurrency parameter to match map_task's parameter
- Maintain backwards compatibility with existing parameter

Known issue

The changes introduce the concurrency field in Flytekit, which is not currently defined in flyteidl's LaunchPlanSpec

<img width="1561" alt="valueError" src="https://github.com/user-attachments/assets/e794e7d0-6393-4009-a320-988fdd1769cb" />

Code to Address the Issue:
The following code handles the transition between the concurrency and max_parallelism fields:

    @classmethod
    def from_flyte_idl(cls, pb2):
        """
        :param flyteidl.admin.launch_plan_pb2.LaunchPlanSpec pb2:
        :rtype: LaunchPlanSpec
        """

        auth_role = None
        # First check the newer field, auth_role.
        if pb2.auth_role is not None and (pb2.auth_role.assumable_iam_role or pb2.auth_role.kubernetes_service_account):
            auth_role = _common.AuthRole.from_flyte_idl(pb2.auth_role)
        # Fallback to the deprecated field.
        elif pb2.auth is not None:
            if pb2.auth.assumable_iam_role:
                auth_role = _common.AuthRole(assumable_iam_role=pb2.auth.assumable_iam_role)
            else:
                auth_role = _common.AuthRole(assumable_iam_role=pb2.auth.kubernetes_service_account)

        # Handle concurrency/max_parallelism transition
        concurrency = None
        max_parallelism = None

        if hasattr(pb2, "concurrency"):
            try:
                if pb2.HasField("concurrency"):
                    concurrency = pb2.concurrency
            except ValueError:
                pass  # Field doesn't exist in protobuf yet

        # Fallback to max_parallelism (deprecated field)
        if hasattr(pb2, "max_parallelism"):
            max_parallelism = pb2.max_parallelism

        # Use concurrency if available, otherwise use max_parallelism
        final_concurrency = concurrency if concurrency is not None else max_parallelism

        return cls(
            workflow_id=_identifier.Identifier.from_flyte_idl(pb2.workflow_id),
            entity_metadata=LaunchPlanMetadata.from_flyte_idl(pb2.entity_metadata),
            default_inputs=_interface.ParameterMap.from_flyte_idl(pb2.default_inputs),
            fixed_inputs=_literals.LiteralMap.from_flyte_idl(pb2.fixed_inputs),
            labels=_common.Labels.from_flyte_idl(pb2.labels),
            annotations=_common.Annotations.from_flyte_idl(pb2.annotations),
            auth_role=auth_role,
            raw_output_data_config=_common.RawOutputDataConfig.from_flyte_idl(pb2.raw_output_data_config),
            concurrency=final_concurrency,
            max_parallelism=pb2.max_parallelism,
            security_context=security.SecurityContext.from_flyte_idl(pb2.security_context)
            if pb2.security_context
            else None,
            overwrite_cache=pb2.overwrite_cache if pb2.overwrite_cache else None,
        )

How was this patch tested?

Ran tests with the command: make test

Setup process

Screenshots

Check all the applicable boxes

I updated the documentation accordingly.
All new and existing tests passed.
All commits are signed-off.

Related PRs

Docs link

Summary by Bito

This PR implements API improvements in Flytekit by renaming map_task to map, introducing a tolerance parameter to replace min_successes/min_success_ratio, and standardizing parallelism control by replacing max_parallelism with concurrency. The changes include comprehensive deprecation warnings and backward compatibility handling.

Unit tests added: True

Estimated effort to review (1-5, lower is better): 5

- Rename map_task to map for simpler API - Replace min_successes/min_success_ratio with tolerance parameter - Rename max_parallelism to concurrency for consistency

flyte-bot · 2025-02-04T12:55:20Z

Code Review Agent Run #d47fe6

Actionable Suggestions - 13

tests/flytekit/unit/types/directory/test_listdir.py - 2
- Consider implications of map vs map_task · Line 4-4
- Consider using map_task for workflow operations · Line 29-29
plugins/flytekit-papermill/tests/test_task.py - 1
- Consider using map_task for notebook tasks · Line 417-417
flytekit/__init__.py - 1
- Consider maintaining backward compatibility for imports · Line 222-222
flytekit/core/array_node_map_task.py - 1
- Consider keeping descriptive function name · Line 373-373
tests/flytekit/unit/core/test_array_node_map_task.py - 8
- Map task function call change · Line 66-66
- Potential task mapping behavior change · Line 578-578
- Function rename may affect compatibility · Line 413-413
- Verify intended function call change · Line 229-230
- Consider using map_task instead of map · Line 319-319
- Consider using map_task instead of map · Line 495-495
- Verify map function usage intention · Line 536-536
- Consider using map_task instead of map · Line 578-578

Additional Suggestions - 10

flytekit/core/options.py - 3
- Consider adding concurrency parameter validation · Line 26-27
- Consider adding validation for concurrency parameter · Line 38-38
- Consider using @deprecated decorator instead · Line 43-66
tests/flytekit/unit/core/test_array_node_map_task.py - 2
- Consider using map_task for clarity · Line 83-84
- Consider using more specific function name · Line 504-518
tests/flytekit/integration/remote/workflows/basic/array_map.py - 1
- Consider potential naming confusion with map · Line 4-4
tests/flytekit/unit/core/test_array_node.py - 1
- Consider more explicit map function name · Line 186-189
flytekit/models/launch_plan.py - 2
- Consider validating concurrency value before use · Line 277-277
- Consider simplifying concurrency handling logic · Line 301-318
flytekit/tools/translator.py - 1
- Consider consolidating duplicate warning logic · Line 355-382

Review Details

Files reviewed - 24 · Commit Range: 87dfe2f..d8e5d4b
- flytekit/__init__.py
- flytekit/clis/sdk_in_container/run.py
- flytekit/core/array_node_map_task.py
- flytekit/core/launch_plan.py
- flytekit/core/options.py
- flytekit/models/execution.py
- flytekit/models/launch_plan.py
- flytekit/remote/entities.py
- flytekit/remote/remote.py
- flytekit/tools/translator.py
- plugins/flytekit-k8s-pod/tests/test_pod.py
- plugins/flytekit-papermill/tests/test_task.py
- tests/flytekit/integration/remote/workflows/basic/array_map.py
- tests/flytekit/integration/remote/workflows/basic/pydantic_wf.py
- tests/flytekit/unit/core/test_array_node.py
- tests/flytekit/unit/core/test_array_node_map_task.py
- tests/flytekit/unit/core/test_artifacts.py
- tests/flytekit/unit/core/test_interface.py
- tests/flytekit/unit/core/test_launch_plan.py
- tests/flytekit/unit/core/test_node_creation.py
- tests/flytekit/unit/core/test_partials.py
- tests/flytekit/unit/core/test_type_hints.py
- tests/flytekit/unit/remote/test_remote.py
- tests/flytekit/unit/types/directory/test_listdir.py
Files skipped - 0
Tools
- Whispers (Secret Scanner) - ✔︎ Successful
- Detect-secrets (Secret Scanner) - ✔︎ Successful
- MyPy (Static Code Analysis) - ✔︎ Successful
- Astral Ruff (Static Code Analysis) - ✔︎ Successful

AI Code Review powered by

flyte-bot · 2025-02-04T13:26:02Z

Changelist by Bito

This pull request implements the following key changes.

Key Change	Files Impacted
Feature Improvement - API Standardization and Simplification	- `__init__.py` - Renamed map_task to map and added deprecation warning - `array_node_map_task.py` - Replaced min_successes/min_success_ratio with new tolerance parameter - `launch_plan.py` - Standardized parallelism control by replacing max_parallelism with concurrency - `options.py` - Updated options to use concurrency instead of max_parallelism - `execution.py` - Added concurrency field and deprecation warnings for max_parallelism - `launch_plan.py` - Added concurrency support and backwards compatibility handling - `entities.py` - Updated remote entities to use concurrency parameter - `translator.py` - Added concurrency parameter handling in translator
Testing - Test Updates for API Changes	- `test_array_node_map_task.py` - Updated tests to use map instead of map_task - `test_array_node.py` - Updated array node tests for new map function - `test_artifacts.py` - Updated artifact tests for map function - `test_pod.py` - Updated pod tests to use map - `test_task.py` - Updated papermill tests to use map - `array_map.py` - Updated workflow tests for map function - `pydantic_wf.py` - Updated pydantic workflow tests for map
Feature Improvement - API Standardization and Simplification	- `__init__.py` - Renamed map_task to map and added deprecation warning - `array_node_map_task.py` - Replaced min_successes/min_success_ratio with new tolerance parameter - `launch_plan.py` - Standardized parallelism control by replacing max_parallelism with concurrency - `options.py` - Updated options to use concurrency instead of max_parallelism - `execution.py` - Added concurrency field and deprecation warnings for max_parallelism - `launch_plan.py` - Added concurrency support and backwards compatibility handling - `entities.py` - Updated remote entities to use concurrency parameter - `translator.py` - Added concurrency parameter handling in translator
Testing - Test Updates for API Changes	- `test_interface.py` - Updated import and usage of map instead of map_task - `test_launch_plan.py` - Updated tests to use concurrency instead of max_parallelism - `test_node_creation.py` - Updated map_task imports and usage to map - `test_partials.py` - Updated array node map task imports - `test_type_hints.py` - Updated imports and usage to use map - `test_remote.py` - Updated imports and map_task usage - `test_listdir.py` - Updated imports and map_task usage - `test_array_node_map_task.py` - Updated tests to use map instead of map_task - `test_array_node.py` - Updated array node tests for new map function - `test_artifacts.py` - Updated artifact tests for map function

flyte-bot · 2025-02-04T13:26:04Z

tests/flytekit/unit/types/directory/test_listdir.py

@@ -1,7 +1,7 @@
 import tempfile
 from pathlib import Path

-from flytekit import FlyteDirectory, FlyteFile, map_task, task, workflow
+from flytekit import FlyteDirectory, FlyteFile, map, task, workflow


Consider implications of map vs map_task

Consider if replacing map_task with map is intentional as they might have different functionality in the Flyte framework. map_task is typically used for task parallelization while map might have different semantics.

Code suggestion

Check the AI-generated fix before applying

Suggested change

from flytekit import FlyteDirectory, FlyteFile, map, task, workflow

from flytekit import FlyteDirectory, FlyteFile, map_task, task, workflow

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:05Z

tests/flytekit/unit/types/directory/test_listdir.py

@@ -26,6 +26,6 @@ def list_dir(dir: FlyteDirectory) -> list[FlyteFile]:
    def wf() -> list[str]:
        tmpdir = setup()
        files = list_dir(dir=tmpdir)
-        return map_task(read_file)(file=files)
+        return map(read_file)(file=files)


Consider using map_task for workflow operations

Consider using map_task instead of map for task mapping operations in Flytekit workflows. The map function may not provide the same task-level parallelization and execution guarantees as map_task.

Code suggestion

Check the AI-generated fix before applying

Suggested change

return map(read_file)(file=files)

return map_task(read_file)(file=files)

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:06Z

plugins/flytekit-papermill/tests/test_task.py

@@ -414,7 +414,7 @@ def create_sd() -> StructuredDataset:
 def test_map_over_notebook_task():
    @workflow
    def wf(a: float) -> typing.List[float]:
-        return map_task(nb_sub_task)(a=[a, a])
+        return map(nb_sub_task)(a=[a, a])


Consider using map_task for notebook tasks

Consider using map_task instead of map for mapping over notebook tasks. The map function may not handle notebook task specific requirements correctly.

Code suggestion

Check the AI-generated fix before applying

Suggested change

return map(nb_sub_task)(a=[a, a])

return map_task(nb_sub_task)(a=[a, a])

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:07Z

flytekit/__init__.py

 from flytekit._version import __version__
 from flytekit.configuration import Config
-from flytekit.core.array_node_map_task import map_task
+from flytekit.core.array_node_map_task import map


Consider maintaining backward compatibility for imports

Consider keeping both map_task and map imports to maintain backward compatibility. The alias is defined later but importing directly as map may break existing code that uses map_task.

Code suggestion

Check the AI-generated fix before applying

Suggested change

from flytekit.core.array_node_map_task import map

from flytekit.core.array_node_map_task import map_task

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:09Z

flytekit/core/array_node_map_task.py

@@ -369,11 +370,12 @@ def _raw_execute(self, **kwargs) -> Any:
        return outputs


-def map_task(
+def map(


Consider keeping descriptive function name

Consider keeping the original function name map_task instead of renaming to map as it could conflict with Python's built-in map function and cause confusion. The original name was more descriptive of the function's purpose.

Code suggestion

Check the AI-generated fix before applying

Suggested change

def map(

def map_task(

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:10Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -63,7 +63,7 @@ def say_hello(name: str) -> str:

    @workflow
    def wf() -> List[str]:
-        return map_task(say_hello)(name=["abc", "def"])
+        return map(say_hello)(name=["abc", "def"])


Map task function call change

Consider if using map() instead of map_task() is intentional as it changes the behavior from using Flyte's map task functionality to Python's built-in map().

Code suggestion

Check the AI-generated fix before applying

Suggested change

return map(say_hello)(name=["abc", "def"])

return map_task(say_hello)(name=["abc", "def"])

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:11Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -575,7 +575,7 @@ def say_hello(name: str) -> str:
        for index, map_input_str in enumerate(list_strs):
            monkeypatch.setenv("BATCH_JOB_ARRAY_INDEX_VAR_NAME", "name")
            monkeypatch.setenv("name", str(index))
-            t = map_task(say_hello)
+            t = map(say_hello)


Potential task mapping behavior change

Consider if using map() instead of map_task() is intentional as this could change the behavior of task mapping functionality.

Code suggestion

Check the AI-generated fix before applying

Suggested change

t = map(say_hello)

t = map_task(say_hello)

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:12Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -410,7 +410,7 @@ def test_serialization_metadata(serialization_settings):
    def t1(a: int) -> int:
        return a + 1

-    arraynode_maptask = map_task(t1, metadata=TaskMetadata(retries=2))
+    arraynode_maptask = map(t1, metadata=TaskMetadata(retries=2))


Function rename may affect compatibility

Consider if changing from map_task to map could impact backward compatibility. The function name change from map_task to map may affect existing code that imports and uses the original function name.

Code suggestion

Check the AI-generated fix before applying

Suggested change

arraynode_maptask = map(t1, metadata=TaskMetadata(retries=2))

# Maintain both for backward compatibility

arraynode_maptask = map_task(t1, metadata=TaskMetadata(retries=2))

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:13Z

tests/flytekit/unit/core/test_array_node_map_task.py

+    t1 = map(say_hello, **kwargs1)
+    t2 = map(say_hello, **kwargs2)


Verify intended function call change

Consider if replacing map_task with map is intentional as this changes the function being called which could affect functionality. The map_task decorator appears to be imported but not used after this change.

Code suggestion

Check the AI-generated fix before applying

Suggested change

t1 = map(say_hello, **kwargs1)

t2 = map(say_hello, **kwargs2)

t1 = map_task(say_hello, **kwargs1)

t2 = map_task(say_hello, **kwargs2)

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:14Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -316,7 +316,7 @@ def test_bounded_inputs_vars_order(serialization_settings):
    def task1(a: int, b: float, c: str) -> str:
        return f"{a} - {b} - {c}"

-    mt = map_task(functools.partial(task1, c=1.0, b="hello", a=1))
+    mt = map(functools.partial(task1, c=1.0, b="hello", a=1))


Consider using map_task instead of map

Consider using map_task() instead of map() as it appears to be the intended function based on the test context and imports. Using map() could lead to unexpected behavior since it's a built-in Python function.

Code suggestion

Check the AI-generated fix before applying

Suggested change

mt = map(functools.partial(task1, c=1.0, b="hello", a=1))

mt = map_task(functools.partial(task1, c=1.0, b="hello", a=1))

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:15Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -492,7 +492,7 @@ def test_supported_node_type():
    def test_task():
        ...

-    map_task(test_task)
+    map(test_task)


Consider using map_task instead of map

The function call has been changed from map_task(test_task) to map(test_task). This could potentially cause confusion with Python's built-in map() function. Consider using the imported map_task decorator/function to maintain clarity and avoid potential naming conflicts.

Code suggestion

Check the AI-generated fix before applying

Suggested change

map(test_task)

map_task(test_task)

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:16Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -533,7 +533,7 @@ def consume_directories(dirs: List[FlyteDirectory]):
            for path_info, other_info in d.crawl():
                print(path_info)

-    mt = map_task(generate_directory, min_success_ratio=0.1)
+    mt = map(generate_directory, min_success_ratio=0.1)


Verify map function usage intention

Consider if using map() instead of map_task() is intentional as it may change the expected behavior. The map_task() function is typically used for array node map tasks in Flytekit.

Code suggestion

Check the AI-generated fix before applying

Suggested change

mt = map(generate_directory, min_success_ratio=0.1)

mt = map_task(generate_directory, min_success_ratio=0.1)

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-04T13:26:17Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -575,7 +575,7 @@ def say_hello(name: str) -> str:
        for index, map_input_str in enumerate(list_strs):
            monkeypatch.setenv("BATCH_JOB_ARRAY_INDEX_VAR_NAME", "name")
            monkeypatch.setenv("name", str(index))
-            t = map_task(say_hello)
+            t = map(say_hello)


Consider using map_task instead of map

Consider using map_task instead of map as it appears to be the intended decorator based on the imports and test context. The map function could be confused with Python's built-in map function.

Code suggestion

Check the AI-generated fix before applying

Suggested change

t = map(say_hello)

t = map_task(say_hello)

Code Review Run #d47fe6

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

Signed-off-by: Chih Tsung Lu <[email protected]>

Signed-off-by: lu00122 <[email protected]> Signed-off-by: Chih Tsung Lu <[email protected]>

flyte-bot · 2025-02-05T08:45:19Z

Code Review Agent Run #99b31d

Actionable Suggestions - 8

tests/flytekit/unit/core/test_array_node_map_task.py - 4
- Parameter type mismatch in task call · Line 319-319
- Parameter type mismatch in task call · Line 319-319
- Consider using map_task for array testing · Line 307-309
- Consider using more specific function name · Line 461-475
flytekit/remote/remote.py - 1
- Parameter rename may break compatibility · Line 1554-1554
tests/flytekit/unit/core/test_node_creation.py - 1
- Consider using map_task for workflow testing · Line 276-276
tests/flytekit/unit/remote/test_remote.py - 1
- Consider using map_task for consistency · Line 729-729
flytekit/core/launch_plan.py - 1
- Refactor init method signature · Line 339-339

Additional Suggestions - 10

flytekit/core/options.py - 4
- Consider adding concurrency parameter validation · Line 26-27
- Consider adding validation for concurrency parameter · Line 38-38
- Consider using standard deprecation decorator pattern · Line 43-66
- Consider consolidating duplicate warning message · Line 48-64
flytekit/models/execution.py - 1
- Consider adding property setter for deprecation · Line 290-302
flytekit/clis/sdk_in_container/run.py - 1
- Consider updating deprecated parameter name · Line 529-529
tests/flytekit/unit/core/test_node_creation.py - 1
- Consider using more specific map function · Line 276-276
tests/flytekit/unit/core/test_array_node_map_task.py - 3
- Consider using map_task for clarity · Line 390-402
- Consider using explicit map_task name · Line 83-84
- Consider using map_task for clarity · Line 100-100

Review Details

Files reviewed - 24 · Commit Range: 87dfe2f..09755a2
- flytekit/__init__.py
- flytekit/clis/sdk_in_container/run.py
- flytekit/core/array_node_map_task.py
- flytekit/core/launch_plan.py
- flytekit/core/options.py
- flytekit/models/execution.py
- flytekit/models/launch_plan.py
- flytekit/remote/entities.py
- flytekit/remote/remote.py
- flytekit/tools/translator.py
- plugins/flytekit-k8s-pod/tests/test_pod.py
- plugins/flytekit-papermill/tests/test_task.py
- tests/flytekit/integration/remote/workflows/basic/array_map.py
- tests/flytekit/integration/remote/workflows/basic/pydantic_wf.py
- tests/flytekit/unit/core/test_array_node.py
- tests/flytekit/unit/core/test_array_node_map_task.py
- tests/flytekit/unit/core/test_artifacts.py
- tests/flytekit/unit/core/test_interface.py
- tests/flytekit/unit/core/test_launch_plan.py
- tests/flytekit/unit/core/test_node_creation.py
- tests/flytekit/unit/core/test_partials.py
- tests/flytekit/unit/core/test_type_hints.py
- tests/flytekit/unit/remote/test_remote.py
- tests/flytekit/unit/types/directory/test_listdir.py
Files skipped - 0
Tools
- Whispers (Secret Scanner) - ✔︎ Successful
- Detect-secrets (Secret Scanner) - ✔︎ Successful
- MyPy (Static Code Analysis) - ✔︎ Successful
- Astral Ruff (Static Code Analysis) - ✔︎ Successful

AI Code Review powered by

flyte-bot · 2025-02-05T09:14:06Z

tests/flytekit/unit/core/test_array_node_map_task.py

@@ -315,7 +316,7 @@ def test_bounded_inputs_vars_order(serialization_settings):
    def task1(a: int, b: float, c: str) -> str:
        return f"{a} - {b} - {c}"

-    mt = map_task(functools.partial(task1, c=1.0, b="hello", a=1))
+    mt = map(functools.partial(task1, c=1.0, b="hello", a=1))


Parameter type mismatch in task call

The function call parameters c=1.0, b="hello", a=1 appear to have mismatched types with the task definition. The task expects a: int, b: float, c: str but receives c as float, b as string, and a as int. Consider adjusting the parameter types to match the task signature.

Code suggestion

Check the AI-generated fix before applying

Suggested change

mt = map(functools.partial(task1, c=1.0, b="hello", a=1))

mt = map(functools.partial(task1, c="1.0", b=1.0, a=1))

Code Review Run #99b31d

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-05T09:14:07Z

flytekit/remote/remote.py

@@ -1551,7 +1551,7 @@ def _execute(
                    annotations=options.annotations,
                    raw_output_data_config=options.raw_output_data_config,
                    auth_role=None,
-                    max_parallelism=options.max_parallelism,
+                    concurrency=options.concurrency,


Parameter rename may break compatibility

Consider verifying if renaming max_parallelism to concurrency maintains backward compatibility. This change could potentially break existing code that relies on the max_parallelism parameter.

Code suggestion

Check the AI-generated fix before applying

Suggested change

concurrency=options.concurrency,

concurrency=options.max_parallelism if hasattr(options, 'max_parallelism')

else options.concurrency,

# TODO: Remove max_parallelism support in next major version

# Deprecated in favor of concurrency parameter

Code Review Run #99b31d

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-05T09:14:08Z

tests/flytekit/unit/core/test_node_creation.py

@@ -273,7 +273,7 @@ def t1(a: str) -> str:

    @workflow
    def my_wf(a: typing.List[str]) -> typing.List[str]:
-        mappy = map_task(t1)
+        mappy = map(t1)


Consider using map_task for workflow testing

Consider using map_task instead of map as it appears to be the intended function based on the test context. The map function may not provide the same task mapping functionality needed for workflow testing.

Code suggestion

Check the AI-generated fix before applying

Suggested change

mappy = map(t1)

mappy = map_task(t1)

Code Review Run #99b31d

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-05T09:14:10Z

tests/flytekit/unit/remote/test_remote.py

@@ -726,7 +726,7 @@ def t1(x: int, y: int) -> int:

    @workflow
    def w() -> int:
-        return map_task(partial(t1, y=2))(x=[1, 2, 3])
+        return map(partial(t1, y=2))(x=[1, 2, 3])


Consider using map_task for consistency

Consider using map_task instead of map as it appears to be testing map task functionality based on the test name and context.

Code suggestion

Check the AI-generated fix before applying

Suggested change

return map(partial(t1, y=2))(x=[1, 2, 3])

return map_task(partial(t1, y=2))(x=[1, 2, 3])

Code Review Run #99b31d

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-05T09:14:11Z

tests/flytekit/unit/core/test_array_node_map_task.py

+    m1 = map(functools.partial(task1, c=param_c))(a=param_a, b=param_b)
+    m2 = map(functools.partial(task2, c=param_c))(a=param_a, b=param_b)
+    m3 = map(functools.partial(task3, c=param_c))(a=param_a, b=param_b)


Consider using map_task for array testing

Consider using map_task instead of map for consistency with the test name and module being tested (test_array_node_map_task.py). The test appears to be validating array node map task functionality.

Code suggestion

Check the AI-generated fix before applying

Suggested change

m1 = map(functools.partial(task1, c=param_c))(a=param_a, b=param_b)

m2 = map(functools.partial(task2, c=param_c))(a=param_a, b=param_b)

m3 = map(functools.partial(task3, c=param_c))(a=param_a, b=param_b)

m1 = map_task(functools.partial(task1, c=param_c))(a=param_a, b=param_b)

m2 = map_task(functools.partial(task2, c=param_c))(a=param_a, b=param_b)

m3 = map_task(functools.partial(task3, c=param_c))(a=param_a, b=param_b)

Code Review Run #99b31d

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-05T09:14:11Z

tests/flytekit/unit/core/test_array_node_map_task.py

+        map(test_dynamic)

    @eager
    def test_eager():
        ...

    with pytest.raises(ValueError):
-        map_task(test_eager)
+        map(test_eager)

    @workflow
    def test_wf():
        ...

    with pytest.raises(ValueError):
-        map_task(test_wf)
+        map(test_wf)


Consider using more specific function name

Consider using map_task() instead of map() to avoid confusion with Python's built-in map() function. The change from map_task() to map() could lead to confusion.

Code suggestion

Check the AI-generated fix before applying

- map(test_dynamic) + map_task(test_dynamic) @@ -468,1 +468,1 @@ - map(test_eager) + map_task(test_eager) @@ -475,1 +475,1 @@ - map(test_wf) + map_task(test_wf)

Code Review Run #99b31d

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

flyte-bot · 2025-02-05T09:14:12Z

flytekit/core/launch_plan.py

@@ -326,7 +336,8 @@ def get_or_create(
                labels,
                annotations,
                raw_output_data_config,
-                max_parallelism,
+                concurrency=concurrency,


Refactor init method signature

The 'init' method has too many parameters (14 > 5) and is missing docstring and return type annotation.

Code suggestion

Check the AI-generated fix before applying

- def __init__( - self, - name: str, - workflow: _annotated_workflow.WorkflowBase, - parameters: _interface_models.ParameterMap, - fixed_inputs: _literal_models.LiteralMap, - schedule: Optional[_schedule_model.Schedule] = None, - notifications: Optional[List[_common_models.Notification]] = None, - labels: Optional[_common_models.Labels] = None, - annotations: Optional[_common_models.Annotations] = None, - raw_output_data_config: Optional[_common_models.RawOutputDataConfig] = None, - max_parallelism: Optional[int] = None, - security_context: Optional[security.SecurityContext] = None, - trigger: Optional[LaunchPlanTriggerBase] = None, - overwrite_cache: Optional[bool] = None, - auto_activate: bool = False, - ): + @dataclass + class Config: + """Configuration for LaunchPlan initialization.""" + name: str + workflow: _annotated_workflow.WorkflowBase + parameters: _interface_models.ParameterMap + fixed_inputs: _literal_models.LiteralMap + schedule: _schedule_model.Schedule | None = None + notifications: list[_common_models.Notification] | None = None + labels: _common_models.Labels | None = None + annotations: _common_models.Annotations | None = None + raw_output_data_config: _common_models.RawOutputDataConfig | None = None + max_parallelism: int | None = None + security_context: security.SecurityContext | None = None + trigger: LaunchPlanTriggerBase | None = None + overwrite_cache: bool | None = None + auto_activate: bool = False + + def __init__(self, config: Config) -> None:

Code Review Run #99b31d

Is this a valid issue, or was it incorrectly flagged by the Agent?

it was incorrectly flagged

feat: rename map_task and improve task parameters

87dfe2f

- Rename map_task to map for simpler API - Replace min_successes/min_success_ratio with tolerance parameter - Rename max_parallelism to concurrency for consistency

ChihTsungLu requested review from wild-endeavor, kumare3, eapolinario, pingsutw, cosmicBboy, samhita-alla, thomasjpfan and Future-Outlier as code owners February 4, 2025 12:55

flyte-bot reviewed Feb 4, 2025

View reviewed changes

ChihTsungLu added 2 commits February 5, 2025 15:52

Fixed typo and DockerFile

00653e7

Signed-off-by: Chih Tsung Lu <[email protected]>

fix: map implementation in test_array_node_map_task.py

38af43b

Signed-off-by: lu00122 <[email protected]> Signed-off-by: Chih Tsung Lu <[email protected]>

ChihTsungLu force-pushed the master branch from 10c299b to 38af43b Compare February 5, 2025 07:53

Merge branch 'master' into master

09755a2

flyte-bot reviewed Feb 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flytekit: Rename map_task to map, replace min_successes and min_success_ratio with tolerance, rename max_parallelism to concurrency #3107

Flytekit: Rename map_task to map, replace min_successes and min_success_ratio with tolerance, rename max_parallelism to concurrency #3107

ChihTsungLu commented Feb 4, 2025 •

edited by flyte-bot

Loading

flyte-bot commented Feb 4, 2025 •

edited

Loading

Code Review Agent Run #d47fe6

flyte-bot commented Feb 4, 2025 •

edited

Loading

Changelist by Bito

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot Feb 4, 2025

flyte-bot commented Feb 5, 2025 •

edited

Loading

Code Review Agent Run #99b31d

flyte-bot Feb 5, 2025

flyte-bot Feb 5, 2025

flyte-bot Feb 5, 2025

flyte-bot Feb 5, 2025

flyte-bot Feb 5, 2025

flyte-bot Feb 5, 2025

flyte-bot Feb 5, 2025

	from flytekit import FlyteDirectory, FlyteFile, map, task, workflow
	from flytekit import FlyteDirectory, FlyteFile, map_task, task, workflow

	return map(read_file)(file=files)
	return map_task(read_file)(file=files)

	return map(nb_sub_task)(a=[a, a])
	return map_task(nb_sub_task)(a=[a, a])

	from flytekit.core.array_node_map_task import map
	from flytekit.core.array_node_map_task import map_task

	return map(say_hello)(name=["abc", "def"])
	return map_task(say_hello)(name=["abc", "def"])

	arraynode_maptask = map(t1, metadata=TaskMetadata(retries=2))
	# Maintain both for backward compatibility
	arraynode_maptask = map_task(t1, metadata=TaskMetadata(retries=2))

		t1 = map(say_hello, **kwargs1)
		t2 = map(say_hello, **kwargs2)

	mt = map(functools.partial(task1, c=1.0, b="hello", a=1))
	mt = map_task(functools.partial(task1, c=1.0, b="hello", a=1))

	mt = map(generate_directory, min_success_ratio=0.1)
	mt = map_task(generate_directory, min_success_ratio=0.1)

	mt = map(functools.partial(task1, c=1.0, b="hello", a=1))
	mt = map(functools.partial(task1, c="1.0", b=1.0, a=1))

-                    concurrency=options.concurrency,
+                    concurrency=options.max_parallelism if hasattr(options, 'max_parallelism')
+                              else options.concurrency,
+                    # TODO: Remove max_parallelism support in next major version
+                    # Deprecated in favor of concurrency parameter

	return map(partial(t1, y=2))(x=[1, 2, 3])
	return map_task(partial(t1, y=2))(x=[1, 2, 3])

Flytekit: Rename map_task to map, replace min_successes and min_success_ratio with tolerance, rename max_parallelism to concurrency #3107

Are you sure you want to change the base?

Flytekit: Rename map_task to map, replace min_successes and min_success_ratio with tolerance, rename max_parallelism to concurrency #3107

Conversation

ChihTsungLu commented Feb 4, 2025 • edited by flyte-bot Loading

Tracking issue

Why are the changes needed?

What changes were proposed in this pull request?

Known issue

How was this patch tested?

Setup process

Screenshots

Check all the applicable boxes

Related PRs

Docs link

Summary by Bito

flyte-bot commented Feb 4, 2025 • edited Loading

Code Review Agent Run #d47fe6

flyte-bot commented Feb 4, 2025 • edited Loading

Changelist by Bito

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

flyte-bot commented Feb 5, 2025 • edited Loading

Code Review Agent Run #99b31d

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ChihTsungLu commented Feb 4, 2025 •

edited by flyte-bot

Loading

flyte-bot commented Feb 4, 2025 •

edited

Loading

flyte-bot commented Feb 4, 2025 •

edited

Loading

flyte-bot commented Feb 5, 2025 •

edited

Loading