[ci] init e2e test struct #1269
base: main
Conversation
```yaml
repo_org:
  required: false
  description: 'Tested repository organization name. Default is InternLM'
  type: string
  default: 'InternLM/xtuner'
```
Why is this argument required?
```python
def execute_task(self, task_config: Dict[str, Any]) -> Dict[str, Any]:
    resource = task_config.get("resource", {})
    command = task_config.get("command", "")
```
Suggest directly returning False here.
OK, that makes sense.
```python
self.cluster = cluster
self.params_cls = params_cls

def execute_task(self, task_config: Dict[str, Any]) -> Dict[str, Any]:
```
Suggest declaring a TypedDict or Pydantic BaseModel here for easier checking of config correctness.
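A minimal sketch of the reviewer's suggestion, written as a free function rather than a method; the `resource` and `command` keys come from the snippet above, everything else is an assumption:

```python
from typing import Any, Dict, TypedDict


class TaskConfig(TypedDict, total=False):
    """Typed view of the task configuration dict.

    `resource` and `command` appear in the diff above; `total=False`
    keeps both keys optional, matching the .get(...) defaults.
    """

    resource: Dict[str, Any]
    command: str


def execute_task(task_config: TaskConfig) -> Dict[str, Any]:
    # Static checkers (mypy/pyright) can now flag typos in key names
    # and wrong value types at analysis time.
    resource = task_config.get("resource", {})
    command = task_config.get("command", "")
    return {"resource": resource, "command": command}
```

A Pydantic `BaseModel` would additionally validate the dict at runtime, at the cost of a third-party dependency.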
```python
self.cluster = cluster
self.params_cls = params_cls

def execute_task(self, task_config: Dict[str, Any]) -> Dict[str, Any]:
```
Mismatched return statement.
```yaml
- type: sft
  parameters:
    config: /mnt/shared-storage-user/llmrazor-share/qa-llm-cicd/xtuner-fork/autotest/config/qwen3.py
```
Suggest using a relative path.
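One way to apply the reviewer's suggestion: resolve config files relative to the test module instead of hard-coding the shared-storage mount. The directory layout assumed here (configs under a `config/` directory next to this file) is hypothetical:

```python
from pathlib import Path

# Hypothetical layout: this file lives under autotest/,
# config files under autotest/config/.
AUTOTEST_DIR = Path(__file__).resolve().parent


def config_path(name: str) -> str:
    """Build an absolute path from a config file name, keeping the
    repository relocatable across machines and CI runners."""
    return str(AUTOTEST_DIR / "config" / name)
```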
```python
config = get_config()
case_list = config["case"]

if type == "all":
```
`type` is a Python built-in; suggest renaming it.
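A rename along the lines the reviewer suggests; `case_type` is one possible name, and the filtering logic is a sketch since the full function body is not shown (the inline config stands in for `get_config()`):

```python
from typing import Any, Dict, List


def get_case_list(case_type: str = "all") -> List[Dict[str, Any]]:
    """Return test cases, optionally filtered by case type.

    Renamed from `type` so the built-in type() is not shadowed.
    """
    # Hypothetical inline config standing in for get_config().
    config = {"case": [{"type": "sft"}, {"type": "rl"}, {"type": "pretrain"}]}
    case_list = config["case"]
    if case_type == "all":
        return case_list
    return [c for c in case_list if c.get("type") == case_type]
```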
```yaml
- HF_DATASETS_OFFLINE=1
- HF_HUB_OFFLINE=1

case:
```
The hierarchy here seems unusual. From the original semantics, the xtuner repository should have the following test levels:
- Task: sft & pretrain & rl
- Training config: config path
- Resource: test resource

Xtuner's configuration files can determine the first two, while the last one is determined by clusterx's configuration. What do you think about adjusting the hierarchy like this:

```yaml
case:
  - task: <rl | sft | pretrain>
    config: <config path>
    assert_info: <...>
    resources: [<resource_name, including env>]
```

Then `resources` should contain test resources like gpu, npu, etc., under different names. In each case item, you can select which platforms and resource specifications to test on.
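The proposed hierarchy could be expanded into concrete jobs roughly like this; the resource catalogue and field names are assumptions, standing in for whatever clusterx's configuration provides:

```python
from typing import Any, Dict, Iterator, List

# Hypothetical resource catalogue; in the proposal this would come
# from clusterx's configuration, keyed by resource name.
RESOURCES: Dict[str, Dict[str, Any]] = {
    "gpu-8": {"platform": "gpu", "num": 8},
    "npu-8": {"platform": "npu", "num": 8},
}


def expand_cases(cases: List[Dict[str, Any]]) -> Iterator[Dict[str, Any]]:
    """Cross each case item with the resources it selects, yielding
    one concrete job per (case, resource) pair."""
    for case in cases:
        for name in case.get("resources", []):
            yield {**case, "resource": RESOURCES[name]}
```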
```python
return env_config


def get_case_list(type: str = "all"):
```
Maybe we need to consider how to handle platform-specific test cases.
Co-authored-by: Mashiro <[email protected]>
Initializing the testing framework allows quick addition of sft, rl, and pretrain related test cases through configuration.
On the quality side, the focus is on developing the code for the validation and regression points themselves (train, infer, eval, and so on).