Skip to content

Refactor log split to improve performance #166

@david-chapela

Description

@david-chapela

Currently, the log split is performed by iterating trace by trace, adding it to split_1, until the number of events in split_1 is enough given the specified partition ratio (e.g., 80%-20%).

This process could be done following an approach that selects the % of cases matching the partition ratio, checks whether this is a good event-wise partition, and if not, adjusts the selected partition by adding/subtracting half of the remaining cases, if not enough, again adjust by half. In this way, the operation should converge faster, especially for event logs with many cases (like CallCentre), which currently are taking too long for this.

Metadata

Metadata

Assignees

Labels

performanceTemporal/memory performance of the algorithm

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions