-
Notifications
You must be signed in to change notification settings - Fork 10
Home
Rule-based models are often used for data analysis as they combine interpretability with predictive power. We present RuleKit 2, a versatile tool for rule learning. Based on a sequential covering induction algorithm, it is suitable for classification, regression, and survival problems. The presence of user-guided induction mode facilitates verifying hypotheses concerning data dependencies which are expected or of interest. The powerful and flexible experimental environment allows straightforward investigation of different induction schemes. Unlike the first revision, RuleKit 2 does not depend on RapidMiner. The analysis can be performed in batch mode and through Python package. A documented Java API is also provided for convenience. Running RuleKit as a RapidMiner plugin and R package is no longer supported in version 2.
RuleKit provides latest versions of our algorithms (some of them were initially published as independent packages and integrated later):
- LR-Rules (Wróbel et al, 2017) - survival rules induction,
- GuideR (Sikora et al, 2019) - user-guided induction.
- RuleKit-CS (Gudyś et al, 2024) - contrast set mining.
The software requires Java Development Kit in version 8 to work properly. In Windows one can download the installer from Oracle webpage. In Linux, a system package manager should be used instead. For instance, in Ubuntu 16.04 execute the following command:
sudo apt-get install default-jdk