Thanks for your awesome work! @donglee-afar
I have two questions about hdfs_train, hdfs_test_normal, and hdfs_test_abnormal:
-
How to get them from the whole dataset? I mean, how to divide the whole dataset into train and test after we already have an event id sequence for each BlockId in the HDFS log?
-
I learn from data/hdfs/gen_train_data.py that, hdfs_train contains only normal data. I wonder if I'm right.
Looking forward to your reply!
Thank you!