I noticed that the `HateMemesDataset` class sets `text_column_name` to `"plots"`, as follows:
```python
class HateMemesDataset(BaseDataset):
    def __init__(self, *args, split="", missing_info={}, **kwargs):
        assert split in ["train", "val", "test"]
        self.split = split
        if split == "train":
            names = ["hatememes_train"]
        elif split == "val":
            names = ["hatememes_dev"]
        elif split == "test":
            names = ["hatememes_test"]
        super().__init__(
            *args,
            **kwargs,
            names=names,
            text_column_name="plots",
            remove_duplicate=False,
        )
```
However, in `make_arrow` in `write_hatememes.py`, no column named `plots` is defined:
```python
dataframe = pd.DataFrame(
    data_list,
    columns=[
        "image",
        "text",
        "label",
        "split",
    ],
)
```
This may cause the error `KeyError: 'Field "plots" does not exist in schema'` during training. Is this a mistake, or have I misunderstood something?
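To make the mismatch concrete, here is a minimal, standard-library-only sketch. It does not use the repository's code: `ARROW_COLUMNS` just copies the column list passed to `pd.DataFrame` above, and `has_column` and `resolve_text_column` are hypothetical helpers written for illustration. Falling back to `"text"` is an assumption about what the intended column might be, not a confirmed fix.

```python
# Columns written by make_arrow, copied from the pd.DataFrame call above.
ARROW_COLUMNS = ["image", "text", "label", "split"]


def has_column(name, columns=ARROW_COLUMNS):
    """Return True if `name` is one of the columns written to the arrow file."""
    return name in columns


def resolve_text_column(name, columns=ARROW_COLUMNS, fallback="text"):
    """Hypothetical helper: use `name` if it exists, otherwise try `fallback`.

    Raises KeyError (mimicking the message from the issue) if neither exists.
    """
    if name in columns:
        return name
    if fallback in columns:
        return fallback
    raise KeyError(f'Field "{name}" does not exist in schema')


if __name__ == "__main__":
    # The dataset class asks for "plots", which make_arrow never writes.
    print(has_column("plots"))           # False
    # Under the assumption that "text" is the intended column:
    print(resolve_text_column("plots"))  # text
```

If `"text"` is indeed the intended column, changing `text_column_name="plots"` to `text_column_name="text"` in `HateMemesDataset` (or writing a `plots` column in `make_arrow`) would make the two sides agree.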