大规模多模态数据集训练 #9268
Unanswered
Lin-Tianwei
asked this question in
Q&A
大规模多模态数据集训练
#9268
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
请教大家,我发现在多模态数据预处理阶段使用
.map()进行处理,其中使用processor对图像进行了resize和normliza等操作。代码如下:因此有两个问题:
map()有cache机制,是否会导致所有处理过的多模态数据进入缓存,占用巨大容量(我的数据集有30T)。Beta Was this translation helpful? Give feedback.
All reactions