Hello, thanks for providing this awesome repository introducing different instruction datasets!
Could you consider adding our CoT Collection dataset? It's a massive instruction dataset consisting of 1.84 million rationales across 1,060 NLP tasks!
https://arxiv.org/abs/2305.14045
Thank you in advance!