How can I use the first five layers of CaffeNet, apply an RoI layer to their output, and add a new branch for classification? I want to use the first five layers of CaffeNet and then add an RoI layer, as is done in the paper "Instance-aware representation learning and association for online multi-person tracking". However, I don't know how to set this up.
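To make the goal concrete, here is a rough pure-Python sketch of the structure in question. It is not the paper's implementation or Caffe's own layer. It assumes the RoI step is RoI max pooling in the style of Fast R-CNN (the paper may use a different variant), that the feature map comes from conv5 with a stride of 16, and that the new branch is a single linear layer. The names `roi_max_pool` and `classification_branch`, and all parameters, are hypothetical.

```python
def roi_max_pool(feature_map, roi, output_size=(2, 2), spatial_scale=1.0):
    """Max-pool one region of a 2D feature map down to a fixed-size grid.

    feature_map: H x W list of rows, e.g. a single conv5 channel.
    roi: (x1, y1, x2, y2) box in image coordinates, inclusive.
    spatial_scale: image-to-feature-map scale (assumed 1/16 for conv5).
    """
    height, width = len(feature_map), len(feature_map[0])

    # Project the box onto the feature map and clamp it to the map bounds.
    x1, y1, x2, y2 = (int(round(c * spatial_scale)) for c in roi)
    x1 = min(max(x1, 0), width - 1)
    y1 = min(max(y1, 0), height - 1)
    x2 = min(max(x2, x1), width - 1)
    y2 = min(max(y2, y1), height - 1)
    roi_w = x2 - x1 + 1
    roi_h = y2 - y1 + 1

    out_h, out_w = output_size
    pooled = []
    for i in range(out_h):
        # Bin rows: floor of the start, ceiling of the end, so no bin is empty.
        h_start = y1 + (i * roi_h) // out_h
        h_end = y1 + -(-((i + 1) * roi_h) // out_h)
        row = []
        for j in range(out_w):
            w_start = x1 + (j * roi_w) // out_w
            w_end = x1 + -(-((j + 1) * roi_w) // out_w)
            row.append(max(feature_map[y][x]
                           for y in range(h_start, h_end)
                           for x in range(w_start, w_end)))
        pooled.append(row)
    return pooled


def classification_branch(pooled, weights, biases):
    """Hypothetical new branch: flatten the pooled RoI, apply one linear layer.

    weights: one weight vector per class; biases: one bias per class.
    Returns the raw class scores (no softmax).
    """
    features = [v for row in pooled for v in row]
    return [sum(w * f for w, f in zip(class_w, features)) + b
            for class_w, b in zip(weights, biases)]
```

In a real network the same pooling would run over every conv5 channel and every proposed box, and the branch weights would be learned; with a box given in image coordinates, the call would be something like `roi_max_pool(channel, box, output_size=(6, 6), spatial_scale=1 / 16)`.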