What changes need to be made to adapt divided space-time attention into a joint space-time attention model?