The idea is to add two traces, one with the PREF instruction and one without, and compare their performance (if any difference is expected on a 5-stage MIPS pipeline).