Hi I tried to run the crfasrnn_demo.py on aws g3.16xlarge gpu instance. but it took about 4 seconds. how to can make it more fast? cheers.