Add loop for GPU multithreading; add some timing.