>looked closely and I think what you did is perfect for GPU
Really? The random series generation is pretty straight forward. But the permutation stage is all out-of-order memory access. And the decrypt stage is very heavy in conditional branching.
>still need more target images
https://pastebin.com/Mj4d1jXM