I think an optimizing compiler will realise that width*y is loop-invariant (it doesn't change between iterations) and hoist the multiplication out of the loop. Try it: generate the assembly and see.
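To make that concrete, here is a minimal sketch of the two forms to compare. Your code isn't shown, so I'm assuming a row-major buffer indexed as width*y + x; the function names and the `pixels` parameter are made up for illustration.

```c
#include <stddef.h>

/* Hypothetical example: sum row y of a row-major image buffer.
 * As written, width * y sits inside the loop body. */
long sum_row_naive(const int *pixels, size_t width, size_t y)
{
    long total = 0;
    for (size_t x = 0; x < width; ++x)
        total += pixels[width * y + x];
    return total;
}

/* The same loop with the invariant product hoisted by hand.
 * This is roughly the transformation an optimizer is expected to apply. */
long sum_row_hoisted(const int *pixels, size_t width, size_t y)
{
    const int *row = pixels + width * y;  /* computed once, before the loop */
    long total = 0;
    for (size_t x = 0; x < width; ++x)
        total += row[x];
    return total;
}
```

With GCC or Clang, something like `gcc -O2 -S file.c` writes the assembly to `file.s`. If the inner loops of the two functions look the same, the compiler did the hoisting for you.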
I wouldn't worry about it though; if that's the slowest point of your program, you're doing very well.