I've finally hit the point where getting a program to run is not enough. I have a working Gaussian process implementation and need to speed it up. Most of the runtime is spent in a conjugate gradient descent with ...