Nice paper! In Algorithm 4, line 14: remove the superscript $new$ from $x^{new}_v=x_v+r_v$. I don't understand the parallelization/scheduling part; I may come back to that later. I have an implementation of the push method here: https://github.com/maxdan94/push and I will compare the speed.