Changing from one central projection to another is more or less fail-safe as long as you keep the center of projection the same: the mapping is then a pure per-pixel remap that does not depend on scene depth. Once you move the point of view (PoV), you need the distance for every pixel, and you also have to reconstruct the areas that become newly visible. This is much more challenging. AFAIK this area of research is called "structure from motion".
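To make the difference concrete, here is a minimal sketch assuming an ideal pinhole camera. The intrinsic matrix `K`, the 10 degree rotation, the 0.2 unit shift, and the helper name `reproject` are all made up for illustration; they are not taken from any particular tool. It maps one pixel into a second camera twice, once at a near depth and once at a far depth:

```python
import numpy as np

def reproject(pixel, depth, K, R, t):
    """Map a pixel from the source camera into a second pinhole camera.

    The second camera sees a source-camera point X as R @ X + t.
    """
    # Back-project the pixel to a ray with z = 1, then scale by the depth.
    ray = np.linalg.inv(K) @ np.array([pixel[0], pixel[1], 1.0])
    X_src = depth * ray
    # Move into the second camera's frame and project back to pixels.
    X_dst = R @ X_src + t
    p = K @ X_dst
    return p[:2] / p[2]

# Hypothetical intrinsics: focal length 800 px, principal point (320, 240).
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])

# Rotate the camera 10 degrees about the vertical axis.
angle = np.deg2rad(10.0)
R = np.array([[np.cos(angle), 0.0, np.sin(angle)],
              [0.0, 1.0, 0.0],
              [-np.sin(angle), 0.0, np.cos(angle)]])

pixel = (400.0, 300.0)

# Same center (t = 0): the result should not depend on the assumed depth.
same_center_near = reproject(pixel, 1.0, K, R, np.zeros(3))
same_center_far = reproject(pixel, 50.0, K, R, np.zeros(3))

# Moved center: the same pixel lands in different places for near and far points.
t = np.array([0.2, 0.0, 0.0])
moved_near = reproject(pixel, 1.0, K, R, t)
moved_far = reproject(pixel, 50.0, K, R, t)

print("same center:", same_center_near, same_center_far)
print("moved center:", moved_near, moved_far)
```

With the center fixed, the depth cancels out in the final division, so no depth information is needed. With a shifted center it does not cancel, which is why per-pixel distances become necessary. The sketch does not attempt to fill in the newly revealed areas.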
Have you tried looking for software that already does this? Microsoft Hyperlapse, for example.