Systems, devices and methods are described that use object motion appearing in pre-capture images to perform 3D reconstruction of a scene. Objects may be segmented and tracked within the pre-capture images using image processing techniques such as image segmentation and/or object recognition. The image processing results may then be used to automatically tag subsequently captured images. The image processing results may also be used to interactively control an imaging device's focusing mechanism prior to image capture.