Augmented Reality (AR) is a concept and a set of technologies for merging of real and virtual elements to produce new visualizations – typically a video – where physical and digital objects co-exist and interact in real time. Most AR applications support real-time interaction with content (AR scene with virtual objects) which has been produced in advance or offline. In many cases, for example in ad hoc remote guidance applications, AR interaction over a network is required. AR interaction over a network means, for example, adding virtual objects into a video feed captured in a remote location. AR interaction over networks includes solutions for both: 1) real-time situations, where users are simultaneously interacting with each other and with AR content, and 2) off-line situations, where the users are not simultaneously interacting with each other, but still want to produce or share AR content over a network. Support for remote AR interaction needs to also be available when real-time and offline sessions are following or alternating with each other. This requires that the AR content be produced, stored, and updated seamlessly in successive sessions.