Fig. 11
From: Lifelogging caption generation via fourth-person vision in a human–robot symbiotic environment
Example captions with the single perspective model and our proposed model. Methods “First”, “Second”, and “Third”: UpDown model with a single image. Method “Ours”: KMeans model with three types of images. All results are generated with beam search decoding.