Table 2 The relative improvement rate (%) of triple input vs. ablated double input. For example, the “First-person” scores are computed from KMeans-123 and KMeans-23.

From: Lifelogging caption generation via fourth-person vision in a human–robot symbiotic environment

 

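|               | Method | BLEU-1 | BLEU-2 | BLEU-3 | BLEU-4 | ROUGE-L | METEOR | CIDEr-D | SPICE |
|---------------|--------|--------|--------|--------|--------|---------|--------|---------|-------|
| First-person  | KMeans | +4.5   | +7.9   | +8.6   | +8.1   | +6.3    | +12.9  | +88.1   | +28.7 |
| Second-person | KMeans | +7.0   | +9.4   | +12.4  | +15.4  | +6.2    | +7.7   | +11.0   | +4.7  |
| Third-person  | KMeans | +4.2   | +6.9   | +9.5   | +11.7  | +3.2    | +4.2   | +8.6    | +3.1  |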

  1. Highest values are in italic
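
The caption does not state the formula behind the relative improvement rate. A minimal sketch follows, assuming the usual definition: the percent change of the triple-input score over the ablated double-input score. The function name `relative_improvement` and the two example scores are hypothetical and are not taken from the paper.

```python
def relative_improvement(triple_score: float, double_score: float) -> float:
    """Percent change of the triple-input score relative to the
    ablated double-input score (assumed definition, not from the paper)."""
    if double_score == 0:
        raise ValueError("double-input score must be non-zero")
    return (triple_score - double_score) / double_score * 100.0


# Hypothetical metric values for illustration only, e.g. one metric
# evaluated for KMeans-123 (all three views) and KMeans-23 (first-person removed).
kmeans_123 = 0.50
kmeans_23 = 0.40

# Format with an explicit sign and one decimal, as in the table.
print(f"{relative_improvement(kmeans_123, kmeans_23):+.1f}")
```

Under this assumed definition, a positive value means that adding the named view to the other two raised the metric, which matches the sign convention of every entry in the table.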