Characterization and classification of semantic image-text relations