Large Language Models estimate fine-grained human color-concept associations

发布于 2024-06-27  7 次阅读


AI 摘要

这篇文章研究了人类颜色概念关联的细微差异,以及GPT-4这种多模式大型语言模型在估计人类化的颜色概念关联方面的能力。研究人员使用了71种跨越感知色彩空间的颜色集合(UW-71),以及在抽象性方面存在差异的概念,来评估GPT-4生成的关联评分与人类评分之间的相关性。结果表明,GPT-4的评分与人类评分相关,并且在自动估计图像中的颜色概念关联方面表现出与最先进方法相当的水平。研究显示,语言和感知之间的高级协变性在互联网的自然环境中包含足够的信息,以支持学习人类化的颜色概念关联,也证明了学习系统可以在没有初始约束的情况下编码这些关联。这项工作进一步表明,GPT-4可用于高效地估计各种概念的颜色关联分布,可能成为设计有效直观信息可视化的关键工具。

[PDF] [Site] [Kimi]

Concepts, both abstract and concrete, elicit a distribution of association strengths across perceptual color space, which influence aspects of visual cognition ranging from object recognition to interpretation of information visualizations. While prior work has hypothesized that color-concept associations may be learned from the cross-modal statistical structure of experience, it has been unclear whether natural environments possess such structure or, if so, whether learning systems are capable of discovering and exploiting it without strong prior constraints. We addressed these questions by investigating the ability of GPT-4, a multimodal large language model, to estimate human-like color-concept associations without any additional training. Starting with human color-concept association ratings for 71 color set spanning perceptual color space (texttt{UW-71}) and concepts that varied in abstractness, we assessed how well association ratings generated by GPT-4 could predict human ratings. GPT-4 ratings were correlated with human ratings, with performance comparable to state-of-the-art methods for automatically estimating color-concept associations from images. Variability in GPT-4's performance across concepts could be explained by specificity of the concept's color-concept association distribution. This study suggests that high-order covariances between language and perception, as expressed in the natural environment of the internet, contain sufficient information to support learning of human-like color-concept associations, and provides an existence proof that a learning system can encode such associations without initial constraints. The work further shows that GPT-4 can be used to efficiently estimate distributions of color associations for a broad range of concepts, potentially serving as a critical tool for designing effective and intuitive information visualizations.

Hello
最后更新于 2024-08-02