AI learns to pay covert attention

2024-05-23

2024-05-22 University of California, Santa Barbara (UCSB)

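Researchers at the University of California, Santa Barbara have shown that the behavioral signatures of covert attention, previously thought to be specific to primates, may arise from simpler forms of intelligence rather than from advanced brain structures. Using a feedforward convolutional neural network (CNN), they showed that the main hallmarks of covert attention emerge in perceptual tasks even when no explicit attention mechanism is built in. The CNN exhibited these behavioral signatures simply as a result of being trained to detect targets as accurately as possible, with no limited resource imposed on it. The findings suggest that covert attention may also exist in non-primate animals and may rest on comparatively simple processes.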

<Related information>

Emergent human-like covert attention in feedforward convolutional neural networks

Sudhanshu Srivastava, William Yang Wang, Miguel P. Eckstein

Current Biology, Published: January 19, 2024

DOI: https://doi.org/10.1016/j.cub.2023.12.058

Highlights

CNNs predict the influence of predictive cues/contexts on human perceptual accuracy
The CNNs do not incorporate any explicit attention mechanism or limited resources
The CNNs’ cue/context effects are comparable to those of a Bayesian ideal observer
The CNNs might be a viable model for attention in animals with no neocortex

Summary

Covert attention allows the selection of locations or features of the visual scene without moving the eyes. Cues and contexts predictive of a target’s location orient covert attention and improve perceptual performance. The performance benefits are widely attributed to theories of covert attention as a limited resource, zoom, spotlight, or weighting of visual information. However, such concepts are difficult to map to neuronal populations. We show that a feedforward convolutional neural network (CNN) trained on images to optimize target detection accuracy and with no explicit incorporation of an attention mechanism, a limited resource, or feedback connections learns to utilize cues and contexts in the three most prominent covert attention tasks (Posner cueing, set size effects in search, and contextual cueing) and predicts the cue/context influences on human accuracy. The CNN’s cueing/context effects generalize across network training schemes, to peripheral and central pre-cues, discrimination tasks, and reaction time measures, and critically do not vary with reductions in network resources (size). The CNN shows comparable cueing/context effects to a model that optimally uses image information to make decisions (Bayesian ideal observer) but generalizes these effects to cue instances unseen during training. Together, the findings suggest that human-like behavioral signatures of covert attention in the three landmark paradigms might be an emergent property of task accuracy optimization in neuronal populations without positing limited attentional resources. The findings might explain recent behavioral results showing cueing and context effects across a variety of simple organisms with no neocortex, from archerfish to fruit flies.
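To make the idea of a Bayesian ideal observer in a Posner cueing task concrete, here is a minimal sketch. It is not the authors' model: it assumes two locations, a scalar Gaussian response per location instead of images, and hypothetical parameters (`d_prime`, `validity`, `n_trials`). The observer has no limited resource; it only weights each location's likelihood by the probability that the target appears there.

```python
import math
import random


def likelihood_ratio(x_cued, x_uncued, d_prime, validity):
    """Likelihood ratio of 'target present' vs 'target absent'.

    Assumes unit-variance Gaussian responses with mean d_prime at the
    target location and mean 0 elsewhere (an illustrative assumption).
    """
    lr_cued = math.exp(d_prime * x_cued - d_prime ** 2 / 2)
    lr_uncued = math.exp(d_prime * x_uncued - d_prime ** 2 / 2)
    # Each location is weighted by its prior probability of holding the target.
    return validity * lr_cued + (1 - validity) * lr_uncued


def simulate(n_trials=20000, d_prime=1.5, validity=0.8, seed=0):
    """Return the proportion of 'present' responses per trial type."""
    rng = random.Random(seed)
    counts = {"valid": [0, 0], "invalid": [0, 0], "absent": [0, 0]}
    for _ in range(n_trials):
        if rng.random() < 0.5:  # target present on half of the trials
            kind = "valid" if rng.random() < validity else "invalid"
        else:
            kind = "absent"
        x_cued = rng.gauss(d_prime if kind == "valid" else 0.0, 1.0)
        x_uncued = rng.gauss(d_prime if kind == "invalid" else 0.0, 1.0)
        # Equal priors for present/absent, so the decision criterion is 1.
        says_present = likelihood_ratio(x_cued, x_uncued, d_prime, validity) > 1.0
        counts[kind][0] += int(says_present)
        counts[kind][1] += 1
    return {kind: yes / total for kind, (yes, total) in counts.items()}


if __name__ == "__main__":
    rates = simulate()
    print("hit rate, valid cue:  ", round(rates["valid"], 3))
    print("hit rate, invalid cue:", round(rates["invalid"], 3))
    print("false alarm rate:     ", round(rates["absent"], 3))
```

In this toy setting the hit rate is higher on validly cued trials than on invalidly cued ones, even though both locations are processed with the same fidelity. The cueing effect comes purely from the prior weighting in the decision rule, which is the kind of account the summary contrasts with limited-resource theories.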
