Content
Tag
1 entry tagged Research · 1 term.
A neural network architecture, introduced in 2017, that uses attention mechanisms to process sequences of tokens in parallel and powers every modern large language model.