Transformer

Revision as of 08:23, 26 August 2018 by BPeat


Attention-based models "attend" to specific parts of the input (an image or text) in sequence, one after another. By relying on a sequence of glances, they capture (visual) structure. This contrasts with other (machine vision) techniques that process a whole input, e.g. an image, in a single forward pass.
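
The idea of a sequence of glances can be illustrated with a minimal sketch in plain Python. It is not taken from any particular model: the function names `take_glimpse` and `glimpse_sequence` are hypothetical, and the rule for choosing where to look next (the brightest pixel not yet seen) is a hand-written assumption standing in for what would normally be a learned attention policy.

```python
def take_glimpse(image, center, size):
    """Crop a size x size patch around center (row, col), shifted to stay inside the image."""
    rows, cols = len(image), len(image[0])
    top = min(max(center[0] - size // 2, 0), rows - size)
    left = min(max(center[1] - size // 2, 0), cols - size)
    patch = [row[left:left + size] for row in image[top:top + size]]
    return top, left, patch


def glimpse_sequence(image, size=3, steps=3):
    """Inspect the image one small patch at a time instead of all at once.

    ASSUMPTION: "look at the brightest unseen pixel" is an illustrative
    stand-in for a learned policy that decides where to attend next.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    glimpses = []
    for _ in range(steps):
        # Candidate locations are all pixels not covered by an earlier glimpse.
        candidates = [(image[r][c], r, c)
                      for r in range(rows) for c in range(cols)
                      if not seen[r][c]]
        if not candidates:
            break
        _, r, c = max(candidates, key=lambda item: item[0])
        top, left, patch = take_glimpse(image, (r, c), size)
        # Mark the patch as seen so the next glance moves elsewhere.
        for rr in range(top, top + size):
            for cc in range(left, left + size):
                seen[rr][cc] = True
        glimpses.append(((r, c), patch))
    return glimpses


# Toy 6 x 6 "image" with two bright spots.
image = [[0] * 6 for _ in range(6)]
image[1][1] = 5
image[4][4] = 9

centers = [center for center, _ in glimpse_sequence(image, size=3, steps=2)]
print(centers)  # [(4, 4), (1, 1)]
```

Each glance sees only a 3 x 3 patch, so the structure of the image is assembled over several steps, whereas a single-pass technique would consume all 36 pixels at once.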