GLOM: How to represent part-whole hierarchies in a neural network (Geoff Hinton’s Paper Explained)
#glom #hinton #capsules
Geoffrey Hinton describes GLOM, a Computer Vision model that combines transformers, neural fields, contrastive learning, capsule networks, denoising autoencoders and RNNs. GLOM decomposes an image into a parse tree of objects and their parts. However, unlike previous systems, the parse tree is constructed dynamically and differently for each input, without changing the underlying neural network. This is done by a multi-step consensus algorithm that runs over different levels of abstraction at each location of an image simultaneously. GLOM is just an idea for now but suggests a radically new approach to AI visual scene understanding.
OUTLINE:
0:00 - Intro & Overview
3:10 - Object Recognition as Parse Trees
5:40 - Capsule Networks
8:00 - GLOM Architecture Overview
13:10 - Top-Down and Bottom-Up communication
18:30 - Emergence of Islands
22:00 - Cross-Column Attention Mechanism
27:10 - My Improvements for the Attention Mechanism
35:25 - Some Design Decisions
43:25 - Training GLOM as a D
1 view
41
0
2 months ago 00:02:42 1
Hannes, waterbaby - Stockholmsvy (lyrics)
10 months ago 00:13:41 1
9 Impossible Phrasal Verbs…EXPLAINED!
1 year ago 01:00:13 1
Baldur’s Gate 3 Best Multiclass for Tactician Difficulty | Tier List Ranking every Multiclass in BG3
1 year ago 02:59:21 1
KARL FRISTON - INTELLIGENCE 3.0
2 years ago 00:07:27 1
Wall Street Journal: Sverige är Europas mordhuvudstad
2 years ago 00:06:41 1
Jordan Peterson And Ben Shapiro Are ‘Smart Guys’ For Dumb People
4 years ago 01:24:01 12
Paper Review - GLOM: How to Represent Part-Whole Hierarchies in a Neural Network by Geoffrey Hinton
4 years ago 01:03:26 1
GLOM: How to represent part-whole hierarchies in a neural network (Geoff Hinton’s Paper Explained)
9 years ago 00:15:26 1
Smokey Purple Eyes - Linda Hallberg Make up tutorials