Pretrained Transformers as Universal Computation Engines (Machine Learning Research Paper Explained)
#universalcomputation #pretrainedtransformers #finetuning
Large-scale pre-training and subsequent fine-tuning is a common recipe for success with transformer models in machine learning. However, most such transfer learning is done when a model is pre-trained on the same or a very similar modality to the final task to be solved. This paper demonstrates that transformers can be fine-tuned to completely different modalities, such as from language to vision. Moreover, they demonstrate that this can be done by freezing all attention layers, tuning less than .1% of all parameters. The paper further claims that language modeling is a superior pre-training task for such cross-domain transfer. The paper goes through various ablation studies to make its point.
OUTLINE:
0:00 - Intro & Overview
2:00 - Frozen Pretrained Transformers
4:50 - Evaluated Tasks
10:05 - The Importance of Training LayerNorm
17:10 - Modality Transfer
25:10 - Network Architecture Ablation
26:10 - Evaluation of the Attention Mask
27:20 - Are FPTs
17 views
13
5
8 months ago 02:42:27 0
Дмитрий Нестерук «Разработка с использованием искусственного интеллекта»
10 months ago 00:44:02 0
O mínimo que você precisa saber sobre IA pra sobreviver ao Hype
11 months ago 01:56:20 2
Let’s build GPT: from scratch, in code, spelled out.
12 months ago 00:34:34 2
chatGPT помогает писать код
12 months ago 00:28:41 0
Quelques GPT’s qui sont entrainés à décrypter les Conspirations
1 year ago 00:04:16 0
Exploring the Wild West Through AI: 84 Epic Digital Artworks. A must-see!
1 year ago 00:04:52 0
Experience the Wild West Like Never Before: 97 AI Artworks. A must-see!
1 year ago 00:21:58 2
“ПАСХА ИЛИ ПЕСАХ? КАКАЯ РОЛЬ МОИСЕЯ?
1 year ago 00:16:00 0
Заработок в интернете на ChatGPT и Canva в 2024
1 year ago 00:34:12 0
Модель ChatGPT. Как она делает то, что делает? Часть 1.
1 year ago 00:17:58 0
AI superpowered networks? (NVIDIA and Cisco join forces)
1 year ago 00:09:09 1
Нейросеть Sora которая ГЕНЕРИРУЕТ ВИДЕО от OpenAI
1 year ago 00:08:04 0
DOBB-E: 6D General AI Robot Breakthrough (109 TASKS, 5620 TRAJECTORIES, 1,500,000 FRAMES)
2 years ago 00:03:11 0
Как зарегистрироваться в чате GPT за 5 минут в России. Самая простая регистрация. Chat GPT от Openai
2 years ago 00:17:21 0
Actual Objects Presents: Voice To Skull
2 years ago 00:05:07 2
How to use Stable Diffusion XL with low VRAM ComfyUI | Generate amazing images with Automatic 1111🌟
2 years ago 00:13:36 1
Что вы думаете о машинном/искусственном интеллекте?
2 years ago 00:02:33 1
Чат GPT-Как искусственный интеллект становится нашим личным помощником.
2 years ago 00:00:44 3
3DGPT - your 3D printing friend & collaborator!
2 years ago 01:02:51 0
Алексей Скрынник | Демонстрации и трансформеры в RL
2 years ago 00:12:36 0
Оптимизация для поисковых систем с помощью чата GPT🤖
2 years ago 05:43:41 51
Create a Large Language Model from Scratch with Python – Tutorial
2 years ago 00:21:05 2
ChatGPT. Секретное оружие писателей: Как GPT стал незаменимым инструментом для авторов
2 years ago 00:04:38 1
Расширение для ChatGPT «editGPT» – делайте Ваши тексты более эффективными!