HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning (w/ Author)
#hypertransformer #metalearning #deeplearning
This video contains a paper explanation and an interview with author Andrey Zhmoginov!
Few-shot learning is an interesting sub-field in meta-learning, with wide applications, such as creating personalized models based on just a handful of data points. Traditionally, approaches have followed the BERT approach where a large model is pre-trained and then fine-tuned. However, this couples the size of the final model to the size of the model that has been pre-trained. Similar problems exist with “true“ meta-learners, such as MaML. HyperTransformer fundamentally decouples the meta-learner from the size of the final model by directly predicting the weights of the final model. The HyperTransformer takes the few-shot dataset as a whole into its context and predicts either one or multiple layers of a (small) ConvNet, meaning its output are the weights of the convolution filters. Interestingly, and with the correct engineering care, this actually appears to deliver promisin
6 views
10
0
3 months ago 00:48:20 2
Project Blue Beam: Staging a Fake Alien Attack to Take Over the World
4 months ago 00:25:30 1
Man Builds Hyperrealistic RC Truck at Scale Only Using PVC | by @MAN_Creative_86
6 months ago 00:02:09 7
Pedigree - Adoptable (case study)
6 months ago 00:33:23 1
JESZCZE 90 DNI
10 months ago 00:40:17 1
What If Police Actually Used Supercars?
1 year ago 00:02:03 1
Sectes/Loges : Synthèse par Léon de Poncins
1 year ago 00:59:29 1
Bugsy - Jackies Music House Session Podcast #168
1 year ago 00:22:50 1
SSL X Joe Carrell: Mixing with the 4K E plug-in, SSL 360° and the SSL Complete plug-in catalogue
1 year ago 00:01:44 1
The Line | Official Trailer for the Oculus Quest
1 year ago 00:13:19 1
Stable Warpfusion Tutorial: Turn Your Video to an AI Animation