Reinforcement Learning 5: Методы на основе политики агента
В этом видео разберемся с новой группой методов, которые основаны непосредственно на политике агента. Познакомимся с методом REINFORCE, рассмотрим комбинацию алгоритмов Actor Critic, основанных на значениях, похожих на Policy Gradient и Q-Learning.
In this video, we will understand a new group of methods that are based directly on the agent’s policy. Let’s get acquainted with the REINFORCE method, consider a combination of Actor Critic algorithms based on values similar to Policy Gradient and Q-Learning.
00:00:00 Начало видео
00:01:05 Deep Q-Network (DQN) method
00:03:26 Policy function
00:05:34 Policy Gradients method
00:17:14 Метод REINFORCE
00:23:51 Actor-Critic
00:25:05 A2C (Advantage Actor-Critic)
00:34:00 A3C (Asynchronous Advantage Actor-Critic)
00:45:40 Actor-Critic for continuous action spaces
00:53:25 Actor-Critic: Model
00:56:58 Actor-Critic: Policy and Training
01:10:07 Mountain Car Continuous
01:14:42 Actor-Critic: Гиперпараметры
Ukrainian IT-company. Machine Learning | Data Science | Artificial Intelligence
#artificialintelligence
#MachineLearning #ReinforcementLearning
#ИскусственныйИнтеллект #Машинноеобучение
8 views
5
2
2 months ago 00:02:02 12
DayZ Update Teaser
2 months ago 00:38:19 1
L’horreur existentielle de l’usine à trombones.
2 months ago 00:01:29 1
Introducing the World’s Coolest Humanoid Robot — EngineAI SE01!
2 months ago 00:25:48 1
What do tech pioneers think about the AI revolution? - BBC World Service
2 months ago 00:19:32 1
How to Make a Carbon Fiber Car Bonnet/Hood - Part 2/3 : Resin Infusion
2 months ago 00:01:44 1
Unitree Introducing | Unitree G1 Humanoid Agent | AI Avatar | Price from $16K
2 months ago 00:02:50 1
LimX Dynamics’ Biped Robot P1 Conquers the Wild Based on Reinforcement Learning
2 months ago 00:32:28 1
Free English Class! Topic: Our Daily Routines! 🐕⏰🥙 (Lesson Only)
2 months ago 00:11:18 1
Learn How To Talk About Your Daily Routine in English Part 2
2 months ago 00:07:45 1
Learn How To Talk About Your Daily Routine in English by Watching Me Act Out Mine
2 months ago 00:14:45 1
Как установить Stable Diffusion 3.5 Large и Turbo на компьютер? Пошаговая инструкция для Windows.
2 months ago 00:02:15 1
Number and Counting song | Learn Counting to 1000 | Math for 2nd Grade | Kids Academy
2 months ago 00:01:58 1
Morse Code Alphabet Receiving Practice (1)
2 months ago 00:09:12 3
Learn English Through Story Level 1, Graded Reader Level 1, Stories Short Beginners, Basic English
2 months ago 00:29:52 6
[LeatherCraft] Baguette Bag 4K / FREE PDF PATTERN
2 months ago 00:59:40 1
’Little Learning Machines’ Postmortem: A Game About Training Neural Networks
2 months ago 01:08:43 1
#dobetter Podcast Episode 4: Learned Behavior during Extinction
2 months ago 00:01:11 3
MEVIUS: A Quadruped Robot Easily Constructed through E-Commerce (Humanoids 2024)
3 months ago 00:11:46 1
Jobs and Occupations - Vocabulary for Kids - Compilation
3 months ago 00:02:28 1
Yamaha | Artist Profile | Krissy Morash of Escuela Grind
3 months ago 00:04:08 1
Wild animals for kids - Vocabulary for kids
3 months ago 00:10:12 1
Let’s Learn English Around the House and Home | English Video with Subtitles
3 months ago 00:32:16 1
The Teacher Series #10
3 months ago 00:03:30 1
The Ancient Egypt - 5 things you should know - History for kids