RL Optimization PPO Algorithm - Search Videos

Proximal Policy Optimization (PPO) - How to train Large Language Models

YouTubeLuis Serrano Academy

Proximal Policy Optimization (PPO) - How to train Large Language Models

Reinforcement Learning with Human Feedback (RLHF) is a method used for training Large Language Models (LLMs). In the heart of RLHF lies a very powerful reinforcement learning method called Proximal Policy Optimization. Learn about it in this simple video! This is the first one in a series of 3 videos dedicated to the reinforcement learning ...

83.3K viewsJan 24, 2024

OPEN 6 | MAIN STREAM | EUROPE | CHAMPIONSHIP SUNDAY | RLCS 2026

OPEN 6 | MAIN STREAM | EUROPE | CHAMPIONSHIP SUNDAY | RLCS 2026

YouTubeRocket League Esports

179.6K views2 weeks ago

OPEN 5 | MAIN STREAM | EUROPE | CHAMPIONSHIP SUNDAY | RLCS 2026

OPEN 5 | MAIN STREAM | EUROPE | CHAMPIONSHIP SUNDAY | RLCS 2026

YouTubeRocket League Esports

134.1K views1 month ago

OPEN 4 | MAIN STREAM | EUROPE | GROUP STAGE | RLCS 2026

OPEN 4 | MAIN STREAM | EUROPE | GROUP STAGE | RLCS 2026

YouTubeRocket League Esports

108K views1 month ago

Top videos

L4 TRPO and PPO (Foundations of Deep RL Series)

L4 TRPO and PPO (Foundations of Deep RL Series)

YouTubePieter Abbeel

50.1K viewsAug 25, 2021

Simply Explaining Proximal Policy Optimization (PPO) | Deep Reinforcement Learning

Simply Explaining Proximal Policy Optimization (PPO) | Deep Reinforcement Learning

YouTubeJohnny Code

23.7K viewsApr 11, 2025

4 Months of RL in 4 Hours | Deep Reinforcement Learning Course (PPO, DQN, SAC, A2C)

4 Months of RL in 4 Hours | Deep Reinforcement Learning Course (PPO, DQN, SAC, A2C)

YouTubeMadhav Malhotra

1.1K views4 months ago

Rocket League Montage

Rocket League Montage: Epic Goals & Skills

Rocket League Montage: Epic Goals & Skills

TikTokvision_940

3.4K views4 weeks ago

Rocket League Clip Compilation: Peaks, Mustys & Pop Resets

Rocket League Clip Compilation: Peaks, Mustys & Pop Resets

TikTokjs.muesli

1.6K views3 weeks ago

Rocket League Montage #rocketleague

Rocket League Montage #rocketleague

2.1K views1 month ago

L4 TRPO and PPO (Foundations of Deep RL Series)

L4 TRPO and PPO (Foundations of Deep RL Series)

50.1K viewsAug 25, 2021

YouTubePieter Abbeel

Simply Explaining Proximal Policy Optimization (PPO) | Deep Reinforcement Learning

Simply Explaining Proximal Policy Optimization (PPO) | Deep Reinforcement Learning

23.7K viewsApr 11, 2025

YouTubeJohnny Code

4 Months of RL in 4 Hours | Deep Reinforcement Learning Course (PPO, DQN, SAC, A2C)

4 Months of RL in 4 Hours | Deep Reinforcement Learning Course (PPO, DQN, SAC, A2C)

1.1K views4 months ago

YouTubeMadhav Malhotra

Deep Reinforcement Learning with Proximal Policy Optimization (PPO) with Code example!

Find in video from 09:00Trust Region Policy Optimization (PPO)

Deep Reinforcement Learning with Proximal Policy Optimization (PP…

8.1K viewsJan 15, 2024

YouTubeLuke Ditria

Reinforcement Learning Explained: Model-Free vs Model-Based RL | DQN, PPO, AlphaZero

Reinforcement Learning Explained: Model-Free vs Model-Based RL | DQN, PPO, AlphaZero

281 views4 months ago

[UCLA RL-LLM] Chapter 1.4: Deep policy gradient methods (PPO, GRPO)

[UCLA RL-LLM] Chapter 1.4: Deep policy gradient methods (PPO, GRPO)

2.3K views10 months ago

YouTubeErnest Ryu

UofT RL Course - Lecture 52: PPO Algorithm

UofT RL Course - Lecture 52: PPO Algorithm

77 views5 months ago

YouTubeAli Bereyhi

PPO Algorithm Explained 🤖 | Proximal Policy Optimization in Reinforcement Learning

144 views2 months ago

YouTubeQybrenthak AI Pvt. Ltd.

Proximal Policy Optimization in Reinforcement Learning Simplified

27 views2 months ago

YouTubeRITEC AI Tech

Lecture 18 - Proximal Policy Optimization|Reinforcement Learning Phase | Reasoning LLMs from Scratch

1.7K views10 months ago

PPO Implementation from Scratch | Reinforcement Learning

15.7K viewsDec 7, 2024

YouTubePapers in 100 Lines of Code

GRPO: The Reinforcement Learning Trick That Changed Everything

217 views5 months ago

YouTubemathtartic

What is Proximal Policy Optimization ( PPO)?

88 views6 months ago

YouTubeData Science Made Easy

Proximal Policy Optimization (PPO) Explained | Reinforcement Learning for Game AI

12 views4 months ago

YouTubeSystemDR - Scalable System Design

Proximal Policy Optimization (PPO) & Group Relative Policy Optimization (GRPO) | Paper Explained

5.6K views6 months ago

GRPO Family: Group Relative Policy Optimization RL opt [TIC-GRPO, Scaf-GRPO, XRPO, GRPO-CARE, CPPO]

68 views4 months ago

YouTubeByte Goose AI.

PPO Algorithm in Gaming 🚀 Reinforcement Learning AI Plays Games

73 views4 months ago

YouTubeSystemDR - Scalable System Design

From GRPO to SAMPO: Solving Training Collapse in Agentic RL

1.8K views2 months ago

YouTubeDiscover AI

GDPO Explained: NVIDIA Fixes GRPO for LLM Reinforcement Learning

3.5K views3 months ago

YouTubeAI Papers Academy

[RL Fine-Tuning] From RLHF to GRPO: The Evolution and Optimization of AI LLM Models Alignment.

275 views3 months ago

YouTubeAI Podcast Series. Byte Goose AI.

Proximal Policy Optimization PPO for Autonomous Drone Target Chasing

156 views6 months ago

YouTubeTechMon TC

Proximal Policy Optimization Explained

Find in video from 04:27Proximal Policy Optimization (PPO)

Proximal Policy Optimization Explained

78.7K viewsMay 20, 2021

YouTubeEdan Meyer

Proximal Policy Optimization | ChatGPT uses this

44.2K viewsDec 4, 2023

YouTubeCodeEmporium

Part 1 of 3 — Proximal Policy Optimization Implementation: 11 Core Implementation Details

65.6K viewsSep 10, 2021

YouTubeWeights & Biases

The RL Fine-Tuning Playbook: CoreWeave's Kyle Corbitt on GRPO, Rubrics, Environments, Reward Hacking

34.5K views2 weeks ago

Policy Gradient in 30 min

4.6K views6 months ago

YouTubeZachary Huang

PPO Coding | Proximal Policy Optimization (PPO) Code implementation | PPO in RL

535 viewsMar 5, 2025

YouTubeAILinkDeepTech

🔥 PPO (Proximal Policy Optimization) – OpenAI’s Most Advanced Reinforcement Learning Algorithm! 🤖

371 viewsMar 31, 2025

YouTubeNobleX Infinity Labs®️

Reinforcement Learning Models - Live Review 2

584 views9 months ago

YouTubeDr Mehrdad Arashpour

See more