DeepSeek R1 Explained to your grandma

Name: DeepSeek R1 Explained to your grandma
Uploaded: 2025-01-23T05:11:34.000Z

Describing the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on the key concepts of chain of thought reasoning, reinforcement learning, and model distillation. Paper: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf Ollama link for local use: https://ollama.com/library/deepseek-r1 0:00 Introduction 0:43 Chain of Thought 1:33 Reinforcement Learning 3:53 Group Relative Policy Optimization 6:26 Distillation #deepseek #ai #largelanguagemodels

Top Bluesky Posts

Just some Boomer that u used to know
OK now I'm going off the reservation. Trying to get up to speed at a very basic level on DeepSeek R1 I just watched a video where it occurred to me that it could be used to leverage the HW3 chipset with a subset of parameters to get the same (or better?) performance as HW4? 😮 youtu.be/kv8frWeKoeo
1
View on Bluesky
The Voodoo Kudzu Podcast
youtu.be/kv8frWeKoeo?...
0
View on Bluesky
Yowiman
youtu.be/kv8frWeKoeo?...
0
View on Bluesky
Gabu
youtu.be/kv8frWeKoeo?... Capaz da minha avó entender e eu não
0
View on Bluesky
Dion Posdijk
DeepSeek R1 Explained to your grandma youtu.be/kv8frWeKoeo?... via @YouTube #DeepSeek
0
View on Bluesky
güven
DeepSeek için önce büyük bir model eğitilmiş, sonra o büyük modelle küçük model eğitilmiş. Küçük modelin performansı büyük modeli bile geçmiş. www.youtube.com/watch?v=kv8f...
0
View on Bluesky
alex
good video that explains the new r1 model by deepseek www.youtube.com/watch?v=kv8...
0
View on Bluesky