Describing the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on the key concepts of chain of thought reasoning, reinforcement learning, and model distillation. Paper: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf Ollama link for local use: https://ollama.com/library/deepseek-r1 0:00 Introduction 0:43 Chain of Thought 1:33 Reinforcement Learning 3:53 Group Relative Policy Optimization 6:26 Distillation #deepseek #ai #largelanguagemodels
OK now I'm going off the reservation. Trying to get up to speed at a very basic level on DeepSeek R1 I just watched a video where it occurred to me that it could be used to leverage the HW3 chipset with a subset of parameters to get the same (or better?) performance as HW4? 😮 youtu.be/kv8frWeKoeo
youtu.be/kv8frWeKoeo?...
youtu.be/kv8frWeKoeo?...
youtu.be/kv8frWeKoeo?... Capaz da minha avó entender e eu não
DeepSeek R1 Explained to your grandma youtu.be/kv8frWeKoeo?... via @YouTube #DeepSeek
DeepSeek için önce büyük bir model eğitilmiş, sonra o büyük modelle küçük model eğitilmiş. Küçük modelin performansı büyük modeli bile geçmiş. www.youtube.com/watch?v=kv8f...
good video that explains the new r1 model by deepseek www.youtube.com/watch?v=kv8...
You may also like
Powered by
(but not affiliated with)
Created by mjd.dev