BlueTube

DeepSeek R1 Explained to your grandma

Describing the key insights from the DeepSeek R1 paper in a way even your grandma could understand. I focus on the key concepts of chain of thought reasoning, reinforcement learning, and model distillation. Paper: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf Ollama link for local use: https://ollama.com/library/deepseek-r1 0:00 Introduction 0:43 Chain of Thought 1:33 Reinforcement Learning 3:53 Group Relative Policy Optimization 6:26 Distillation #deepseek #ai #largelanguagemodels

Top Bluesky Posts

You may also like

  • Epstein Spills Intel on Trump’s White House

  • Attention, all you pardoned criminals…

  • Pritzker: “White House is either lying to us or they’re critically incompetent.”

  • DISGUSTING Dr Phil is cashing on immigration raids

  • Dr. Phil Accidentally EXPOSES Trump

  • ICE Conducts Made-for-TV Raids as Cities from Chicago to Newark Resist Trump's Immigration Crackdown

  • Something BIG Is Cooking: Tricky Confluence - Bitcoin Today

  • Catching The Lies: Media Literacy in the Age of Misinformation | deep dive$, episode 24

  • How Trump plans to block Democrats from voting in future elections.

  • Airport Drops Bombshell on Trump, CALLS OUT HIS LIES

Powered by

(but not affiliated with)

Bluesky
YouTube

Created by mjd.dev