TreePO: Faster RL for LLM Reasoning
AI Research Roundup
22 Views
0 Likes
Understanding the LLM Inference Workload - Mark Moyou, NVIDI
PyTorch
GraphRAG: The Marriage of Knowledge Graphs and RAG: Emil Eif
AI Engineer
Proximal Policy Optimization (PPO) for LLMs Explained Intuit
Julia Turc
Faster LLMs: Accelerate Inference with Speculative Decoding
IBM Technology
The Strange Math That Predicts (Almost) Anything
Veritasium
Proximal Policy Optimization (PPO) - How to train Large Lang
Serrano.Academy
[Full Workshop] Reinforcement Learning, Kernels, Reasoning,
AI Engineer
Visualizing transformers and attention | Talk for TNG Big Te
Grant Sanderson
All Machine Learning algorithms explained in 17 min
Infinite Codes
Why Does Diffusion Work Better than Auto-Regression?
Algorithmic Simplicity
NVIDIA JetBlock: Faster AI Through Surgical Architecture. Po
AI Podcast Series. Byte Goose AI.
Reinforcement Learning from Human Feedback (RLHF) Explained
IBM Technology
LLM Training & Reinforcement Learning from Google Engineer |
Martin Is A Dad
you need to learn MCP RIGHT NOW!! (Model Context Protocol)
NetworkChuck
THIS is why large language models can understand the world
Algorithmic Simplicity
Can AI Think? Debunking AI Limitations
IBM Technology
But how do AI images and videos actually work? | Guest video
3Blue1Brown
Transformers, the tech behind LLMs | Deep Learning Chapter 5
3Blue1Brown
5 Unbelievably Useful AI Tools For Research in 2025 (better
Academic English Now
Reinforcement Learning with Neural Networks: Essential Conce
StatQuest with Josh Starmer