DEV Community

# reinforcementlearning

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Q-Learning for Games: Teaching an Agent Tic-Tac-Toe Through Self-Play

Q-Learning for Games: Teaching an Agent Tic-Tac-Toe Through Self-Play

Comments
14 min read
Value Iteration vs Q-Learning: Dynamic Programming Meets RL

Value Iteration vs Q-Learning: Dynamic Programming Meets RL

Comments
12 min read
Solving CartPole Without Gradients: Simulated Annealing

Solving CartPole Without Gradients: Simulated Annealing

Comments
13 min read
The Cross-Entropy Method: Solving RL Without Gradients

The Cross-Entropy Method: Solving RL Without Gradients

1
Comments
12 min read
Self-Learning AI Agents; Architectures and Challenges

Self-Learning AI Agents; Architectures and Challenges

1
Comments 1
3 min read
Policy Gradients: REINFORCE from Scratch with NumPy

Policy Gradients: REINFORCE from Scratch with NumPy

Comments
16 min read
Deep Q-Networks: Experience Replay and Target Networks

Deep Q-Networks: Experience Replay and Target Networks

Comments
18 min read
Q-Learning from Scratch: Navigating the Frozen Lake

Q-Learning from Scratch: Navigating the Frozen Lake

Comments
11 min read
Evolution Is Back: A New Way to Fine‑Tune LLMs

Evolution Is Back: A New Way to Fine‑Tune LLMs

1
Comments
7 min read
Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)

Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)

1
Comments
4 min read
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!

Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!

Comments
11 min read
A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there

A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there

5
Comments
4 min read
Top 15 Reinforcement Learning Questions That Will Appear in Exams

Top 15 Reinforcement Learning Questions That Will Appear in Exams

6
Comments
2 min read
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide

Reinforcement Learning for Robotics: A Comprehensive 2025 Guide

1
Comments
52 min read
Embodied AI Systems: Extending Intelligence Through Learning in the Environment

Embodied AI Systems: Extending Intelligence Through Learning in the Environment

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.