unlord l4b
Posts
Tags
Categories
CV
Algorithms
2025
RLHF (Reinforcement Learning from Human Feedback). The algorithm of anguish.
11-09