Synthetic training data in 2026: when it works

Synthetic data has moved from precarious substitute for real data to central component of modern training. These are the patterns that work and those still failing.

155 5 min April 28, 2026 4.3

Inteligencia Artificial

DPO and alternatives to RLHF: practical state in 2026

Direct Preference Optimization and its relatives have displaced RLHF as the preferred alignment method in much of the ecosystem. This is the practical state of the field in 2026.

846 5 min April 28, 2026 4.7

Inteligencia Artificial

Alignment evaluation: RLHF, DPO, and recent alternatives

Tres años después de que RLHF se hiciera popular, el paisaje del alineamiento de modelos es más rico. Repaso de RLHF, DPO y los métodos más recientes como KTO o ORPO, con criterios para elegir.

255 11 min February 8, 2025

Inteligencia Artificial

LoRA and QLoRA: Efficient Fine-Tuning on a Single Laptop

LoRA reduce el coste del fine-tuning de forma dramática. QLoRA va aún más allá combinando cuantización y adaptadores de bajo rango. Cómo funcionan, cuándo usarlos y qué calidad esperar.

168 13 min October 29, 2024 4.6

Desarrollo de Software

LLM Fine-Tuning: When It’s Worth Training Your Own

Fine-tuning sigue siendo caro y operativamente complejo. Guía para decidir entre RAG, prompt engineering y entrenamiento propio.

160 9 min July 13, 2023 4.6

Inteligencia Artificial

Pre-trained Models and Transfer Learning

La transferencia de aprendizaje permite reutilizar modelos entrenados en grandes conjuntos de datos para resolver tareas nuevas con mucho menos datos y tiempo de cómputo. Cómo funciona y cuándo usarla.

202 11 min March 18, 2023 4.1