Skip to content
Monday, April 20, 2026
Latest Posts
  • Pair programming with AI in 2025: habits that stick
  • vLLM in 2025: the improvements that matter to LLM-serving teams
  • Microsoft's GraphRAG in enterprise: patterns that work
  • Alignment evaluation: RLHF, DPO, and recent alternatives
  • Gemma 2: Google's open model one year later

Jacar

Passion for Technology

  • en
    • es

Tag: llm

vLLM in 2025: the improvements that matter to LLM-serving teams

February 14, 2025 javi
Tarjetas gráficas GPU alineadas en un chasis abierto con luces azules

vLLM has consolidated as the most widely adopted LLM serving engine in production. A review of recent improvements, what changes for operators, and what remains a weak point.

Read more
Inteligencia Artificial 

Microsoft’s GraphRAG in enterprise: patterns that work

February 11, 2025 javi
Nodos y conexiones abstractas formando un grafo sobre fondo violeta

GraphRAG has been in real enterprise use for a year. A balance of which question types it answers better than classic RAG, what it costs to operate, and when the extra complexity pays off.

Read more
Arquitectura Inteligencia Artificial 

Alignment evaluation: RLHF, DPO, and recent alternatives

February 8, 2025 javi
Balanza antigua sobre fondo neutro simbolizando equilibrio y ajuste

Three years after RLHF became popular, the alignment landscape is richer. A review of RLHF, DPO, and recent methods like KTO or ORPO, with criteria for choosing.

Read more
Inteligencia Artificial Metodologías 

Gemma 2: Google’s open model one year later

February 5, 2025 javi
Flor de loto abierta sobre superficie azul, metáfora visual de apertura

Google released Gemma 2 in mid-2024 and it’s been in real-world use for a while. A balance of how it competes in the open-model ecosystem, which sizes make sense, and where adoption has taken hold.

Read more
Inteligencia Artificial 

o3 in public: the reasoning leap is confirmed

February 2, 2025 javi
Tablero de ajedrez con piezas dispuestas evocando cálculo estratégico complejo

OpenAI’s o3 series is starting to become available and marks a real shift in complex reasoning. A look at where it shines, where it still fails, and what changes for those building products with LLMs.

Read more
Inteligencia Artificial 

Gemini 2.0: integrated tools and agent mode

January 30, 2025 javi
Fondo abstracto con luces brillantes formando patrones de red neuronal

Google has released Gemini 2.0 with a clear emphasis on tool use and agents. A look at what it brings, where it lags behind competitors, and in what kind of applications it fits best.

Read more
Inteligencia Artificial 

NPU in the PC: faster, cheaper local AI

January 6, 2025 javi
Placa base con chip procesador visible bajo luz azulada

Copilot+ processors from Qualcomm, Intel, and AMD have normalized NPUs in consumer PCs. What really changes for running local models, and when it’s worth it.

Read more
Inteligencia Artificial Tecnología 

Mistral Large: European Contender Against GPT-4

September 29, 2024 javi
Paisaje europeo con torres antiguas representando presencia europea en tecnología

Mistral Large 2 closes gap with GPT-4 and Claude from Europe. EU residency, pricing, and when to choose vs alternatives.

Read more
Inteligencia Artificial 

GPT-4 Turbo: Long Context and More Reasonable Costs

July 4, 2024 javi
Teclado de ordenador con teclas retroiluminadas en azul representando interacción AI

GPT-4 Turbo doubled GPT-4’s context and cut price 3x. Six months later, does it remain relevant or has GPT-4o replaced everything.

Read more
Inteligencia Artificial 

Constrained Decoding for Structured LLM Outputs

April 26, 2024 javi
Sistema de tuberías industriales con flujos controlados representando decodificación restringida

Outlines, Guidance, and jsonformer force LLMs to generate valid JSON, regex, or grammars. How they work and when they beat prompting.

Read more
Desarrollo de Software Inteligencia Artificial 

Posts navigation

Older posts

Recent Posts

  • Pair programming with AI in 2025: habits that stick
  • vLLM in 2025: the improvements that matter to LLM-serving teams
  • Microsoft’s GraphRAG in enterprise: patterns that work
  • Alignment evaluation: RLHF, DPO, and recent alternatives
  • Gemma 2: Google’s open model one year later
  • o3 in public: the reasoning leap is confirmed
  • Gemini 2.0: integrated tools and agent mode
  • Home lab: self-hosted lab as a testing ground
  • Full-stack TypeScript in 2025: the good, the meh, the bad
  • WASI preview 3: threads and async in WebAssembly
  • AI-assisted code review: an honest adoption story
  • Llama 3.2 at the edge: Meta bets on small
  • Cloudflare Workers in 2025: from edge to enterprise
  • Generics in Go: three years later, what has survived
  • NPU in the PC: faster, cheaper local AI

Copyright © All rights reserved

Usamos cookies para asegurar que te damos la mejor experiencia en nuestra web. Si continúas usando este sitio, asumiremos que estás de acuerdo con ello. Pulsa en "Aceptar todo" si estás de acuerdo.
Ajustes de CookiesAceptar Todo
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT