The Checklist Shortcut to Smarter, Safer AI
This article explores Reinforcement Learning from Checklist Feedback (RLCF), a new approach that replaces reward models with checklists to align large language models. By breaking instructions into clear, verifiable steps, checklists provide richer, more interpretable feedback and consistently improve performance across benchmarks. The piece examines how this shift could make AI more reliable, transparent, and user-aligned. A toy illustration of checklist scoring follows this entry.

Juan Manuel Ortiz de Zarate
Sep 4, 2025 · 12 min read
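The core idea behind RLCF is easy to sketch: an instruction is decomposed into verifiable yes/no items, and a candidate response is rewarded by the fraction of items it satisfies. The minimal Python sketch below shows only that scoring step, not the paper's implementation; the `ChecklistItem`, `checklist_reward`, and `toy_judge` names and the keyword-based judge are assumptions for illustration (in practice the judge would itself be an LLM or a programmatic verifier).

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ChecklistItem:
    question: str  # a single verifiable yes/no requirement


def checklist_reward(
    response: str,
    checklist: List[ChecklistItem],
    judge: Callable[[str, str], bool],  # judge(response, question) -> pass/fail
) -> float:
    """Return the fraction of checklist items the response satisfies (0.0 to 1.0)."""
    if not checklist:
        return 0.0
    passed = sum(judge(response, item.question) for item in checklist)
    return passed / len(checklist)


# Toy usage with a keyword stand-in for an LLM judge.
checklist = [
    ChecklistItem("Does the reply mention a limitation?"),
    ChecklistItem("Is the reply under 100 words?"),
]


def toy_judge(response: str, question: str) -> bool:
    if "limitation" in question:
        return "limitation" in response.lower()
    return len(response.split()) < 100


print(checklist_reward("One limitation is cost.", checklist, toy_judge))  # 1.0
```

Because each item is checked independently, the resulting score is interpretable: a low reward points to the specific requirements the response missed rather than an opaque scalar from a reward model.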


The Flattering Machine
This article explores Social Sycophancy, a broader form of flattery in large language models where systems preserve users’ self-image rather than offer balanced guidance. Building on Goffman’s face theory, it introduces the ELEPHANT framework to measure emotional validation, moral endorsement, indirectness, and framing acceptance. Findings show LLMs are far more sycophantic than humans, raising risks for users, society, and developers, and calling for new safeguards.

Juan Manuel Ortiz de Zarate
Aug 29, 2025 · 9 min read


Adventuring with AI: What Classic Games Teach Us About Modern Models
TextQuests introduces a benchmark built on 25 Infocom text-based adventure games to evaluate LLMs in dynamic, exploratory environments. Unlike static benchmarks, it tests long-context reasoning, trial-and-error learning, and ethical decision-making without external tools. Results show that even advanced models like GPT-5 struggle with sustained strategy, highlighting current limits in autonomy, memory, and adaptive reasoning.

Juan Manuel Ortiz de Zarate
Aug 22, 2025 · 10 min read


Language-Driven Precision in the Operating Room
The Hierarchical Surgical Robot Transformer (SRT-H) brings step-level autonomy to surgery by combining a language-driven high-level planner with a vision-guided low-level executor. Trained on over 16,000 demonstrations, it completed the clipping-and-cutting phase of gallbladder removal with 100% success in ex-vivo trials, adapting to variations and self-correcting without human intervention—marking a milestone toward clinically viable autonomous surgery.

Juan Manuel Ortiz de Zarate
Aug 13, 2025 · 10 min read


The Carbon Cost of Conversation
This article explores the environmental impact of large language models (LLMs), based on Dauner and Socher's 2025 study. By analyzing 14 models across reasoning tasks, it reveals a trade-off between accuracy and CO₂ emissions. Larger models and reasoning modes achieve higher performance but drastically increase energy use due to verbose outputs. The findings highlight the urgent need for optimizing reasoning efficiency and integrating sustainability into AI development. A back-of-the-envelope illustration of the trade-off follows this entry.

Juan Manuel Ortiz de Zarate
Aug 7, 2025 · 10 min read
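To make the trade-off concrete, here is a hypothetical back-of-the-envelope Python calculation, not figures from the study: the per-token energy and grid carbon-intensity constants are assumed, and the point is only that emissions scale roughly with the number of generated tokens, so a verbose reasoning trace costs far more CO₂ than a concise answer to the same question.

```python
# Assumed illustrative constants, not measurements from the study.
ENERGY_PER_TOKEN_WH = 0.002      # inference energy per generated token (Wh)
GRID_INTENSITY_G_PER_WH = 0.4    # grid carbon intensity (g CO2 per Wh)


def co2_grams(generated_tokens: int) -> float:
    """Estimated CO2 in grams for one response of the given length."""
    return generated_tokens * ENERGY_PER_TOKEN_WH * GRID_INTENSITY_G_PER_WH


concise = co2_grams(150)       # short, direct answer
reasoning = co2_grams(3_000)   # long chain-of-thought style answer
print(f"concise: {concise:.2f} g, reasoning: {reasoning:.2f} g "
      f"({reasoning / concise:.0f}x more)")
```

Under these assumed numbers the reasoning-style answer emits about twenty times more CO₂ for the same question, which is the shape of the accuracy-versus-emissions trade-off the study quantifies.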


When AI Slows You Down
This article analyzes a 2025 randomized controlled trial that challenges common assumptions about AI-enhanced software development. Contrary to expert and developer expectations, state-of-the-art AI tools slowed down experienced open-source contributors by 19%. Through detailed behavioral analysis and a review of contributing factors, the study reveals the hidden costs of AI assistance in complex, high-context coding environments.

Juan Manuel Ortiz de Zarate
Aug 2, 2025 · 11 min read