Guardrails & Alignment
Safety measures, content filtering, red-teaming
5 episodes
The Sycophancy Trap: Getting Honest Feedback from AI
Is your AI just telling you what you want to hear? Learn how to break the "sycophancy trap" and get truly objective feedback from your agents.
The Agentic AI Dilemma: Who Holds the Kill Switch?
As AI shifts from chatbots to autonomous agents, Herman and Corn explore how to maintain human control in a high-stakes automated world.
The Price of Politeness: Should AI Guardrails Stay?
Herman and Corn debate the hidden costs of AI safety layers and what happens when we strip away the "corporate HR" personality of LLMs.
Echoes in the Machine: When AI Talks to Itself
What happens when two AIs talk forever with no human input? Herman and Corn explore the weird world of digital feedback loops.
AI Guardrails: Fences, Failures, & Free Speech
AI guardrails: Fences, failures, and free speech. Can we control AI's infinite output, or do digital fences always break?