The AI Agent Trust Gap: Bridging Risk to Reliability | Elastic’s Philipp Krenn
The age of ubiquitous AI agents is here, bringing immense potential - and unprecedented risk.Hosts Conor Bronsdon and Vikram Chatterji open the episode by discussing the urgent need for building trust and reliability into next-generation AI agents. Vikram unveils Galileo's free AI reliability platform for agents, featuring Luna 2 SLMs for real-time guardrails and its Insights Engine for automatic failure mode analysis. This platform enables cost-effective, low-latency production evaluations, significantly transforming debugging. Achieving trustworthy AI agents demands rigorous testing, continuous feedback, and robust guardrailing—complex challenges requiring powerful solutions from partners like Elastic.Conor welcomes Philipp Krenn, Director of Developer Relations at Elastic, to discuss their collaboration in ensuring AI agent reliability, including how Elastic leverages Galileo's platform for evaluation. Philipp details Elastic's evolution from a search powerhouse to a key AI enabler, transforming data access with Retrieval-Augmented Generation (RAG) and new interaction modes. He discusses Elastic's investment in SLMs for efficient re-ranking and embeddings, emphasizing robust evaluation and observability for production. This collaborative effort aims to equip developers to build reliable, high-performing AI systems for every enterprise.Chapters:00:00 Introduction 01:09 Galileo's AI Reliability Platform01:43 Challenges in AI Agent Reliability06:17 Insights Engine and Its Importance11:00 Luna 2: Small Language Models14:42 Custom Metrics and Agent Leaderboard19:16 Galileo's Integrations and Partnerships21:04 Philipp Krenn from Elastic24:47 Optimizing LLM Responses 25:41 Galileo and Elastic: A Powerful Partnership28:20 Challenges in AI Production and Trust30:02 Guardrails and Reliability in AI Systems32:17 The Future of AI in Customer InteractionFollow the hostsFollow AtinFollow ConorFollow VikramFollow YashFollow Today's Guest(s)Connect with Philipp on LinkedInLearn more about ElasticCheck out GalileoTry GalileoAgent Leaderboard
--------
44:11
--------
44:11
Architecting Reliable Agentic AI | Cisco’s Giovanna Carofiglio on the AGNTCY Collective
The Internet of Agents is rapidly taking shape, necessitating innovative foundational standards, protocols, and evaluation methods for its success.Recorded at Cisco's office in San Jose, we welcome Giovanna Carofiglio, Distinguished Engineer and Senior Director at Outshift by Cisco. As a leader of the AGNTCY Collective (an open-source initiative by Cisco, Galileo, LangChain, and many other participating companies), Giovanna outlines the vision for agents to collaborate seamlessly across the enterprise and the internet. She details the collective's pillars, from agent discovery and deployment using new agentic protocols like Slim, to ensuring a secure, low-latency communication transport layer. This groundbreaking work aims to make distributed agentic communication a reality.The conversation then explores the critical role of observability and evaluation in building trustworthy agent applications, including defining an interoperable standard schema for communications. Giovanna highlights the complex challenges of scaling agents to thousands or millions, emphasizing the need for robust security (agent identity with OSF schema) and predictable agent behavior through extensive testing and characterization. She distinguishes between protocols like MCP (agent-to-tool) and A2A (agent-to-agent), advocating for open standards and underlying transport layers akin to TCP. Chapters:00:00 Introduction01:00 Overview of Agent Interoperability02:20 What is AGNTCY03:45 Agent Discovery and Composition04:38 Agent Protocols and Communication05:45 Observability and Evaluation07:00 Metrics and Standards for Agents09:45 Challenges in Agent Evaluation14:15 Low Latency and Active Evaluation23:34 Synthetic Data and Ground Truth25:07 Interoperable Agent Schema26:37 MCP & A2A30:17 Future of Agent Communication32:03 Security and Agent Identity34:37 Collaboration and Community Involvement38:28 Conclusion Follow the hostsFollow AtinFollow ConorFollow VikramFollow YashFollow Today's Guest(s)AGNTCY Collective: agntcy.orgConnect with Giovanna on LinkedInLearn more about Outshift: outshift.cisco.comCheck out GalileoTry GalileoAgent Leaderboard
--------
41:02
--------
41:02
Taste Is The New Moat | Intangible CEO on Brand, Distribution, and Winning in AI
When AI makes creating content and code nearly free, how do you stand out? Differentiation now hinges on two things: unique taste and effective distribution.This week, Bharat Vasan, founder & CEO at Intangible and a "recovering VC," explains why the AI landscape compelled him to return to founding. He sees AI sparking a new creative revolution, similar to the early internet, that makes it easier than ever to bring ideas to life. The conversation delivers essential advice for founders, revealing why relentless shipping is the ultimate clarifier for a business and why resilience, not just intelligence, is the key to survival.Drawing from his experience on both sides of the venture table, Bharat breaks down the brutally competitive VC landscape and shares Intangible's mission: to simplify 3D creative tools with AI, finally bridging the gap between human vision and machine power. Listeners will gain insights on company building, brand strategy, and why customer obsession is the ultimate moat in the AI age.Chapters:00:00 Introduction 00:45 From Founder to VC and Back03:17 Human Creativity in the Age of AI07:50 The Role of Taste and Distribution11:49 Building a Brand in the AI Era16:17 The Venture Capital Landscape for AI Startups20:11 Advice for Founders in the AI Boom23:55 Incumbents vs. Startups27:10 The New Generation of Innovators29:19 Pirate Mentality in Startups30:00 Building a Brand36:28 Shipping and Resilience41:49 Customer Obsession46:58 The Vision for Intangible51:52 ConclusionFollow the hostsFollow AtinFollow ConorFollow VikramFollow YashFollow Today's Guest(s)Connect with Bharat on LinkedIn.Follow Bharat on X.Learn more about Intangible at intangible.ai.Check out GalileoTry GalileoAgent Leaderboard
--------
53:01
--------
53:01
The Emerging AI Agent Stack | CrewAI’s João Moura
Unlocking AI agents for knowledge work automation and scaling intelligent, multi-agent systems within enterprises fundamentally requires measurability, reliability, and trust.João Moura, founder & CEO of CrewAI, joins Galileo’s Conor Bronsdon and Vikram Chatterji to unpack and define the emerging AI agent stack. They explore how enterprises are moving beyond initial curiosity to tackle critical questions around provisioning, authentication, and measurement for hundreds or thousands of agents in production. The discussion highlights a crucial "gold rush" among middleware providers, all racing to standardize the orchestration and frameworks needed for seamless agent deployment and interoperability. This new era demands a re-evaluation of everything from cloud choices to communication protocols as agents reshape the market.João and Vikram then dive into the complexities of building for non-deterministic multi-agent systems, emphasizing the challenges of increased failure modes and the need for rigorous testing beyond traditional software. They detail how CrewAI is democratizing agent access with a focus on orchestration, while Galileo provides the essential reliability platform, offering advanced evaluation, observability, and automated feedback loops. From specific use cases in financial services to the re-emergence of core data science principles, discover how companies are building trustworthy, high-quality AI products and prepare for the coming agent marketplace. Chapters:00:00 Introduction and Guest Welcome02:04 Defining the AI Agent Stack03:49 Challenges in Building AI Agents05:52 The Future of AI Agent Marketplaces06:59 Infrastructure and Protocols09:05 Interoperability and Flexibility20:18 Governance and Security Concerns24:12 Industry Adoption and Use Cases25:57 Unlocking Faster Development with Success Metrics28:40 Challenges in Managing Complex Systems30:10 Introducing the Insights Engine30:33 The Importance of Observability and Control32:33 Democratizing Access with No-Code Tools35:39 Ensuring Quality and Reliability in Production41:08 Future of Agentic Systems and Industry TransformationFollow the hostsFollow AtinFollow ConorFollow VikramFollow YashFollow Today's Guest(s)Joao Moura: LinkedIn | X/TwitterCrewAI: crewai.com | X/Twitter Check out GalileoTry GalileoAgent Leaderboard
--------
49:53
--------
49:53
AMD's Vision for an Open Ecosystem | Anush Elangovan & Sharon Zhou
How is an open ecosystem powering the next generation of AI for developers and leaders?Broadcasting live from the heart of the action at AMD's Advancing AI 2025, Chain of Thought host Conor Bronsdon welcomes AMD’s Anush Elangovan, VP of AI Software, and Sharon Zhou, VP of AI. They unpack AMD's groundbreaking transformation from a hardware giant to a leader in full-stack AI, committed to an open ecosystem. Discover how new MI350 GPUs deliver mind-blowing performance with advanced data types and why ROCm 7 and AMD Developer Cloud offer Day Zero support for frontier models.Then Conor welcomes Sharon Zhou, VP of AI at AMD, to discuss making AMD's powerful software stack truly accessible and how to drive developer curiosity. Sharon explains strategies for creating a "happy path" for community contributions, fostering engagement through teaching, and listening to developers at every stage. She shares her predictions for the future, including the rise of self-improving AI, the critical role of heterogeneous compute, and the potential of "vibes based feedback" to guide models. This vision for democratizing access to high-performance AI, driven by a deep understanding of the developer journey, promises to unlock the next generation of applications.Chapters:00:00 Live from AMD's Advancing AI 2025 Event00:30 Introduction to Anush Elangovan01:38 The MI350 GPU Series Unveiled04:57 CDNA4 Architecture Explained07:00 The Future of AI Infrastructure08:32 AMD's Developer Cloud and ROCm 711:50 Cultural Shift at AMD14:48 Open Source and Community Contributions18:35 Software Longevity and Ecosystem Strategy22:19 AI Agents and Performance Gains27:36 AI's Role in Solving Power Challenges28:11 Thanking Anush28:42 Introduction to Sharon Zhou29:45 Sharon's Focus at AMD30:39 Engaging Developers with AMD's AI Tools31:24 Listening to the AI Community33:56 Open Source and AI Development45:04 Future of AI and Self-Improving Models48:04 Final Thoughts and FarewellFollow the hostsFollow AtinFollow ConorFollow VikramFollow YashFollow Today's Guest(s)Anush Elangovan: LinkedInSharon Zhou: LinkedInAMD Official Site: amd.comAMD Developer Resources: AMD Developer CentralCheck out GalileoTry GalileoAgent Leaderboard
Introducing Chain of Thought, the podcast for software engineers and leaders that demystifies artificial intelligence.
Join us each week as we tell the stories of the people building the AI revolution, unravel actionable strategies and share practical techniques for building effective GenerativeAI applications.