PodcastTecnologiaGenerative AI 101

Generative AI 101

Emily Laird
Generative AI 101
Ultimo episodio

253 episodi

  • Generative AI 101

    Terminal-Bench 2.0 & the Fight for Real Autonomy

    19/02/2026 | 0 min
    In this episode of Generative AI 101, host Emily Laird drags AI agents out of their cozy demo theaters and drops them into the command line arena, where pretty prose means nothing and only passing tests keep you alive. We break down Terminal-Bench 2.0, the 89-task obstacle course that exposes whether frontier models can actually compile code, patch vulnerabilities, and survive containerized environments without hallucinating their way into a crater. With scores under 65 percent for top systems, this is less victory lap and more reality check, a sharp look at the gap between sounding smart and finishing the job. If you have ever wondered whether AI autonomy is Iron Man or just a very confident intern with sudo access, this one is for you.

    Join the AI Weekly Meetups

    Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about the Terminal Bench 2.0 benchmark.

    Connect with Emily Laird on LinkedIn
  • Generative AI 101

    OpenClaw & the Delegation Dilemma

    17/02/2026 | 10 min
    In this episode of Generative AI 101, host Emily Laird examines OpenClaw, the open source AI assistant that jumped from polite chatbot to full blown operator with access to your apps, files, and digital identity. Drawing on reporting from Reuters and security warnings from Cisco and The Verge, she unpacks how OpenClaw’s rise, 100,000 GitHub stars and millions of visitors, signals a shift from chat to action, from suggestions to delegation. But with malicious skills, prompt injection risks, and policy alarms ringing, this is less Iron Man’s Jarvis and more a very confident intern with your passwords. If you have ever wondered what happens when convenience gets admin rights, this episode is your cautionary tale with a WiFi connection.

    Join the AI Weekly Meetups

    Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about OpenClaw.

    Connect with Emily Laird on LinkedIn
  • Generative AI 101

    BrowseComp vs The Bots that Bluff

    17/02/2026 | 10 min
    Can AI actually read the internet, or is it just faking it with confidence? In this high-voltage episode, host Emily Laird cracks open BrowseComp, OpenAI’s benchmark built to test whether web-browsing agents can find facts that are hard to uncover but easy to verify. Humans had two hours per question and still bailed most of the time, so what does it mean when a model claims victory? From compute budgets and canary strings to the rise of multimodal chaos, Emily exposes the difference between sounding right and being right, and why in an era of polished, source-backed answers, persistence beats plausible every time.

    Join the AI Weekly Meetups

    Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about the BrowseComp benchmark.

    Connect with Emily Laird on LinkedIn
  • Generative AI 101

    GDPval-AA & the AI Hunger Games for Your Job

    16/02/2026 | 9 min
    Is AI just good at trivia, or can it actually take your job? In this episode, host Emily Laird breaks down GDPval-AA, the benchmark pitting models against humans across 1,320 real world tasks, scored like chess and judged blind. With top models working faster and cheaper than any employee, this is less sci-fi and more spreadsheet reality. If you’ve ever wondered whether the robots are coming for your role, this is your warning shot.

    Join the AI Weekly Meetups

    Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about the GDPval-AA benchmark.

    Connect with Emily Laird on LinkedIn
  • Generative AI 101

    Claude Opus 4.6

    12/02/2026 | 11 min
    Host Emily Laird cracks open Claude Opus 4.6, Anthropic’s Feb 5, 2026 release that feels less like a chatbot and more like a full-time coworker who never blinks. This episode breaks down what “agentic” really means, why a million-token memory is basically an elephant with a spreadsheet addiction, and how “effort levels” let you pick between quick replies or deep, careful reasoning. You’ll also hear how Claude can spawn agent teams inside Claude Cowork (think The Bear, but with fewer knives and more revenue forecasts), plus the benchmarks that back up the hype across finance, law, terminal tasks, research hunts, and brutal exams. Emily closes with the spicy stuff, alignment, red-teaming, and the uneasy thrill of realizing your “assistant” might start running the meeting.

    Join the AI Weekly Meetups

    Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about Anthropic's Claude Opus 4.6.

    Connect with Emily Laird on LinkedIn

Altri podcast di Tecnologia

Su Generative AI 101

Welcome to Generative AI 101, your go-to podcast for learning the basics of generative artificial intelligence in easy-to-understand, bite-sized episodes. Join host Emily Laird, AI Integration Technologist and AI lecturer, to explore key concepts, applications, and ethical considerations, making AI accessible for everyone.
Sito web del podcast

Ascolta Generative AI 101, StoryTech e molti altri podcast da tutto il mondo con l’applicazione di radio.it

Scarica l'app gratuita radio.it

  • Salva le radio e i podcast favoriti
  • Streaming via Wi-Fi o Bluetooth
  • Supporta Carplay & Android Auto
  • Molte altre funzioni dell'app

Generative AI 101: Podcast correlati