• EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

  • Jul 17 2024
  • Length: 10 mins
  • Podcast

EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

  • Summary

  • In this episode of "You Are A Helpful (Research) Assistant," delve into the AI-generated, human-curated exploration of refusal training vulnerabilities in language models. Uncover the past tense attack's impact on model behavior in this insightful discussion.

    Show More Show Less
activate_samplebutton_t1

What listeners say about EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

Average customer ratings

Reviews - Please select the tabs below to change the source of reviews.