• EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

  • Jul 17 2024
  • Durée: 10 min
  • Podcast

EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

  • Résumé

  • In this episode of "You Are A Helpful (Research) Assistant," delve into the AI-generated, human-curated exploration of refusal training vulnerabilities in language models. Uncover the past tense attack's impact on model behavior in this insightful discussion.

    Voir plus Voir moins

Ce que les auditeurs disent de EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

Moyenne des évaluations de clients

Évaluations – Cliquez sur les onglets pour changer la source des évaluations.