Episodes

  • Mixed Attention & LLM Context | Data Brew | Episode 35
    Nov 21 2024

    In this episode, Shashank Rajput, Research Scientist at Mosaic and Databricks, explores innovative approaches in large language models (LLMs), focusing on Retrieval-Augmented Generation (RAG) and how it improves efficiency and reduces operational costs.

    Highlights include:
    - How RAG enhances LLM accuracy by incorporating relevant external documents.
    - The evolution of attention mechanisms, including mixed attention strategies.
    - Practical applications of Mamba architectures and their trade-offs with traditional transformers.

    39 min
  • Kumo AI & Relational Deep Learning | Data Brew | Episode 34
    Oct 14 2024

    In this episode, Jure Leskovec, Co-founder of Kumo AI and Professor of Computer Science at Stanford University, discusses Relational Deep Learning (RDL) and its role in automating feature engineering.

    Highlights include:
    - How RDL enhances predictive modeling.
    - Applications in fraud detection and recommendation systems.
    - The use of graph neural networks to simplify complex data structures.

    43 min
  • LLMs: Internals, Hallucinations, and Applications | Data Brew | Episode 33
    Jul 21 2023

    Our fifth season dives into large language models (LLMs), from understanding the internals to the risks of using them and everything in between. While we're at it, we'll be enjoying our morning brew.

    In this session, we interviewed Chengyin Eng (Senior Data Scientist, Databricks), Sam Raymond (Senior Data Scientist, Databricks), and Joseph Bradley (Lead Production Specialist - ML, Databricks) on the best practices around LLM use cases, prompt engineering, and how to adapt MLOps for LLMs (i.e., LLMOps).

    39 min
  • Demonstrate–Search–Predict Framework | Data Brew | Episode 32
    Jun 29 2023

    We will dive into LLMs for our fifth season, from understanding the internals to the risks of using them and everything in between. While we’re at it, we’ll be enjoying our morning brew.

    In this session, we interviewed Omar Khattab, Computer Science Ph.D. student at Stanford and creator of the Demonstrate–Search–Predict (DSP) framework, to discuss DSP, common applications, and the future of NLP.

    33 min
  • Generative AI Risks | Data Brew | Episode 31
    Jun 8 2023

    We will dive into LLMs for our fifth season, from understanding the internals to the risks of using them and everything in between. While we’re at it, we’ll be enjoying our morning brew.

    In this session, we interviewed Yaron Singer, CEO of Robust Intelligence, Professor of Computer Science at Harvard University, and guest of Data Brew Season 3 (our first repeat guest!). We discussed generative AI, the trend toward embracing LLMs, and how the surface area for vulnerabilities in generative AI is much bigger.

    35 min
  • John Snow Labs & SparkNLP | Data Brew | Episode 30
    Jun 1 2023

    We are back, and we will dive into LLMs, from understanding the internals to the risks of using them and everything in between. While we’re at it, we’ll be enjoying our morning brew.

    In this session, we interviewed David Talby, CTO at John Snow Labs, which helps healthcare & life science companies put AI to good use. David's interests include natural language processing, applied artificial intelligence in healthcare, and responsible AI.

    43 min
  • Data Brew Season 4 Episode 6: Professional Athletes
    Jun 9 2022

    For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew.

    Shayna Powless and Eli Ankou, professional cyclist for L39ion of Los Angeles and defensive tackle for the Buffalo Bills, respectively, provide valuable insight on how professional athletes leverage data to improve their performance and how they combine their passion for sports with the Dreamcatcher Foundation.

    See more at databricks.com/data-brew

    36 min
  • Data Brew Season 4 Episode 5: Public Health: Education, Access, and Policy
    May 5 2022

    For our fourth season, we focus on connected health and how data & AI augment and improve our daily health. While we’re at it, we’ll be enjoying our morning brew.

    Matt Willis, Marin County Public Health Officer, shares the three pillars of public health: education, access, and policy, and the critical role data plays in addressing the COVID-19 pandemic & opioid epidemic.

    See more at databricks.com/data-brew

    35 min