• UnfoldAI — Building production-ready AI systems

  • Auteur(s): simeon.emanuilov
  • Podcast

UnfoldAI — Building production-ready AI systems

Auteur(s): simeon.emanuilov
  • Résumé

  • Welcome to the UnfoldAI podcast. I'm Simeon Emanuilov, a Senior Software Engineer focused on Python, Deep Learning and System design. This show explores how to build AI systems that address real-world challenges. We discuss everything from developing your initial AI product to ensuring its effectiveness in practical environments. Each episode offers concrete tips, thought-provoking discussions, and expert perspectives. Our content caters to both newcomers and experienced professionals, breaking down complex ideas into accessible, actionable information.
    simeon.emanuilov
    Voir plus Voir moins
activate_Holiday_promo_in_buybox_DT_T2
Épisodes
  • GPU memory management for Large Language Models
    Sep 30 2024

    Join us as we dive deep into the fascinating world of large language models and the intricate dance of GPU memory management that powers them.

    In this episode, we break down the complexities of running these massive AI models, exploring everything from model parameters and KV caches to cutting-edge optimization techniques like PagedAttention and vLLM.

    We'll unpack why efficient memory usage matters for everyday users, developers, and researchers alike. Using relatable analogies, we'll explain concepts like beam search, quantization, and the delicate balance between performance and memory constraints. Whether you're a tech enthusiast or an AI developer, this episode offers valuable insights into the challenges and innovations shaping the future of AI language models.

    Tune in to learn about the creative solutions tackling memory limitations and making advanced AI more accessible. We'll discuss real-world implications, provide practical examples, and offer a glimpse into the exciting developments on the horizon. Don't miss this informative and engaging exploration of the memory management techniques powering the AI revolution!

    Read the article: https://unfoldai.com/gpu-memory-requirements-for-llms/

    Voir plus Voir moins
    16 min
  • ColPali — Seeing beyond words in document search
    Sep 29 2024

    In this episode of UnfoldAI, we dive deep into ColPali, a groundbreaking AI system that's transforming how we search and understand documents. We explore how ColPali combines advanced language processing with visual comprehension to decode not just text, but charts, diagrams, and document layouts.

    Learn about the innovative "late interaction" technique that allows ColPali to make connections between text and visuals in real-time, and discover how multi-vector embeddings enable lightning-fast, context-aware search across vast document collections. We discuss ColPali's performance on the ViDoRe benchmark and its potential to revolutionize fields like academic research and healthcare.


    Full article: https://unfoldai.com/colpali/

    Voir plus Voir moins
    9 min
  • FastAPI's secret weapon — Unleashing the power of background tasks
    Sep 29 2024

    In this episode of UnfoldAI, we dive into the world of responsive web applications with FastAPI's background tasks. We break down complex concepts like asynchronous processing and event loops into easy-to-understand analogies, making them accessible to developers of all levels.

    You'll discover how background tasks can dramatically improve user experience by handling time-consuming operations without freezing up your app. We explore real-world examples, from processing large files to sending notifications, and discuss advanced techniques like task chaining and connection pooling.

    Whether you're building your first API or optimizing existing ones, this episode offers practical insights into creating lightning-fast, efficient web applications. Join us as we unpack FastAPI's powerful features and learn how to take your web development skills to the next level.

    Full article is here: https://unfoldai.com/fastapi-background-tasks/

    Voir plus Voir moins
    8 min

Ce que les auditeurs disent de UnfoldAI — Building production-ready AI systems

Moyenne des évaluations de clients

Évaluations – Cliquez sur les onglets pour changer la source des évaluations.