Machine Learning Content on InfoQ
-
AI for Food Image Generation in Production: How & Why
Iaroslav Amerkhanov discusses how his team at Delivery Hero leveraged GenAI to generate food images, detailing the architecture, optimization, and business impact.
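For readers who want a concrete picture of the generation step, here is a minimal text-to-image sketch with Hugging Face diffusers; the model ID, prompt, and settings are placeholders for illustration, not Delivery Hero's production pipeline.

```python
import torch
from diffusers import StableDiffusionPipeline

# Placeholder open model, not the one used in production.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")  # assumes a GPU is available

image = pipe(
    "a bowl of ramen with a soft-boiled egg, studio food photography",
    num_inference_steps=30,
    guidance_scale=7.5,
).images[0]
image.save("ramen.png")
```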
-
10 Reasons Your Multi-Agent Workflows Fail and What You Can Do about It
Victor Dibia discusses multi-agent systems, detailing how to build them with AutoGen, common failure points, and strategic approaches for senior software developers and engineering leaders.
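As a rough sketch of the kind of two-agent loop the talk is about, here is the classic pyautogen (v0.2) style; AutoGen's API has changed across versions, and the model name and API key below are placeholders.

```python
from autogen import AssistantAgent, UserProxyAgent

# Placeholder LLM configuration.
llm_config = {"config_list": [{"model": "gpt-4o-mini", "api_key": "sk-..."}]}

assistant = AssistantAgent("assistant", llm_config=llm_config)
user_proxy = UserProxyAgent(
    "user_proxy",
    human_input_mode="NEVER",      # fully automated loop
    code_execution_config=False,   # no local code execution in this sketch
    max_consecutive_auto_reply=3,  # a hard stop is one guard against runaway loops
)

user_proxy.initiate_chat(
    assistant,
    message="Draft a plan for migrating our batch jobs to streaming.",
)
```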
-
Maximizing Deep Learning Performance on CPUs using Modern Architectures
Bibek Bhattarai demystifies Intel AMX, explaining how this CPU architecture accelerates deep learning workloads via low-precision matrix multiplication and efficient data handling.
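Not from the talk itself, but as a hedged illustration of the low-precision path AMX accelerates: on an AMX-capable Xeon, PyTorch's CPU backend (oneDNN) can route bfloat16 matrix multiplications through AMX tiles. The shapes below are arbitrary.

```python
import torch

# Arbitrary GEMM shapes purely for illustration; on an AMX-capable CPU
# (e.g. a 4th-gen Xeon), PyTorch's oneDNN backend can dispatch bf16 matmuls
# to AMX tile instructions.
a = torch.randn(1024, 1024)
b = torch.randn(1024, 1024)

with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    c = a @ b  # computed in bfloat16 where the backend supports it

print(c.dtype)  # torch.bfloat16
```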
-
A Framework for Building Micro Metrics for LLM System Evaluation
Denys Linkov discusses critical lessons for senior developers and leaders on building robust LLM systems and actionable metrics that prevent production issues and drive business value.
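To give a flavor of what a "micro metric" can look like in code (an illustration, not Linkov's framework), here is a per-response check aggregated into a rate you could track and alert on; the sample responses are made up.

```python
import json

def is_valid_json(response: str) -> bool:
    """Micro metric: did the model return parseable JSON?"""
    try:
        json.loads(response)
        return True
    except json.JSONDecodeError:
        return False

# Aggregate the per-response check into a rate that can be monitored.
responses = ['{"intent": "refund"}', "Sorry, I can't help with that."]
valid_rate = sum(is_valid_json(r) for r in responses) / len(responses)
print(f"valid-JSON rate: {valid_rate:.0%}")  # 50%
```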
-
Supporting Diverse ML Systems at Netflix
David Berg and Romain Cledat discuss Metaflow, Netflix's ML infrastructure for diverse use cases from computer vision to recommendations.
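For context, Metaflow expresses a workload as a flow of steps; the toy flow below only shows the API shape and is not a Netflix workload.

```python
from metaflow import FlowSpec, step

class ToyTrainingFlow(FlowSpec):
    """Minimal Metaflow flow illustrating the step API, not a real workload."""

    @step
    def start(self):
        self.data = list(range(10))  # stand-in for feature loading
        self.next(self.train)

    @step
    def train(self):
        self.model = sum(self.data) / len(self.data)  # stand-in "model"
        self.next(self.end)

    @step
    def end(self):
        print(f"trained model: {self.model}")

if __name__ == "__main__":
    ToyTrainingFlow()
```

Run with `python toy_flow.py run`.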
-
From "Simple" Fine-Tuning to Your Own Mixture of Expert Models Using Open-Source Models
Sebastiano Galazzo shares practical tips, and mistakes to avoid, when creating custom LLMs for cost-effective AI, covering LoRA, model merging, mixture of experts (MoE), and optimization.
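As a hedged sketch of what the "simple" fine-tuning starting point looks like with open-source tooling (not necessarily the speaker's setup), the Hugging Face PEFT library attaches low-rank adapters to a base model; the base model ID and hyperparameters below are placeholders.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_id = "facebook/opt-350m"  # placeholder base model, swap in your own
model = AutoModelForCausalLM.from_pretrained(base_id)
tokenizer = AutoTokenizer.from_pretrained(base_id)

# Low-rank adapters on the attention projections; r and alpha are
# illustrative defaults, not values recommended in the talk.
lora_cfg = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # only the adapter weights are trainable
```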
-
How Green is Green: LLMs to Understand Climate Disclosure at Scale
Leo Browning explains the journey of developing a Retrieval Augmented Generation (RAG) system at a climate-focused startup.
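For readers unfamiliar with the pattern, a minimal retrieval step looks roughly like the sketch below (generic, not the startup's pipeline): embed the corpus, retrieve the nearest chunks for a query, and pass them to an LLM as context. The embedding model and toy documents are assumptions.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

# Toy corpus standing in for climate-disclosure documents.
docs = [
    "Company A reports Scope 1 emissions of 12 ktCO2e for 2023.",
    "Company B discloses no climate-related targets.",
    "Company C commits to net zero by 2040.",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
doc_vecs = embedder.encode(docs, normalize_embeddings=True)

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k documents most similar to the query by cosine similarity."""
    q = embedder.encode([query], normalize_embeddings=True)[0]
    scores = doc_vecs @ q
    return [docs[i] for i in np.argsort(scores)[::-1][:k]]

context = "\n".join(retrieve("Which companies have net-zero targets?"))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: ..."
# `prompt` would then be sent to the LLM of your choice.
print(prompt)
```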
-
LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries
Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries, covering MLOps, legislation, and future trends.
-
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.
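One common self-hosting pattern (an illustration, not specifically Arik's recommendation) is to serve an open model behind an OpenAI-compatible endpoint, such as those exposed by vLLM or Text Generation Inference, so application code stays provider-agnostic; the URL and model name below are placeholders.

```python
from openai import OpenAI

# Placeholder endpoint for a self-hosted, OpenAI-compatible server.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.2",  # placeholder model name
    messages=[{"role": "user", "content": "Summarize our deployment options."}],
    max_tokens=128,
)
print(resp.choices[0].message.content)
```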
-
The Harsh Reality of Building a Real-Time ML Feature Platform
Ivan Burmistrov shares how ShareChat built its own Real-Time Feature Platform serving more than 1 billion features per second, and how the team managed to make it cost-efficient.
-
Recommender and Search Ranking Systems in Large Scale Real World Applications
Moumita Bhattacharya gives an overview of industry search and recommendation systems, covering modeling choices, data requirements, and infrastructure requirements, while highlighting challenges.
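A common shape for such systems (illustrative only, not the specific architectures covered in the talk) is two-stage: cheap candidate retrieval over the full catalog, followed by a heavier ranking model over a short list. The toy embeddings and scoring below stand in for learned models.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy embeddings standing in for items and a user; real systems learn these.
item_vecs = rng.normal(size=(10_000, 64)).astype(np.float32)
user_vec = rng.normal(size=64).astype(np.float32)

# Stage 1: candidate retrieval by dot-product similarity (real systems use
# approximate nearest-neighbor indexes at this scale).
scores = item_vecs @ user_vec
candidates = np.argsort(scores)[::-1][:100]

# Stage 2: re-rank the short list with a richer scoring function, which in
# practice would be a learned model over many more features.
def rank_score(item_id: int) -> float:
    return float(scores[item_id]) + 0.1 * rng.normal()

ranked = sorted(candidates, key=rank_score, reverse=True)[:10]
print(ranked)
```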