Talk details
From Product Images to Structured Data: VLMs at Marketplace Scale
GPU budgets do not have to scale with the number of images processed. At Mirakl, we’ve built a cloud-native inference stack for our Catalog Transformer that processes product images at scale and extracts structured facts for downstream use cases such as image ordering and background removal. Catalogs with thousands of products are preprocessed with Apache Spark, then served through vision-language models on KServe with a vLLM backend, optimized with fine-tuned LoRAs, and amortized in cost with caching. We will unpack the core building blocks we chose and the trade-offs we navigated in production, as a blueprint other teams can reuse. We will close with two operational pillars for scale: parallelizing and regulating traffic with event-driven queues, and the introduction of an AI gateway on our roadmap.
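To make the caching idea concrete, here is a minimal sketch of amortizing VLM inference cost by keying results on image content, prompt, and model version. All names (`CachedExtractor`, `fake_vlm`, the model-version string) are hypothetical illustrations, not Mirakl's implementation; a production setup would back the cache with a shared store rather than an in-process dict.

```python
import hashlib

def cache_key(image_bytes: bytes, prompt: str, model_version: str) -> str:
    # Key on image content + prompt + model/LoRA version, so that
    # redeploying a new adapter invalidates stale extractions.
    h = hashlib.sha256()
    h.update(image_bytes)
    h.update(prompt.encode("utf-8"))
    h.update(model_version.encode("utf-8"))
    return h.hexdigest()

class CachedExtractor:
    """Wraps an expensive VLM call with a cache (in-memory stand-in for a shared store)."""

    def __init__(self, extract_fn, model_version: str):
        self.extract_fn = extract_fn
        self.model_version = model_version
        self.cache: dict[str, dict] = {}
        self.calls = 0  # counts how often the underlying model actually runs

    def extract(self, image_bytes: bytes, prompt: str) -> dict:
        key = cache_key(image_bytes, prompt, self.model_version)
        if key not in self.cache:
            self.calls += 1  # cache miss: pay for one GPU inference
            self.cache[key] = self.extract_fn(image_bytes, prompt)
        return self.cache[key]

# Stub standing in for a real vLLM-backed extraction call.
def fake_vlm(image_bytes: bytes, prompt: str) -> dict:
    return {"background": "white", "main_object": "sneaker"}

extractor = CachedExtractor(fake_vlm, model_version="vlm-lora-v3")  # hypothetical version tag
img = b"...fake image bytes..."
facts1 = extractor.extract(img, "Describe the product.")
facts2 = extractor.extract(img, "Describe the product.")  # cache hit: no second model call
```

The same product image often recurs across catalog imports, which is why keying on content (rather than URL or product ID) is what makes the cost amortization effective.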
