KAFKA SUMMIT APAC

July 27 - 28, 2021

Building More Reliable Data Pipelines for Nearmap's Deep Learning Models: An Evolutionary Case Study

Date : July 28, 2021

Time : 02:00 PM - 02:45 PM

Continual learning using a continually evolving dataset is the norm for the AI team at Nearmap. We have had a software system & data pipelines to facilitate the management of this ever-growing dataset in place for several years of operation. During that time, both our needs & the system have evolved – we improvised and learned from early limitations & challenges. One of the biggest challenges of MLOps is building data systems right! Reliable, Fault-tolerant, & continually flowing pipelines are the foundation, with necessary additional capabilities for data quality control, reconciliations, & lineage/tracking. Based on our learnings, we have rebuilt a new generation of our system (based on Kafka) with one aim – the much discussed ""operation vacation"". The aim is to facilitate full automation and zero manual intervention of the system. In this session, we will go into details of the challenges we encountered, the lessons we learned, what we improved, and lastly; are we on vacation yet?

Speakers

Suneeta Mall

Principal Machine Learning Engineer, Nearmap

Samanvay Karambhe

Data Scientist, Nearmap

Privacy Policy | Terms & Conditions,
Apache, Apache Kafka, Kafka, Apache Flink, Flink and associated open source project names are trademarks of the Apache Software Foundation.
The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event
Copyright © Confluent, Inc. 2016 - 2024

#kafkasummit