OLake by Datazip invites you to their event

Best practices for migrating to Apache Iceberg

Thursday, November 21st 2024 - 3:00 PM (GMT)

The event is over. See you in the next one.

About this event

[Key highlights]

  • Diving into File formats, compression strategies, and write patterns 
  • Practical guidance on Merge on Read (MoR) vs Copy on Write (CoW) implementation
  • Essential configurations for maintenance and monitoring 
  • Benchmarks [duration & cost] compared with Amazon EMR Trino, Snowflake, Snowflake Iceberg, Starburst, Athena

[Further learn] 

  • How to select tables for migration and assess critical queries
  • Optimal compaction strategies (BinPack, Sort, Z-order)
  • Key configurations for production deployment
  • Monitoring best practices using Iceberg virtual tables

Hosted by

  • Team member
    T
    Harsha Kalbalia GTM @ Datazip | Founding Member @ Datazip

    Harsha is a user-first GTM specialist at Datazip, transforming early-stage startups from zero to one. With a knack for technical market strategy and a startup enthusiast's mindset, she bridges the gap between innovative solutions and meaningful market adoption.

  • Guest speaker
    G
    Amit Gilad Data Engineer

    Amit Gilad, a Data Engineer who's been actively working with Apache Iceberg and data lakes. Currently leading data engineering in stealth, he previously worked as a data engineer at Cloudinary. He has hands-on experience with EMR, Athena, and Spark, and recently shared insights about Iceberg implementations without Spark at the Chill Data Summit.

  • Guest speaker
    G
    Yonatan Dolan Principal Analytics Specialist @ AWS

    Yonatan Dolan, a Principal Analytics Specialist at AWS, focusing on Big Data & Analytics in Israel. He's an Apache Iceberg evangelist and actively drives data lake innovations. Before AWS, he led Intel's Pharma Analytics Platform, developing edge-to-cloud AI solutions for clinical trials, and spent 9 years driving advanced analytics projects at Intel.

OLake by Datazip

Composable Lakehouse Platform for 10X Data Engineering Productivity

OLake is built as the fastest replication of Databases into Data Lakehouse (currently, Apache Iceberg)