Technical Talks

Sumedh Sakdeo
Senior Staff Software Engineer | LinkedIn
Jesus Camacho
Principal Engineering Manager | Microsoft

Optimizing Iceberg Table Layouts at Scale: A Multi-Objective Approach

ABOUT THE TALK
  • Data Eng & Infrastructure

Optimizing Iceberg Tables: Advanced Data Layout Strategies for Enterprise Data Lakes

This talk presents data layout optimization techniques for managing large-scale Apache Iceberg deployments with 100K+ tables. It takes a multi-objective approach to optimization, balancing storage efficiency against query performance through intelligent file management and compaction strategies. The session covers the practical implementation of table scoring algorithms, automated optimization workflows, and real-world performance insights from the OpenHouse deployment. It also includes detailed case studies and benchmarks run with LST-Bench, demonstrating measurable improvements in query performance and storage efficiency.

Sumedh Sakdeo

Senior Staff Software Engineer

LinkedIn

Sumedh Sakdeo is a Senior Staff Software Engineer at LinkedIn with 15+ years of experience in data infrastructure. He previously contributed to a log-structured file system at Tintri and led data infrastructure, engineering, and visualization for Lyft's Autonomous Vehicles division. At LinkedIn, he spearheaded the development of OpenHouse, an open-source control plane for Apache Iceberg tables that enables efficient table maintenance and data layout optimization at scale.

Jesus Camacho

Principal Engineering Manager

Microsoft

Jesús Camacho Rodríguez is a Principal Engineering Manager at Microsoft, where he leads a team focused on query optimization and processing for Fabric Data Warehouse. Previously, he led a team in the Gray Systems Lab (GSL), driving innovation in data systems performance, interoperability, and benchmarking across Azure Data's product portfolio. Before Microsoft, he worked at Cloudera on query optimization for SQL data warehouse engines. A member of the Apache Software Foundation, he has made significant contributions to open-source projects such as Apache Calcite and Apache Hive and, more recently, has focused on projects in the table formats space, including LST-Bench and Apache XTable (incubating). He holds a PhD from the University of Paris-Sud and Inria, France.