Data Council Blog
Pushing Kafka to the Limit at Heroku
How Everyone's Favorite PaaS Operates Kafka at Scale
Because of the scope of Heroku's challenges, Software Engineer of Data Infrastructure, Jeff Chao, experienced a multitude of fascinating failure scenarios with Kafka that most of us would likely never see in our typical implementations. For example, Jeff discovered that there are a variety of situations where brokers can enter into cascading failure and eventually render clusters completely unavailable. He learned that the key to preventing these failures is to ensure that when one broker fails remaining brokers are resilient enough to be able to take on additional partitions from the downed broker. Although this might seem somewhat obvious in theory, Jeff discovered that in practice there are many details that can easily be overlooked which he will cover in depth in his talk.
Meet Jeff Chao of Heroku
Jeff Chao is a software engineer at Heroku where he is a member of the Department of Data. Prior to Heroku, he worked on streaming data processing systems where he built event processing engines, data ingestion pipelines, and pubsub services. Jeff's current work involves delivering and maintaining Apache Kafka as a service.
To up your data pipeline game, and learn how Jeff and the data team pushed the limits of Kafka at Heroku, check out the full talk at DataEngConf SF '17.
Subscribe to Email Updates
Receive relevant content, news and event updates from our community directly into your inbox. Enter your email (we promise no spam!) and tell us below which region(s) you are interested in:
Fresh Posts
Categories
- Analytics (15)
- Apache Arrow (3)
- Artificial Intelligence (7)
- Audio Research (1)
- big data (7)
- BigQuery (2)
- Careers (2)
- Data Discovery (2)
- data engineer salary (1)
- Data Engineering (46)
- Data Infrastructure (2)
- Data Lakes (1)
- Data Pipelines (6)
- Data Science (33)
- Data Strategy (14)
- Data Visualization (6)
- Data Warehouse (10)
- Data Warehousing (2)
- Databases (4)
- datacoral (1)
- disaster management (1)
- Event Updates (12)
- functional programming (1)
- Learning (1)
- Machine Learning (18)
- memsql (1)
- nosql (1)
- Open Source (21)
- ops (1)
- postgresql (1)
- Redshift (1)
- sharding (1)
- Snowflake (1)
- Speaker Spotlight (5)
- SQL (2)
- Startups (12)