1. Apache Kafka at Rocana: Persistent Machine Data Collection at Scale
2. Who am I?
Platform Engineer, based in Ottawa
alan@rocana.com • @alanctgardner
3. Working at Rocana
4. Rocana Ops
5. Kafka Principles
6. History
• Designed at LinkedIn
• Documented in a 2013 blog post by Jay Kreps
• LinkedIn moved from a monolith to multiple data stores and services
https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
7. Complexity
https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
8. Complexity
9. Centralized Data Bus
https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
10. Centralized Data Bus
11. Design Goals
A centralized data bus that:
• Scales horizontally
• Delivers (some) events in order
• Decouples producers and consumers
• Has low latency end-to-end
12. A Horizontally Scalable Log
https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
13. Asynchronous Consumers
14. Low-Latency, Durable Writes
• Kafka writes all events to disk
• Events are stored on disk in the same format as the wire protocol
• Zero-copy reads and writes avoid events ever entering user space (a sketch follows)
• Kafka relies on the page cache for low-latency serving of recent events
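The zero-copy point refers to sendfile-style transfers: in Java that is FileChannel.transferTo, which Kafka's own design documentation describes as the mechanism for moving log-segment bytes from the page cache to a socket. A generic sketch of the technique, not Kafka's actual code; the file name and port are placeholders:

    import java.io.FileInputStream;
    import java.net.InetSocketAddress;
    import java.nio.channels.FileChannel;
    import java.nio.channels.SocketChannel;

    public class ZeroCopy {
        public static void main(String[] args) throws Exception {
            try (FileChannel segment = new FileInputStream("segment.log").getChannel();
                 SocketChannel sock = SocketChannel.open(new InetSocketAddress("localhost", 9092))) {
                // transferTo delegates the copy to the kernel (sendfile on Linux):
                // bytes move page cache -> socket without ever entering user space.
                long sent = segment.transferTo(0, segment.size(), sock);
            }
        }
    }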
15. Putting it all together
16. Our Experience
17.
18. Resource Constraints
• Customer machines are doing real work
• Agent footprint must be small
• Can't depend on availability of back-end services
• Batching is crucial (a producer sketch follows)
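A minimal sketch of what batching looks like with the new Java producer that shipped in Kafka 0.8.2. The broker address, topic name and tuning values are illustrative assumptions, not Rocana's production settings:

    import java.util.Properties;
    import org.apache.kafka.clients.producer.KafkaProducer;
    import org.apache.kafka.clients.producer.ProducerRecord;

    public class BatchingAgent {
        public static void main(String[] args) {
            Properties props = new Properties();
            props.put("bootstrap.servers", "broker1:9092"); // placeholder address
            props.put("key.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
            props.put("value.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
            // Batching knobs: buffer up to 64 KB per partition, or wait up to 50 ms,
            // whichever comes first. Bigger batches amortize per-request overhead,
            // which keeps the agent's footprint small on busy customer machines.
            props.put("batch.size", "65536");
            props.put("linger.ms", "50");
            props.put("acks", "1"); // leader-only ack: low latency at some durability cost

            KafkaProducer<byte[], byte[]> producer = new KafkaProducer<>(props);
            byte[] event = "host=web01 service=nginx body=...".getBytes();
            // send() is asynchronous: it appends to an in-memory batch and returns,
            // so the agent never blocks on the network per event.
            producer.send(new ProducerRecord<byte[], byte[]>("events", event));
            producer.close(); // flushes any outstanding batches
        }
    }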
19. Independent Consumers
• Consumers aren't coupled to each other
• Maintenance and upgrades are simplified
• Horizontal scale per consumer (a sketch follows)
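A sketch of that decoupling using the 0.8-era high-level consumer; the ZooKeeper address, group and topic names are hypothetical. Each downstream system runs under its own group.id, so each tracks its own offsets and can lag, restart or upgrade without affecting the others:

    import java.util.Collections;
    import java.util.List;
    import java.util.Map;
    import java.util.Properties;
    import kafka.consumer.Consumer;
    import kafka.consumer.ConsumerConfig;
    import kafka.consumer.ConsumerIterator;
    import kafka.consumer.KafkaStream;
    import kafka.javaapi.consumer.ConsumerConnector;

    public class IndexerConsumer {
        public static void main(String[] args) {
            Properties props = new Properties();
            props.put("zookeeper.connect", "zk1:2181");  // placeholder ZK address
            // Offsets are tracked per group.id: a "metrics" group and a
            // "solr-indexer" group read the same partitions at independent rates.
            props.put("group.id", "solr-indexer");
            props.put("auto.offset.reset", "smallest");  // new groups start at the oldest retained event

            ConsumerConnector connector =
                Consumer.createJavaConsumerConnector(new ConsumerConfig(props));
            Map<String, List<KafkaStream<byte[], byte[]>>> streams =
                connector.createMessageStreams(Collections.singletonMap("events", 1));
            ConsumerIterator<byte[], byte[]> it = streams.get("events").get(0).iterator();
            while (it.hasNext()) {
                byte[] event = it.next().message();
                // hand the event to this consumer's sink (e.g. Solr)
            }
        }
    }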
20. Vendor Support
21.
22. Shamelessly stolen from https://aphyr.com/
23. Ephemeral Source / Durable Source

Ephemeral source (syslog):
{
  "syslog_arrival_ts": "1444489076463",
  "syslog_conn_dns": "localhost",
  "syslog_conn_port": "57788",
  "body": …,
  "id": "KLE5GZF7WB2WSA5…",
  …
}

Durable source (tailed file):
{
  "tailed_file_inode": "2371810",
  "tailed_file_offset": "384930",
  "timestamp": "",
  "body": …,
  "id": "73XXMLRJNHKA76…",
  …
}
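Per the speaker notes, the id is a hash of event fields, used for partition assignment, the deduplication filter, and idempotent inserts into Solr. A minimal sketch of the durable-source case; the field choice mirrors the slide, but the hash algorithm and hex encoding are assumptions:

    import java.nio.charset.StandardCharsets;
    import java.security.MessageDigest;

    public class EventId {
        // Hashing (inode, offset) gives a stable id: re-reading the same file
        // bytes after an agent crash reproduces the same id, so downstream
        // consumers can detect and drop the duplicate.
        static String durableId(String inode, String offset) throws Exception {
            MessageDigest md = MessageDigest.getInstance("SHA-1");
            md.update(inode.getBytes(StandardCharsets.UTF_8));
            md.update(offset.getBytes(StandardCharsets.UTF_8));
            StringBuilder sb = new StringBuilder();
            for (byte b : md.digest()) sb.append(String.format("%02X", b));
            return sb.toString();
        }

        public static void main(String[] args) throws Exception {
            System.out.println(durableId("2371810", "384930"));
        }
    }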
24. Durability
25. Unclean Elections
• Kafka maintains a set of up-to-date replicas in ZK: the "in-sync replicas", or ISR
• The ISR can dynamically grow or shrink; by default Kafka will accept writes with a single in-sync replica
• It is possible for the ISR to shrink to 0 nodes, which leads to either:
  • partition unavailability until an in-sync replica returns to life
  • OR data loss when an out-of-sync node begins accepting writes
• This is tunable with the "unclean leader election" property, which defaults to true in 0.8.2 (a config sketch follows)
http://blog.empathybox.com/post/62279088548/a-few-notes-on-kafka-and-jepsen
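A sketch of the broker settings involved, assuming the 0.8.2 property names; the values shown trade availability for consistency, which is the opposite of the default behavior the slide describes:

    # server.properties (sketch)
    # Never elect an out-of-sync replica as leader: the partition stays
    # unavailable until an in-sync replica returns, but no acked write is lost.
    unclean.leader.election.enable=false
    # For producers using acks=-1, reject writes unless at least
    # 2 replicas are in sync.
    min.insync.replicas=2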
26.
27. Schema Versioning
• Schemas are absolutely necessary
• Have a plan for how to evolve the schema before v1 (an Avro example follows)
• A schema registry is a good investment
http://www.confluent.io/blog/schema-registry-kafka-stream-processing-yes-virginia-you-really-need-one
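The speaker notes mention Rocana uses Avro, so here is a generic sketch of what an evolution plan can look like; the record and field names are hypothetical, not Rocana's schema. Giving the new field a default means readers on the v2 schema can still decode v1 records:

    {
      "type": "record",
      "name": "Event",
      "fields": [
        {"name": "id",   "type": "string"},
        {"name": "body", "type": "bytes"},
        {"name": "location", "type": ["null", "string"], "default": null}
      ]
    }

The "location" field is the v2 addition. Avro resolution still requires the writer's schema to decode a record, which is exactly the gap a schema registry fills.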
28. Security
• No encryption or authentication in 0.8.x
• stunnel or encryption at the app layer are possible workarounds (a sketch follows)
• Should be fixed in 0.9.0
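A sketch of the app-layer workaround: encrypt each event body before producing it and decrypt in the consumer, so the broker only ever handles opaque bytes. Key handling is deliberately waved away; generating the key inline is purely for illustration:

    import javax.crypto.Cipher;
    import javax.crypto.KeyGenerator;
    import javax.crypto.SecretKey;

    public class AppLayerCrypto {
        public static void main(String[] args) throws Exception {
            // In reality the key must reach agents and consumers out of band,
            // and a cipher mode with an IV should be chosen deliberately.
            SecretKey key = KeyGenerator.getInstance("AES").generateKey();

            Cipher enc = Cipher.getInstance("AES");
            enc.init(Cipher.ENCRYPT_MODE, key);
            byte[] ciphertext = enc.doFinal("event body".getBytes());
            // ...produce ciphertext to Kafka; the broker sees only opaque bytes...

            Cipher dec = Cipher.getInstance("AES");
            dec.init(Cipher.DECRYPT_MODE, key);
            byte[] plaintext = dec.doFinal(ciphertext);
        }
    }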
29. Replication
• Cross-DC clusters are not recommended
• Kafka includes MirrorMaker for replication between two clusters
• Replication is asynchronous
• Offsets aren't consistent between clusters, so consumers can't fail over
30. Operations
• Everything is manual:
  • Rebalancing partitions
  • Rebalancing leaders
  • Decommissioning nodes
• Watch for lagging consumers
31. Sizing
• Consider both throughput and retention time
• Overprovision the number of partitions
• Rebalancing is easy, but re-sharding breaks consistent hashing (a sketch follows)
http://www.confluent.io/blog/how-to-choose-the-number-of-topicspartitions-in-a-kafka-cluster/
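A minimal illustration of why growing the partition count "breaks consistent hashing": keyed events are routed by hash modulo partition count, so changing the count silently reroutes existing keys, along with any per-key ordering or dedup state. The hash below stands in for the partitioner's real function; the key value is hypothetical:

    public class PartitionRemap {
        // hash-mod-N routing, the shape of Kafka's default keyed partitioning
        static int partition(String key, int numPartitions) {
            return (key.hashCode() & 0x7fffffff) % numPartitions;
        }

        public static void main(String[] args) {
            String key = "host-042";
            System.out.println(partition(key, 24)); // route with 24 partitions...
            System.out.println(partition(key, 48)); // ...usually different with 48
        }
    }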
32. Performance
• Jay Kreps ran an on-premises benchmark
• 18 spindles and 18 cores across 3 boxes could absorb 2.5M events/sec
• Aggressive batching is necessary
• Requiring synchronous ACKs from all replicas halved throughput
https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines
33. Performance
• Reproduced on AWS with 3- and 5-node clusters
• d2.xlarge broker nodes: 3 spindles, 4 cores, 30.5 GB RAM
• 5 producers on m3.xlarge instances
• 3 nodes accepted 2.6M events/s (24 partitions, one replica, one ACK); dropped to 1.7M with 3x replication and 1 ACK
• 5 nodes accepted 3.6M events/s (48 partitions, one replica, one ACK); dropped to 2.16M with 3x replication and 1 ACK
34. Thank You!


Editor's Notes

  • #2 I'm Alan Gardner, here to talk about our use of Apache Kafka at Rocana.
  • #3 Platform engineer at Rocana. I work on data ingest, storage and processing; distributed open-source systems (Hadoop, Kafka, Solr); and systems programming work as well. I work remotely from Ottawa, Canada. This is my cat.
  • #4 Working at Rocana is great: everybody is remote; very smart, very nice people; quarterly onsites.
  • #5 What is Rocana Ops? A platform for IT operations data, designed for tens of thousands of servers in multiple data centers. It distills the entire organization's IT infrastructure down to a single screen: "what's wrong?" A scalable collection framework covers host data and app logs out of the box; the event data warehouse is built on open source technologies and open schemas; visualization, anomaly detection and machine learning provide guided root cause analysis, as opposed to a wall of graphs or a pile of logs. Apache Kafka is the "Enterprise Data Bus"; I'm going to talk about why we chose Kafka in that role.
  • #6 To explain why we chose Kafka, I’m going to start with how Kafka works and why it’s designed the way it is.
  • #7 Designed at LinkedIn to handle the explosion of different systems being created. Jay Kreps' blog post describes Kafka from first principles, including motivation. Some of these images are cribbed from that post, where appropriate.
  • #8 LinkedIn's problem: lots of front-end services, lots of back-end services; hooking them together produces this complex spaghetti of dependencies. The front-end has to be highly available and low-latency; if you write synchronously, you can only be as fast as your slowest backend service.
  • #10 Kafka acts as a central bus for data: every front-end service writes all events into Kafka; backend services can take only the events they're interested in; data doesn't live in Kafka forever. Kafka is run as a utility within LinkedIn. This solves one goal, a centralized data bus; we still need horizontal scale and durability.
  • #11 This is much better
  • #13 Kafka is fundamentally a collection of logs: events are only appended, and events are always consumed in the same order. A single partition is a log: an ordered set of events, where every event has an offset. Partitions are the units of scale, like shards. Log operations are constrained, so we can make them fast. The example is sharding on users.
  • #14 Consuming and producing are completely decoupled: consumers maintain their own logical offset, representing the last event they consumed; different consumers can consume at different rates; producers continue to append new events in order. Events are retained until an expiry time or a max log size; Kafka is not a durable long-term store. Consumers can go offline for extended time, or start from scratch and consume all available events. Events are durably written and replicated.
  • #15 Kafka writes all data to disk, with lots of good tricks: low latency for recent data from the page cache; data on the wire is the same as on disk; no GC overhead for the page cache; zero-copy ops.
  • #16 This is an overview of a typical Kafka system: multiple producers, brokers and consumers. Each broker has ownership of a set of partitions (it's the primary); broker lists and partition assignment are stored in ZK. Consumers are using ZK to store offsets here, but that's not the only way.
  • #18 Let's revisit the Rocana architecture: thousands of agents writing into Kafka; events are distributed across multiple partitions, written durably to disk; multiple, separate consumers are decoupled from the producers and each other.
  • #19 Resource limits on producer machines: these machines are doing real work that's important to the business, so our agent needs to quickly encode events and produce them. Batching is important to ensure efficiency; latency to write to Kafka is still very low.
  • #20 Consumers don't affect each other: each maintains its own offsets; one consumer can be taken offline, can be slow, etc. with little impact; upgrades are very easy; a single consumer can even be rewound (theoretically); consumers can scale horizontally with the number of partitions.
  • #21 Kafka has critical mass within the industry: Cloudera, Hortonworks and MapR all support it, and Confluent has all the designers of Kafka working on a commercial stream processing platform.
  • #22 Those are all good things, but there are some sharp edges to watch out for.
  • #23 Kingsbury tire fire slide. Exactly-once delivery is very hard, and not all of our consumers are doing something idempotent. You can play back the whole partition to find the last message which was written.
  • #24 Overview of a Rocana Event which would be published into Kafka: fixed fields and key-value pairs. The ID is a hash of an event's fields, used for duplicate detection. For durable sources we can use offset and inode, which gets 99% of the way; for ephemeral sources we use arrival time + internal fields. The ID is used for three things: assignment to a partition, the deduplication filter, and the ID in Solr for idempotent inserts.
  • #25 Kafka "writes every message to disk", but it defaults to fsyncing every 10k messages, or every 3 seconds (at most); the ACK happens when a message is written but not fsynced. OK, so I'll replicate data across multiple machines.
  • #26 The default in Kafka is to continue making progress in the presence of node failures (AP): unclean elections allow a replica which has not seen all writes to become the leader when the ISR shrinks to 0; the minimum ISR size to accept writes is only 1 by default; when a previously in-sync replica comes back, the records only it had are lost. It can be disabled; see Jay's blog for more discussion.
  • #27 Some things aren't hard, but you need to look out for them:
  • #28 Data you put in Kafka really needs to have a schema; schemas really need to have an evolution strategy; and you probably want some notion of a schema registry. Gwen's post is great. We use Avro, where the consumer has to know the writer schema; we tried to mitigate this with nullable fields, no luck.
  • #29 There isn't any security: no encryption on disk or in flight, and no authentication. You can use stunnel, or you could encrypt each byte buffer and decrypt on the client side. These will probably both be fixed in 0.9.0 this month.
  • #30 MirrorMaker is basically just a consumer/producer which pumps data between clusters: it doesn't preserve offsets, so consumers can't fail over; you can send events between two different-sized clusters; you can merge streams from two data centres.
  • #31 Kafka operations are pretty basic; it comes with a giant `bin` dir full of tools, including a CLI for rebalancing partitions and leaders. Leaders and partitions rebalance on node failure; adding nodes requires reassignment; decommissioning nodes is a giant pain right now. There is a tool for finding lagging consumers.
  • #32 Factors to consider when sizing a cluster: I/O throughput; retention time frame (throughput over time); partitions limit the concurrency of consumers; future growth (in terms of setting the number of partitions). Growing a cluster online is manual but possible in 0.8.2; growing the number of partitions breaks consistent hashing! (http://www.confluent.io/blog/how-to-choose-the-number-of-topicspartitions-in-a-kafka-cluster/)
  • #33 Jay Kreps has a blog post about this, using 3 commodity broker boxes with 6 cores and 6 spindles each. It's a little weird: he only uses 6 partitions, so he never exercises all the spindles in the cluster, and he batches small messages really aggressively (8k batches of 100-byte messages). His setup is on-premises; he hits 2.5M records/sec producing and consuming. Requiring 3 acks for every message halved throughput.
  • #34 I used a similar methodology on EC2 to get some sizing numbers. I used 4k batch sizes; results were broadly similar (1k and 2k hurt perf). Over-provisioning partitions by 2x spindles doesn't give a benefit, but doesn't slow things down either; over-provisioning by 2x and adding 3x replication did cause a slowdown. One partition actually hit 700k events/s; there may be coordination issues in the producer. Synchronous acks were brutal, a 10x performance hit; this is almost definitely due to AWS network latency. Each node is ~$500/month. At 250MB/sec we'd only get ~18 hours of retention; we've seen instances of only 12 hours of retention.