Apache Kafka, Samza, and the Unix Philosophy of Distributed Data

沐沐2024年11月26日约 246 字小于 1 分钟

Apache Kafka, Samza, and the Unix Philosophy of Distributed Data

注意

如果pdf无法加载，请使用 Google Chrome 浏览器重试。

Apache Kafka, Samza, and the Unix Philosophy of Distributed Data

One of the things I realised while doing research for my book is that contemporary software engineering still has a lot to learn from the 1970s.
As we’re in such a fast-moving field, we often have a tendency of dismissing older ideas as irrelevant – and consequently, we end up having to learn the same lessons over and over again, the hard way. Although computers have got faster, data has got bigger and requirements have become more complex, many old ideas are actually still highly relevant today.
In this blog post I’d like to highlight one particular set of old ideas that I think deserves more attention today: the Unix philosophy.
I’ll show how this philosophy is very different from the design approach of mainstream databases,
and explore what it would look like if modern distributed data systems learnt a thing or two from Unix.

open it, Apache Kafka, Samza, and the Unix Philosophy of Distributed Data (2015).