Streaming Systems
by Tyler Akidau, Slava Chernyak, Reuven Lax
ISBN: 9781491983867
Found in 2 comments on Hacker News
dtoma · 2024-01-24 · Original thread
The "Streaming Systems" book answers your question and more: https://www.oreilly.com/library/view/streaming-systems/97814.... It walks through the history of how batch processing started with MapReduce, and how attempts to scale by moving toward streaming gave us the subsequent frameworks (Spark, Beam, etc.).

As for the framework called MapReduce, it isn't used much, but its descendant https://beam.apache.org very much is. Nowadays people often use "map reduce" as a shorthand for whatever batch processing system they're building on top of.

This is a very good book! Lately I have been very happy with a number of the more foundational data books from O’Reilly:

Designing Data-Intensive Applications https://dataintensive.net/

Streaming Systems http://shop.oreilly.com/product/0636920073994.do

and this one.