Found in 2 comments on Hacker News
dtoma · 2024-01-24 · Original thread
The "streaming systems" book answers your question and more: https://www.oreilly.com/library/view/streaming-systems/97814.... It gives you a history of how batch processing started with MapReduce, and how attempts at scaling by moving towards streaming systems gave us all the subsequent frameworks (Spark, Beam, etc.).

As for the framework called MapReduce, it isn't used much, but its descendant https://beam.apache.org very much is. Nowadays people often use "map reduce" as a shorthand for whatever batch processing system they're building on top of.

This is a very good book! Recently, I am very happy with a number of more foundational data books from O’Reilly:

Designing Data-Intensive Applications https://dataintensive.net/

Streaming Systems http://shop.oreilly.com/product/0636920073994.do

and this one.