Reading Assignment 10: Distributed Data Processing
Mandatory papers to read:
- S. Ghemawat, H. Gobioff, S. Leung, The Google File System, SOPS 2003
- J. Dean and S. Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, OSDI 2004
- M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M.J. Franklin, S. Shenker, I. Stoica, Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-MemoryCluster Computing,NSDI 2012
Read all of the papers listed above and select two of the papers to write a report for.
Due date for your report: November 20, 2025. No deadline extensions are possible. Bring your printed report to the lecture.
Detailed instructions can be found here.