A Reading List for introduction to Cloud Computing technologies
If you are looking into cloud infrastructures, and a basic understanding of how existing cloud computing frameworks work and the functionality they provide, I would suggest going through Salman. A. Baset’s tutorial from SoCC 2012.
Conferences:
- Top Tier Distributed Systems Conferences: ICDCS, SoCC
- Related Systems Conferences: OSDI, SOSP, NSDI, Usenix ATC
Section 1: Introduction and Overview
- Above the Clouds: A Berkeley View of Cloud Computing: M. Armbrust, A. Fox, R. Griffith, A.D. Joseph, R.H. Katz, A. Konwinski, G. Lee, D.A. Patterson, A. Rabkin, I. Stoica, M. Zaharia, Tech. Rep. UCB/EECS-2009-28, Feb 10, 2009.
- NIST definition of cloud computing (v15), P. Mell and T. Grance. May 2010.
- 5 Lessons We’ve Learned Using AWS , John Ciancutti, Netflix blog, December 16, 2010.
- Overview of Open Source Cloud Technologies: http://www.cs.columbia.edu/~salman/publications/tutorial-socc12.pdf
Section 2: Cloud Infrastructures
- CloudCmp: Comparing Public Cloud Providers: Ang Li, Xiaowei Yang, Srikanth Kandula, Ming Zhang, Internet Measurement Conf., November 2010.
- The Eucalyptus Open-source Cloud-computing System: D. Nurmi, R. Wolski, C. Grzegorczyk, G. Obertelli, S. Soman, L. Youseff and D. Zagorodnov, CCGrid: The 9th IEEE International Symposium on Cluster Computing and the Grid, May 2009.
Technologies
- IaaS: Infrastructure as a Service: Cloud Stack, Eucalyptus, Open Nebula, OpenStack, Nimbus
- PaaS: OpenShift, CloudFoundary, AppScale
- SaaS: Drupal, Joomla
Section 3: Data Storage
- Finding a Needle in Haystack: Facebook’s Photo Storage: D. Beaver, S. Kumar, H.C. Li, J. Sobel, and P. Vajgel, OSDI, October 2010.
- Needle in a haystack: efficient storage of billions of photos: Peter Vajgel, Facebook note, April 30, 2009.
- Dynamo: Amazon’s Highly Available Key-value Store: G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels, SOSP, October 2007.
- Eventually Consistent - Revisited, W. Vogels, Amazon Inc.
- BigTable: A Distributed Storage System for Structured Data, F. Chang, J. Dean, S. Ghemawat, W. Hsieh, D. Wallach, M. Burrows, T. Chandra, A. Fikes, R. Gruber, ACM Trans. on Computer Systems (TOCS), June 2008
- The Google File System, Ghemawat, Sanjay, Howard Gobioff, and Shun-Tak Leung, ACM SIGOPS Operating Systems Review. Vol. 37. No. 5. ACM, 2003.
Section 4: Programming Frameworks
MapReduce: Simplified Data Processing on Large Clusters: J. Dean and S. Ghemawat, OSDI 2004.
Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks: M. Isard, M. Budiu, Y. Yu, A. Birrrell, D. Fetterly
Pig latin: a not-so-foreign language for data processing: C. Olston, B. Reed, U. Srivastava, R. Kumar, A. Tomkins, SIGMOD 2008.
DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language,Y. Yu, et al.
Section 5: Datacenter Routing and Networks
VL2: A Scalable and Flexible Data Center Network: A. Greenberg, J. Hamilton, N. Jain, S. Kandula, C. Kim, P. Lahiri, D. Maltz, P. Patel, S. Sengupta, SIGCOMM 2009
Portland: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric: R. Mysore, A. Pamboris, N. Farrington, N. Huang, P. Miri, S. Radhakrishnan, V. Subramanya, A. Vahdat, SIGCOMM 2009
Profiling Network Performance for Multi-tier Data Center Applications: Minlan Yu, Albert Greenberg, Dave Maltz, Jennifer Rexford, Lihua Yuan, Srikanth Kandula, Changhoon Kim, NSDI'11: Proceedings of the 8th USENIX Symposium on Networked Systems Design and Implementation, 2011.
Sharing the Data-Center Network, NSDI'11_:_Alan Shieh, Srikanth Kandula, Albert Greenberg, Changhoon Kim, Proceedings of the 8th USENIX Symposium on Networked Systems Design and Implementation, 2011.
Data center TCP (DCTCP): Alizadeh, Mohammad, Greenberg, Albert, Maltz, David A., Padhye, Jitendra, Patel, Parveen, Prabhakar, Balaji, Sengupta, Sudipta, Sridharan, Murari, SIGCOMM Comput. Commun. Rev., Vol. 40, pp. 63–74, ACM, New York, NY, USA, August 2010
ElasticTree: Saving Energy in Data Center Networks, B. Heller, S. Seetharaman, P. Mahadevan, Y. Yiakoumis, P. Sharma, S. Banerjee, N. McKeown, Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation, 2010
Hedera: Dynamic Flow Scheduling for Data Center Networks, M. Al-Fares, S. Radhakrishnan, B. Raghavan, N. Huang, A. Vahdat, NSDI'10: Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation, 2010.
COTS Data-Center Ethernet for Multipathing over Arbitrary Topologies, J. Mudigonda, P. Yalagandula, M. Al-Fares, J. C. Mogul, SPAIN, NSDI'10: Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation, 2010.
Section 6: OS and OS Virtualizations
Barham, Paul, Dragovic, Boris, Fraser, Keir, Hand, Steven, Harris, Tim, Ho, Alex, Neugebauer, Rolf, Pratt, Ian, Warfield, Andrew, Xen and the art of virtualization,SOSP ‘03: Proceedings of the nineteenth ACM symposium on Operating systems principles, pp. 164–177, ACM, New York, NY, USA, 2003.
Stephen Soltesz, Herbert Pötzl, Marc E. Fiuczynski, Andy Bavier, Larry Peterson, Container-based operating system virtualization: a scalable, high-performance alternative to hypervisors, In: Proceedings of EuroSys 2007, pp. 275–288, 2007.
Wentzlaff, David, Gruenwald,III, Charles, Beckmann, Nathan, Modzelewski, Kevin, Belay, Adam, Youseff, Lamia, Miller, Jason, Agarwal, Anant, An operating system for multicore and clouds: mechanisms and implementation, Proceedings of the 1st ACM symposium on Cloud computing, pp. 3–14, ACM, New York, NY, USA, 2010.
Kansal, Aman, Zhao, Feng, Liu, Jie, Kothari, Nupur, Bhattacharya, Arka A., Virtual machine power metering and provisioning, Proceedings of the 1st ACM symposium on Cloud computing, pp. 39–50, ACM, New York, NY, USA, 2010.
Section 7: Security in the Cloud
I. Roy, S. Setty, A. Kilzer, V. Shmatikov, E. Witchel, Airavat: Security and Privacy for MapReduce, NSDI'10: Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation, 2010
Andreas Haeberlen, Paarijaat Aditya, Rodrigo Rodrigues, Peter Druschel, Accountable Virtual Machines, OSDI ‘10: Proceedings of the 8th Symposium on Operating Systems Design and Implementation, USENIX Association, San Diego, CA, 2010
Ariel J. Feldman, William P. Zeller, Michael J. Freedman, Edward W. Felten, [SPORC: Group Collaboration using Untrusted Cloud Resources](http://SPORC: Group Collaboration using Untrusted Cloud Resources), OSDI ‘10: Proceedings of the 8th Symposium on Operating Systems Design and Implementation, USENIX Association, San Diego, CA, 2010.
Prince Mahajan, Srinath Setty, Sangmin Lee, Allen Clement, Lorenzo Alvisi, Mike Dahlin, Michael Walfish, Depot: Cloud Storage with Minimal Trust, OSDI ‘10: Proceedings of the 8th Symposium on Operating Systems Design and Implementation, USENIX Association, San Diego, CA, 2010.
Roxana Geambasu, Tadayoshi Kohno, Amit Levy, Henry M. Levy. “Vanish: Increasing Data Privacy with Self-Destructing Data.” In Proceedings of the 18th USENIX Security Symposium, Montreal, Canada, August 2009
Section 8: Resource Management in a Cloud
A. Adya, J. Dunagan, A. Wolman, Centrifuge: Integrated Lease Management and Partitioning for Cloud Services, NSDI'10: Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation, 2010
Sharad Agarwal, John Dunagan, Navendu Jain, Stefan Saroiu, Alec Wolman, Volley: Automated Data Placement for Geo-Distributed Cloud Services, NSDI'10: Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation, 2010
Z. Li, M. Zhang, Z. Zhu, Y. Chen, A. Greenberg, Y.-M. Wang, WebProphet: Automating Performance Prediction for Web Services, NSDI'10: Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation, 2010.
Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy Katz, Scott Shenker, Ion Stoica, Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center, NSDI'11: Proceedings of the 8th USENIX Symposium on Networked Systems Design and Implementation, 2011
Karagiannis, Thomas, Gkantsidis, Christos, Narayanan, Dushyanth, Rowstron, Antony, Hermes: clustering users in large-scale e-mail services, Proceedings of the 1st ACM symposium on Cloud computing, pp. 89–100, ACM, New York, NY, USA, 2010
Hajjat, Mohammad, Sun, Xin, Sung, Yu-Wei Eric, Maltz, David, Rao, Sanjay, Sripanidkulchai, Kunwadee, Tawarmalani, Mohit, Cloudward bound: planning for beneficial migration of enterprise applications to the cloud, Proceedings of the ACM SIGCOMM 2010 conference on SIGCOMM, pp. 243–254, ACM, New York, NY, USA, 2010.
Section 9: Monitoring in the Cloud
- Nagios: http://www.nagios.org/
- Ganglia: http://ganglia.sourceforge.net/
Section 10: Reliability, Testing and Fault Detection
Section 11: Power Usage and Green Computing in the Cloud
References
http://web.mit.edu/6.897/www/readings.html
http://conceptcloud.blogspot.com/2012/04/annotated-reading-list-for-cloud.html
http://bryanpendleton.blogspot.com/2013/01/cloud-computing-reading-list.html
http://nsl.cs.usc.edu/~cs694/readings/index.html
http://www.software-engin.com/teaching/intro-to-the-cloud/cloud-reading-list
http://www.businessinsider.com/10-most-important-in-cloud-computing-2013-4?op=1