Homework 5 -- Estimated Cloud Costs

Homework 5 -- Estimated Cloud Costs

AWS Calculator: AWS Pricing Calculator

  • You have 300 TB of data which can be processed in parallel (i.e., the data may be sliced into any sized chunk)

  • You have done some benchmarking where it 10 hours to process 2GB of data on a single core:

    • It takes 8GB of RAM when those 2GB of data are loaded in memory and being processed

    • Compute time scales linearly with the amount of input data (e.g., 20GB of data takes 10x longer than 2GB of data)

    • Memory usage scales linearly with the amount of input data (e.g., 20GB of data take 10x more RAM than 2GB of data)  

  • How much will it cost to process all the data in 1 hour? in 1 day?

  • The final output is 2TB. How much does cost to transfer the data to AWS and transfer the results back?

  • What is the difference between using different Amazon facilities?  Compare US West Coast, Tokyo, and San Paulo.

  • Make sure to show your work and system configuration!

  • Note:  This is an individual assignment (not a team project)

  • Due 2h before class (7:30am) on Sept. 21!