CS15-319 / 15-619 Cloud Computing Recitation 2 September 2 & 4, 2014
Administrative Issues Overview --TA hours, Piazza posts guidelines Last Week s Reflection --Project Primer, Module 1 This Week s Schedule --Deadlines for Quiz 1 & Project 1.1 Questions
Administrative TA office hours are posted on Piazza Pittsburgh TA rooms for office hours will not be available until September 15 th. See staff page on Piazza for more info on where to meet your TAs. Suggestions for using Piazza Read the Piazza Post Guidelines before asking Read Piazza questions & answers carefully to avoid duplicate ones Don t ask a public question about a quiz question Try to ask a public question if possible
FAQ this week, 1 Choose basic support level when signing up for your AWS account. If you didn t, change it ASAP. Use the link in Project Primer to submit your AWS account info. Recitation online video, click here All deadlines are 11:59 PM Pittsburgh Time
FAQ this week, 2 Questions related to the Project Primer Q: Permission denied when SSH access to EC2 A: Change access permission of pem file to 600 Q: Connect to Amazon instances under Windows environment. A: Follow instructions here; Recommended clients are XShell4 and Putty (instruction)
Amazon Web Services (AWS) Account Students who are just joining us or who have not completed the AWS Account Setup: === ONLY IF YOU HAVEN'T DONE SO ALREADY === 1. Setup an AWS Account 2. Complete AWS information using the link in Primer 3. Wait to receive Consolidated Billing Request email from Amazon Manual process, waiting time varies 4. Click the link to verify the linked billing Many students have not clicked on the link yet! 5. Start using EC2, S3, etc (beyond the Free Usage Tier)
Amazon Web Services (AWS) Account ALL STUDENTS: If you have created an AWS account and have not received notification that it was linked to our account Your credit card on file may get charged! We CANNOT reimburse you!!! Keep using the Free Usage Tier until your AWS account is linked to ours
Last Week Reflection Project Primer Tagging all instances and AWS resources Tags provided in the project write-up on OLI Checking expenditure on AWS Creating and modifying S3 Buckets Launching and remote logging into an EC2 micro instance Introduction to cloud computing Module1
Module 1: Introduction to Cloud Computing Definition delivery of computing as a service over a network whereby distributed resources are provided to the end user as a utility. Enabling Technologies Networking, virtualization, etc. Deployment Models Innovation Product Service Private, Public, Hybrid Benefits and Risks Economies of Scale
Module 2: Building Blocks and Service Models Building Blocks Service Models IAAS, PAAS, SAAS Use Cases Web/Mobile Application Big Data Analytics On-Demand, High-Performance Computing Online Storage and Archival SAAS PAAS IAAS
This Week s Schedule Complete Unit 1 by Thursday Read all pages in modules: Module 1: Introduction to Cloud Computing Module 2: Building Blocks and Service Models Complete activities on each page In-module activities are not graded but for self-test If you encounter a bug in the OLI write-up provide feedback at the end of each OLI page Take Checkpoint Quiz 1 Deadline, Thursday, September 4, 11:59pm EST Completed Project 1.1 (Sequential Analysis) Deadline, Sunday, September 7, 11:59pm EST
Quiz 1 Quiz 1 is open This assignment is timed. You will have 120 minutes to complete the attempt once you begin. Deadline for completion is Sep 4, 11:59 PM Pittsburgh Late submissions are NOT accepted You may not start the assignment after the deadline has passed. Maintain your own timer from when you start the quiz. You only have 1 attempt Question types multiple choice, fill-in-the-blanks, numeric questions most likely but not limited to these Feedback on Quiz 1 is released after the deadline passes
Big Data What is Big Data? It is high volume, high velocity, and/or high variety information assets.
Big Data
Big Data
What Do We Do With Data? Big Data requires new forms of processing to enable fast mining, enhanced decision-making, insight discovery and process optimization. Store Share Access Analyze Encrypt. and more!
Use Cases: Big Data Analysis 1. Online retailers such as Amazon are analyzing consumer spending habits to offer personalized marketing campaigns and offers to individual customers. 2. Companies such as Time Warner, Comcast etc. are using big data to track media consumption habits of their subscribers and provide value-added information to advertisers and customers.
Project 1, Module 1 Wikimedia data set Wikipedia page views Explore the data set to learn the format <project name> <page title> <number of accesses> <total data returned in bytes> You need to parse and filter the data As described on OLI Sort the data and save the output to a file Notes Configure EC2 instances in US-East-1 (N. Virginia) region Tag all resources with Key: Project and Value: 1.1 No tags grade penalties
Course AMIs What are AMIs (Amazon Machine Image) An image of the OS with other software bundled up AMIs are specified when creating virtual machines within EC2 Finding AMIs Course AMIs are Community AMIs Potential issues Regions : use US-East-1 (N. Virginia) for this course Search timeout Security Group : manage inbound ports (e.g. 22, 80, 8080)
Tagging AWS Resources How to tag instances & other resources Format: Key: Project Value: x.x Click here for more info Why tagging Manage resources & costs Issues faced with tagging Lower case/upper case Misspellings
Using Spot Instances How to use spot instances See definition of spot instances Max price should be larger than the current price of your availability zone Click here to find current spot prices Charging model for spot instances Amazon EC2 instances are billed on a non-prorated basis Each partial hour is billed as a full hour Issues Calculate total costs : Spot instance-hours are billed based on the Spot price at the start of each instance-hour. If your Spot Instance is interrupted in the middle of an instance-hour, you are not billed for that partial hour of Spot use. However, if you terminate the Spot instance, you are charged for the partial hour of use.
Penalties If No tag 10% penalty Expenditure > project budget 10% penalty Expenditure > project budget * 2 100% penalty Copy any code segment from lowest penalty is 200% Other students Previous students Internet (e.g. Stackoverflow) Do not work on code together This is about learning If you do, we will find out and take action
Submitting Code Submit your code on S3, by the deadline Submitting the wrong S3 URL on OLI will be penalized Submitting an incorrectly configured bucket will be penalized (see the instructions in the Project Guidelines page in the Project Primer on OLI) We will manually grade all code Be sure to make your code readable Preface each function with a header that describes what it does Use whitespace well. Indent when using loops or conditional statements Keep each line length to under 80 characters Use descriptive variable names For more detail, please refer to www.cs.cmu.edu/~213/codestyle.html If your code is not well documented and is not readable, we will deduct points Documentation shows us that you know what your code does! The idea is also NOT to comment every line of code
Manage Your Instances EC2 instances life cycle and costs When you are done using an instance (e.g. finish the project), terminate When you stop an instance, you still incur cost. (But all resources on that instance are accessible)
CloudWatch Billing Alert You can set up an alert to be notified automatically via e-mail when estimated charges reach a threshold that you specify You can use up to 10 alarms and 1,000 e-mail notifications free each month Demo Billing Alarm HOWTO: CloudWatch Create your billing alarm
Questions?
Upcoming Deadlines Quiz 1: Introduction to Cloud Computing Checkpoint Available Now Due 9/4/14 11:59 PM EST Project 1: Introduction to Big Data Analysis Sequential Analysis Checkpoint Available Now Due 9/7/2014 11:59 PM EST