Amazon Web Services (AWS)
Course Notes
These are notes from the Udemy course AWS Certified Developer - Associate 2020. It's a great course, give it a go!
AWS Samples
https://github.com/aws-samples
^^ This GitHub page has a load of AWS examples - really worth a flick through! E.g. for deploying PHP, containers and Nginx: https://github.com/aws-samples/eb-docker-nginx-proxy
IAM - Identity and Access Management
- See https://d0.awsstatic.com/whitepapers/Security/AWS_Security_Best_Practices.pdf
- For management of users and their access level to the AWS console.
- Centralised control of AWS account
- Shared access
- Granular permissions
- Multi-factor authentication
- Integrates with AWS services
- Groups
  - Collection of users with a common permission set.
- Roles
  - Define a set of permissions that can be assigned to AWS resources/entities. A secure way to grant permissions to entities that you trust, e.g. an IAM user in another account, or application code running on an EC2 instance that needs to perform actions on AWS resources, etc.
  - E.g.: Give an EC2 instance permission to write a file to an S3 bucket. Create a role. Select the service that the role will use - an EC2 instance in this example. Attach the S3 policy of choice, e.g. write access, to the role. This role can then be applied to any new EC2 instance created.
- Policies
  - A document that defines one or more permissions that can be assigned to users, groups, or roles. For example, a policy could define read permission on a DynamoDB table and then read/write on a specific S3 bucket, and so on. It's a collection of permissions of varying specificity. (A minimal example policy document is sketched just after this list.)

    USERS             GROUP              POLICY
    James --------> Supper Group ----> Eat policy
    John -------/                `---> Drink policy
    Paul ------/

- In the console, access under "Security, Identity and Compliance".
- When creating users you will have:
  - User name - for AWS console login.
  - Access key ID - for PROGRAMMATIC access, i.e. via CLI, scripts etc.
  - Secret Access Key - for PROGRAMMATIC access, i.e. via CLI, scripts etc. This will only be displayed once. You can download it as a CSV if you can keep it securely. Note: once the secret has been hidden you canNOT access it again - you would have to generate a new key.
  - Password - for AWS console login.
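To make the policy-document format concrete, here is a minimal sketch of the DynamoDB-read/S3-read-write example above. The table and bucket ARNs are made up for illustration:

    {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "ReadMyTable",
                "Effect": "Allow",
                "Action": ["dynamodb:GetItem", "dynamodb:Query", "dynamodb:Scan"],
                "Resource": "arn:aws:dynamodb:eu-west-2:111122223333:table/MyTable"
            },
            {
                "Sid": "ReadWriteMyBucket",
                "Effect": "Allow",
                "Action": ["s3:GetObject", "s3:PutObject"],
                "Resource": "arn:aws:s3:::my-example-bucket/*"
            }
        ]
    }

Attach this to a user, group or role and the bearer gets exactly those permissions and nothing more.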
Instance Profiles
-----------------
EC2
---
See https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_use_switch-role-ec2_instance-profiles.html

    An instance profile is a container for an IAM role that you can use to pass role
    information to an EC2 instance when the instance starts ... When you then use the Amazon
    EC2 console to launch an instance with an IAM role, you can select a role to associate
    with the instance.

Elastic Beanstalk
-----------------
See https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/iam-instanceprofile.html

Cognito
-------
USER POOLS: user directories that provide sign-up and sign-in options for your ... app users ... app users can sign in either directly through a user pool, or federate through a third-party identity provider (IdP) ... After a successful authentication, your ... app will receive user pool tokens from Amazon Cognito ...

IDENTITY POOLS: provide AWS credentials to grant your users access to other AWS services.

Web Identity Federation
-----------------------
Lets you give your users access to AWS resources after they have successfully authenticated with a web-based identity provider like Amazon, FB or Google. After authentication, the user gets an auth code from the Web ID provider, which they trade for temporary AWS security credentials.

Cognito provides Web Identity Federation with the following features:
- Sign up and sign in
- Access for guests
- Acts as an ID broker between your app and Web ID providers, so no need to write additional code
- Syncs user data across multiple devices

Cognito acts as a broker handling all interactions with Web ID providers. E.g. a user can log in to FB and get an auth code, which the app exchanges with AWS for temporary access credentials (which map to an IAM role allowing specific access rights). These can then be used to access an S3 bucket, for example, or parts of a database, or even a website behind an ALB acting as a reverse authentication proxy.

Cognito User Pools
------------------
- User pools are USER DIRECTORIES used to MANAGE SIGN-UP/SIGN-IN functionality for web apps.
- Can sign in directly to the pool or via a Web ID provider (FB, Google, Amazon etc). Successful auth generates a number of JSON Web Tokens (JWTs).
- Enable creation of unique IDs for users and the ability to authenticate users with ID pools.
- APP CLIENT - is what we use to call the various APIs on our behalf, e.g. the API to register a new user or sign in an existing one. See https://docs.aws.amazon.com/cognito/latest/developerguide/user-pool-settings-client-apps.html

    An app is an entity within a user pool that has permission to call unauthenticated APIs
    (APIs that do not have an authenticated user), such as APIs to register, sign in, and
    handle forgotten passwords. To call these APIs, you need an app client ID and an optional
    client secret.

In the app client settings you need to configure an identity provider: click "Cognito User Pool". Then configure the callback URLs - the URL the user is redirected to after they have successfully signed in or signed up. The logout URL will invalidate the access tokens supplied after a successful login.
Going Serverless
Lambdas
-------
See https://aws.amazon.com/lambda/faqs

EC2 launched in 2006. At the time of writing it is 14 years old! Infrastructure as a service!

Serverless means you don't need to control the server: you don't need to provision the hardware or worry about the OS it's running - it's no longer something you need to worry about. You also don't have to worry about scaling in response to load, keeping servers running, etc. Lambda allows you to run your code without any need to provision servers, install software, deploy containers or worry about any low-level details, and it can parallelise code. All this is taken care of for you. E.g. Alexa is using Lambdas every time you make an Alexa query. So, no need for any EC2 instances, EBS volumes etc etc - THERE IS NOTHING TO MANAGE!

Use:
- Event-driven compute service - AWS Lambda runs code in response to events, e.g. changes to data in an S3 bucket or DynamoDB table.
- Compute service - run code in response to HTTP requests using Amazon API Gateway or API calls made using AWS SDKs. This can be massively parallel - many requests launch many Lambdas, which all use the same code but represent different request instances with associated data etc.

    Each AWS Lambda function runs in its own isolated environment, with its own resources and
    file system view. AWS Lambda uses the same techniques as Amazon EC2 to provide security
    and separation at the infrastructure and execution levels.

    AWS Lambda natively supports Java, Go, PowerShell, Node.js, C#, Python, and Ruby code,
    and provides a Runtime API which allows you to use any additional programming languages
    to author your functions.

Priced on #requests and duration.

Configure A Lambda Function
---------------------------
- Click Services > Compute > Lambda > Create Lambda Function.
- Create from scratch - add a name and platform. If you've not used Lambda before you will need to create a new role to give Lambda permission to execute. Use the policy "Simple Microservice Permissions".
- Click "Create function".
- Paste in your Lambda code (a minimal sketch of a handler is shown after this list).
- Add your trigger... can be sooo many things: including but nowhere near limited to API Gateway, CloudFront, S3, DynamoDB, and more...
- TODO COMPLETE ME!
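For reference, a Lambda handler is just a function that receives the trigger event and a context object. A minimal Python sketch - the event shape shown is what API Gateway's proxy integration sends; the names are illustrative:

    import json

    def lambda_handler(event, context):
        # 'event' carries the trigger's payload. For an API Gateway proxy
        # integration it contains keys like 'httpMethod' and
        # 'queryStringParameters'; other triggers (S3, DynamoDB, ...) send
        # different event shapes.
        params = event.get("queryStringParameters") or {}
        name = params.get("name", "world")
        # This is the response shape the proxy integration expects back:
        return {
            "statusCode": 200,
            "headers": {"Content-Type": "application/json"},
            "body": json.dumps({"message": "Hello, " + name + "!"}),
        }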
API Gateway
-----------
Use API Gateway to create RESTful API endpoints. These are services your web app etc needs from a backend server - for example, a login management endpoint. This could be implemented using your own server, or a server running on an EC2 instance. But why bother managing your own service?! Let AWS do that for you and instead just give you some endpoint URLs and things that can happen based on someone accessing them!

So, API Gateway gives you an endpoint URL to use. You get to say what happens when the endpoint is accessed: use another AWS service or run a Lambda.

See:
https://aws.amazon.com/api-gateway/
https://aws.amazon.com/documentation/apigateway/

Watch out for costs!! https://aws.amazon.com/api-gateway/pricing/
Free tier (as seen on 03-Oct-2020):

    The Amazon API Gateway free tier includes one million API calls received for REST APIs,
    one million API calls received for HTTP APIs, and one million messages and 750,000
    connection minutes for WebSocket APIs per month for up to 12 months.

Amazon API Gateway - fully managed service - publish APIs at any scale to act as a "front door" for apps to access data etc from back-end services, such as apps on EC2 instances, code on Lambdas etc.
- Exposes HTTPS endpoints to define a RESTful API. API Gateway gives us an HTTPS address to make API calls to, and we then configure how the API responds to those calls - e.g. get data from a DynamoDB table or fire off a Lambda etc etc to return the response.
- Lets you serverlessly connect to services like Lambda & DynamoDB.
- Can send each API endpoint to a different target.
- Runs efficiently with low cost & scales effortlessly.
- Can track and control usage with an API key.
- Can throttle requests to prevent attacks.
- Can maintain multiple versions of your API.

How to configure?
- Define an API (container)
- Define Resources and nested Resources (URL paths)
- For each resource:
  - Select supported HTTP methods, POST, GET etc etc
  - Set security
  - Choose target - EC2, Lambda etc
  - Set request & response transformations
- Deploy API to a Stage
  - Uses the API Gateway domain by default, but you can use your own to hide the fact it's AWS
  - !!!Supports AWS Certificate Manager: free SSL/TLS certs!!! WOO!!!

API Caching: in API Gateway you can enable API caching. This caches an endpoint's response to reduce the number of calls to the endpoint for repeated requests. Also improves latency. Need to be aware of XSS and CORS.

Example:
Choose the REST API type.
Now you can define the resources you are going to use and the HTTP methods we want to react to. Create a resource, which is like a URL SEGMENT (a CLI sketch of the same step is shown below).
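If you prefer the CLI to the console, the same step can be sketched like this - the IDs are made up; `aws apigateway get-resources` tells you the root resource ID to use as the parent:

    # Create the API container; the command returns the API id, e.g. "abc123":
    aws apigateway create-rest-api --name "MyTestAPI"
    # Find the id of the root resource ("/"), e.g. "xyz789":
    aws apigateway get-resources --rest-api-id abc123
    # Create a resource (URL segment) under the root, e.g. /home:
    aws apigateway create-resource --rest-api-id abc123 \
        --parent-id xyz789 --path-part home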
You can select to enable Cross-Origin Resource Sharing (CORS). CORS is a mechanism that uses additional HTTP headers to tell browsers to give a web application running at one origin access to selected resources from a different origin. See: https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS
Now we have created a URL segment, which is a resource. It can't be used, however, until the requests that it should handle are defined. To do this, select the endpoint resource and then, from the actions menu, select "Create Method". Then you can define which HTTP method you want the resource to respond to (ANY, DELETE, GET, POST, PUT etc etc). E.g. if a GET method was created:
A mock will allow you to create a dummy response. If you create a mock you can use the integration response to return dummy data.
Add some dummy response data like this:
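A sketch of what such a dummy body might look like - the contents are entirely made up; anything the mock's integration response returns will do:

    {
        "status": "ok",
        "message": "This is dummy data from the mock integration"
    }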
Next go to ACTIONS > DEPLOY API. It will then ask you to deploy your API. You can deploy to an existing or a new stage. Stages allow management of different versions of the API, e.g. dev vs. production versions.

Simple Serverless Website Using Route53, API Gateway, Lambda and S3
-------------------------------------------------------------------
The domain name in Route53 and the bucket name need to be the same. They MUST BE THE SAME (top level, excluding the .com). You must enable static website hosting for the bucket. If you don't want to spend money on an AWS domain you can just use the S3 bucket address.

Steps:
1. In the Bucket name list, choose the bucket that you want to use for your static website.
   1.1. Choose Properties.
   1.2. Choose Static website hosting.
   1.3. Choose "Use this bucket to host a website".
   1.4. Note down the endpoint. E.g. my last endpoint was:
        Endpoint: http://epicplanner.jehtech.com.s3-website.eu-west-2.amazonaws.com
2. Edit block public access settings.
   2.1. Choose the Permissions tab, and edit,
   2.2. Clear Block all public access, and choose Save -- anyone on the internet can access your bucket!!
3. Add Bucket Policy: to make the objects in your bucket publicly readable, you must write a bucket policy that grants everyone s3:GetObject permission.
   3.1. Choose "Bucket Policy" under the "Permissions" tab for your S3 bucket,
   3.2. Add the following:

        {
            "Version": "2012-10-17",
            "Statement": [
                {
                    "Sid": "PublicReadGetObject",
                    "Effect": "Allow",
                    "Principal": "*",
                    "Action": ["s3:GetObject"],
                    "Resource": [
                        "arn:aws:s3:::example.com/*"   << PUT YOUR S3 BUCKET ARN HERE
                    ]
                }
            ]
        }

4. (Optional) Add a Route53 entry for your domain to point to the S3 endpoint.
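The same setup can be scripted with the AWS CLI - a sketch, assuming the bucket already exists, `policy.json` holds the policy above, and using the example bucket name from step 1.4:

    # Enable static website hosting on the bucket:
    aws s3 website s3://epicplanner.jehtech.com/ --index-document index.html
    # Attach the public-read bucket policy:
    aws s3api put-bucket-policy --bucket epicplanner.jehtech.com --policy file://policy.json
    # Upload the site content:
    aws s3 sync ./site s3://epicplanner.jehtech.com/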
Forward And Reverse Proxies
Ref: https://www.cloudflare.com/learning/cdn/glossary/reverse-proxy/

Forward Proxy
-------------
TL;DR: A forward proxy sits in front of a client/clients and ensures that no origin server ever communicates directly with that specific client / those clients.

Standard Internet communication: client A reaches out directly to origin server C, with the client sending requests to the origin server and the origin server responding to the client. With a fwd proxy: A sends requests to B (the proxy), which forwards the request to C (the origin server). C will then send a response to B, which will forward the response back to A.

Why?
- Avoid state or institutional browsing restrictions: connect to the proxy rather than the restricted site.
- Block access to certain content: the proxy intercepts all traffic so it can deny some requests.
- Protect online identity: servers see the proxy's IP, not the client's.

Reverse Proxy
-------------
TL;DR: Sits in front of web server(s), intercepting client requests. A reverse proxy sits in front of an origin server and ensures that no client ever communicates directly with that origin server.

With a rev proxy: A sends comms to the reverse proxy, B. B then sends the request to server C, which replies to B, and B forwards the reply back to A.

Why?
- Load balancing: the proxy can forward client requests on to one server from a pool. To the client it looks like it is talking to one server, when it could be one of many.
- Security: the server's IP is never revealed, so it is harder to attack. The reverse proxy generally has higher security and more resources to fend off an attack.
- Caching: a reverse proxy can cache frequently requested content.
- TLS/SSL encryption: if the origin server is low on resources or doesn't support encryption, the reverse proxy can do this on the server's behalf.
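To make the reverse-proxy idea concrete, here is a minimal Nginx sketch (server name and backend address made up): Nginx plays the part of B, listening on port 80 and relaying everything to the hidden origin server C, while adding the X-Forwarded-For header so C can still see the real client IP:

    server {
        listen 80;
        server_name www.example.com;

        location / {
            # Relay the request to the hidden origin server (C).
            proxy_pass http://10.0.0.12:8080;
            # Preserve the real client IP for the origin's logs.
            proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
            proxy_set_header Host $host;
        }
    }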
Elastic Compute Cloud (EC2)
Stands for Elastic Compute Cloud, abbreviated to EC2. A web service providing resizable compute capacity in the cloud. Virtual machines in a cloud. Pay only for the capacity you use - you don't have to (over) provision your own kit. Isolates you from common failure scenarios.

EC2 Options
-----------
- On demand - fixed rate per hour/second with no commitment, no upfront payment. Good for app dev or apps with short term, spiky, unpredictable workloads that cannot be interrupted.
- Reserved - capacity reservation - pay up front. Good discounts on the hourly charge for an instance. One to three year term. Apps with steady state or predictable usage.
- Spot - allows bidding. For apps that have very flexible start and end times, or that are only feasible at very low compute prices. E.g. use compute during off-peak times for cheaper compute cost.
- Dedicated hosts - physical EC2 server for your use only. For regulatory requirements that do not allow multi-tenant virtualization, or licensing which does not support multi-tenancy.

EC2 Instance Types
------------------
There are different "families" of instance types that provide optimisations for specific requirements like high speed storage, or memory space, or compute power etc. Families include: FPGA, High Speed Storage, Graphics Intensive, High Disk Throughput, Low Cost, Dense Storage, Memory Optimized, General Purpose, Compute Optimized, GPU etc.

You will mostly use:
- T2 = lowest cost, general purpose - web servers and small DBs.

Elastic Block Store (EBS)
-------------------------
EBS allows you to create storage volumes and attach them to Amazon EC2 instances. Once attached, you can create a FS on top of these volumes, run DBs etc etc. Protects you from failure of a single component. Types: General Purpose SSD (GP2), Provisioned IOPS SSD (IO1), Throughput Optimized HDD (ST1), Cold HDD (SC1).

Elastic Load Balancer (ELB)
---------------------------
See https://docs.aws.amazon.com/elasticloadbalancing/latest/userguide/load-balancer-getting-started.html

Types Of Load Balancers
- Application Load Balancer. OSI Layer 7. Can make very clever decisions based on application-level logic. Application Load Balancers are best suited for load balancing of HTTP and HTTPS traffic. They operate at Layer 7 and are application-aware. They are intelligent, and you can create advanced request routing, sending specified requests to specific web servers.
  For setup see: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/environments-cfg-alb.html
  For authentication see: https://aws.amazon.com/blogs/aws/built-in-authentication-in-alb/
- Network Load Balancer. Most expensive option. Works at OSI Layer 4 - note that despite the name, Layer 4 is the transport layer (TCP), not the network layer. Fast performance! Best suited for load balancing of TCP traffic where extreme performance is required. Operating at the connection level, Network Load Balancers are capable of handling millions of requests per second, while maintaining ultra-low latencies. Use for EXTREME PERFORMANCE.
- Classic Load Balancer (ELB). No longer recommended - legacy! Legacy Elastic Load Balancers can balance at Layer 7 or 4, but Layer 7 is not as intelligent as the Application Load Balancer. If your app stops responding, the ELB returns a 504 error.

The load balancer is an extra layer that will "hide" the public IP address of the entity accessing your server. To get the client's public IP, look in the "X-Forwarded-For" header.
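For example (addresses made up), a request that passed through one intermediary before the balancer might arrive with:

    X-Forwarded-For: 203.0.113.7, 10.0.1.25

The leftmost address is the originating client; each hop appends the address it saw the request come from.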
To create a load balancer go back to your EC2 instance and select, in the left-hand menu pane, LOAD BALANCING > Load Balancers. Then click the "Create Load Balancer" button.
- Create the HTTP(S) balancer.
- Assign a security group.
- Configure Routing: create a new target group on HTTP port 80. Add /index.html as the health check item.
- Register targets.
- Review.
- Create.

Can configure the ELB to terminate HTTPS connections and deliver messages to the backend over HTTP. The ELB will handle encrypt/decrypt.
- See https://aws.amazon.com/blogs/aws/elastic-load-balancer-support-for-ssl-termination/
- Upload a certificate to your AWS account using the AWS Management Console or the `iam-servercertupload` command, then retrieve the ID of the uploaded certificate.
- Create a new load balancer which includes an HTTPS listener, and supply the certificate ID from the previous step.

ELBs can also act as AUTHENTICATING REVERSE PROXIES! See https://aws.amazon.com/blogs/aws/built-in-authentication-in-alb/

    ALB can now securely authenticate users as they access applications, letting developers
    eliminate the code they have to write to support authentication and offload the
    responsibility of authentication from the backend.

You MUST USE AN APPLICATION LOAD BALANCER for this. At the time of writing, classic seems to be the default option when creating a load balancer. To set up an ALB see: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/environments-cfg-alb.html. Happy, happy days :D

DNS Refresher
-------------
See:
http://social.dnsmadeeasy.com/wp-content/uploads/2016/12/DNS-for-Dummies-ebook-3-min.pdf
https://ns1.com/resources/cname

Fully Qualified Domain Name (FQDN) - has both a HOSTNAME AND DOMAIN NAME. For example, accessing a server called bob we might go to "bob.my-business-name.com". "bob" is the hostname and "my-business-name.com" is the domain name.

DNS request:
1. Contact the ROOT NAME SERVERS (see http://www.iana.org/domains/root/servers) and get the address of the top level domain (TLD) server. So in the bob example we get the COM TLD server address.
2. Next contact the COM TLD server(s) to get the name server responsible for "my-business-name".
3. Next contact that name server for the address of the host "bob".

Two types of DNS servers:
1. Authoritative: do the actual name -> IP address mapping.
2. Recursive (or Caching): contacted by clients and do the work of the DNS lookup by talking to authoritative servers. They recurse through the DNS name hierarchy to resolve the address for the client.

DOMAIN REGISTRAR - the service through which you register a domain name and map an IP address to it. The domain registrar deals with registry operators, who deal with the master domain lists - overall managed by IANA. So, if I register "bob.my-business-name.com" the registrar will contact a registry operator, which is overseen by IANA, and adds the name "bob.my-business-name.com" to the global list of all domain names (ensuring, of course, that it isn't already registered!). That's all the registrar does.

DNS HOSTING PROVIDER - hosts the servers that authoritatively respond to queries for your domain. So, a registrar might have registered "bob.my-business-name.com" for me, but it is the DNS hosting provider that has a server that, when queried, will respond with the IP address for my host! Sometimes a domain registrar will offer DNS hosting and vice versa, but the two functions are separate.

Your DNS hosting provider has to know, when x.y.z is accessed, what IP address it should map to. For this it uses something called an "A record".
If you own x.y.z, you create, in your hosting provider's map of x.y.z, an "A record" and fill it with your server's IP address. You can also set up aliases. If for x.y.z you set up a "CNAME record", you can specify that when x.y.z is resolved by your DNS hosting provider it should redirect the querier to another URL instead.

    A Canonical Name (CNAME) Record is used in the Domain Name System (DNS) to create an
    alias from one domain name to another domain name. A common example is the www subdomain,
    which is provided as an alias to the root domain name - users accessing "www.example.com"
    are referred to the root domain (or DNS zone apex) "example.com".

For example, to map www.example.com to the zone apex, example.com, the records might look like:

    www.example.com.    CNAME    example.com.
    example.com.        A        192.162.100.100

DNS Resolution Process for CNAME Records:
- A DNS client (such as a browser or network device) requests the address www.example.com, and a DNS request is created.
- A DNS resolver receives the request and finds the Authoritative Name Server that holds the DNS Zone file with the DNS records for the "example.com" domain.
- The DNS request is resolved and the CNAME record is returned to the client.
- The client understands www.example.com is only an alias for the real address, "example.com", and issues a new DNS query for "example.com".
- The process is repeated and the resolver returns the A record for "example.com", containing the IP address.
- The DNS client now connects to "example.com" using its IP address.

Route53
-------
- Amazon's DNS service.
- Allows mapping of domain names to EC2 instances, load balancers and S3 buckets.
- Log into the AWS console, then Services > Network > Route 53.
- Click Domains > Registered domains. Click "Register Domain". Select a domain and choose a name. Add it to your cart and click "Continue".
- Fill in registrant information.
- Domain registration can take up to 3 days to complete.
- Click Dashboard > "Hosted zones". You should see your registered domains. Click "Go to record sets" to create our DNS records.
- Create an "A" record. This says where your domain is going to go. There are lots of record set types you can choose: an "A" record points to an IPv4 address, an "AAAA" record points to an IPv6 address, etc.
- Create a type "A" record set. Use the NAKED DOMAIN NAME - that is, the name without the "www." prefix. This is also sometimes referred to as the "zone apex record". In order to do this you HAVE TO USE AN ALIAS. You create an alias for the zone apex. Aliases are only supported by DNS record types A and AAAA.
- Pick the alias target - it can be an S3 website, an ELB website, Elastic Beanstalk, CloudFront etc. This is why we needed to set up the ELB... it doesn't seem to support directly using an EC2 instance. Thus you MUST CREATE AN ELB AND THEN PUT AN EC2 INSTANCE BEHIND THE ELB. Select your ELB from the target list and hit create :)

From Wikipedia:

    A Canonical Name record (abbreviated as CNAME record) is a type of resource record in the
    Domain Name System (DNS) which maps one domain name (an alias) to another (the canonical
    name).[1] ... Also referred to as a CNAME record, a record in a DNS database that
    indicates the true, or canonical, host name of a computer that its aliases are associated
    with. ... When referring to IP addressing, canonical means the authoritative host name
    stored in a DNS database that all of an IP address' aliases resolve to.

From the AWS Certified Solutions Architect Practice Tests book:

    A zone apex record is a DNS record at the root, or apex, of a DNS zone.
So amazon.com is an apex record (sometimes called a naked domain record).

From NS1.com:

    DNS servers create a DNS record to provide important information about a domain or
    hostname, particularly its current IP address.

The most common DNS record types are:
- Address mapping record (A Record) - a.k.a. DNS host record; stores a hostname and its corresponding IPv4 address.
- IP Version 6 Address record (AAAA Record) - stores a hostname and its corresponding IPv6 address.
- Canonical Name record (CNAME Record) - can be used to alias a hostname to another hostname. When a DNS client requests a record that contains a CNAME, which points to another hostname, the DNS resolution process is repeated with the new hostname.
- Mail exchanger record (MX Record) - specifies an SMTP email server for the domain, used to route outgoing emails to an email server.
- Name Server records (NS Record) - specifies that a DNS Zone, such as "example.com", is delegated to a specific Authoritative Name Server, and provides the address of the name server.
- Reverse-lookup Pointer records (PTR Record) - allows a DNS resolver to provide an IP address and receive a hostname (reverse DNS lookup).

    ... This can prove convenient when running multiple services (like an FTP server and a
    web server, each running on different ports) from a single IP address. One can, for
    example, point ftp.example.com and www.example.com to the DNS entry for example.com,
    which in turn has an A record which points to the IP address. Then, if the IP address
    ever changes, one only has to record the change in one place within the network: in the
    DNS A record for example.com.

Setup Simple Webserver
----------------------
- From the console go to Services > Compute > EC2.
- Click on the "Launch Instance" button.
- This will start us on the journey to launching our first instance. Instances are created from Amazon Machine Images (AMIs).
- Select an AMI of your choice and then choose the type of the instance you want. Here you select the family type talked about in the section above, "EC2 Instance Types". I will want a T2 instance, probably one eligible for the free tier.
- Choose it, e.g. "T2 micro", and click the NEXT button to go and configure the instance details.
- Then add tags.
- Configure security groups. This is like a "virtual firewall" where you can specify what traffic you allow in and out of your instance. So for a webserver, for e.g., we need to enable port 80 for TCP.
  - Most probably create a new security group and give it a name.
  - Allow SSH to remotely admin the EC2 instance.
  - Allow HTTP access (and HTTPS access if you need it).
- Click "Review and Launch" and review the details.
- Click "Launch".
- You need to either add or create a key pair that will allow you to access the instance. We store the private key somewhere locally and SECURELY. The public key is stored by AWS. NEVER SHARE THE PRIVATE KEY, otherwise anyone can access the server! Name the key pair and then download it, saving it somewhere safe and secure.
- Click "Launch Instance".
- It can take a little time for the instance state to say "running". Once running it will give you your public IP address, which you can then use.
- Go to the directory with your downloaded key pair and change its permissions.
The key needs to be quite locked down to use with EC2:

    chmod 400 MyNewKeyPair.pem
    ssh ec2-user@<server-ip-from-console> -i MyNewKeyPair.pem

Inside the ssh session:

    sudo su
    yum update -y  # Or use apt depending on server version used
    # install apache
    yum install httpd -y
    # start it
    service httpd start
    # make sure apache auto boots on restart
    chkconfig httpd on
    # See service status
    service httpd status
    # Create stub web page
    echo "<html><body>HELLO</body></html>" > /var/www/html/index.html

How To Install LetsEncrypt SSL Certificate in AWS EC2 Instance Notes
--------------------------------------------------------------------
Ref: https://www.youtube.com/watch?v=odtG8Cjshvo
See: https://letsencrypt.org/getting-started/, https://certbot.eff.org/

How to Setup SSL certificate in CentOS 7 using LetsEncrypt Notes
----------------------------------------------------------------
Ref: https://www.youtube.com/watch?v=qYWhuITSVMc&feature=youtu.be
See: https://letsencrypt.org/getting-started/, https://certbot.eff.org/
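The videos walk through certbot; the short version, as a sketch - this assumes Apache on a host where certbot is available from the package manager (package names vary by distro), and that your domain already resolves to the instance:

    # Install certbot plus the Apache plugin (Debian/Ubuntu-style names shown):
    apt install -y certbot python3-certbot-apache
    # Obtain a certificate and let certbot edit the Apache config:
    certbot --apache -d example.com -d www.example.com
    # Certificates last 90 days; test that auto-renewal will work:
    certbot renew --dry-run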
S3 (Simple Storage Service)
* Object based storage - secure, scalable, durable. Data spread across multiple devices and facilities.
* Files are stored in Buckets (like a folder).
* Key/Value store (with optional Version ID & Metadata).
* Universal namespace - names must be unique globally!

From the AWS website:

    Buckets and objects are the primary resources, and objects are stored in buckets. Amazon
    S3 has a flat structure instead of a hierarchy like you would see in a file system.
    However, for the sake of organizational simplicity, the Amazon S3 console supports the
    folder concept as a means of grouping objects. Amazon S3 does this by using a shared name
    prefix for objects (that is, objects have names that begin with a common string). Object
    names are also referred to as key names.

* Data consistency model:
  * PUTs of new objects - read-after-write consistency. A read after a write is immediately valid.
  * Overwrite PUTs and DELETEs - eventual consistency - can take some time to propagate.
* Subresources - bucket-specific config:
  * Bucket policies, access control lists (ACLs).
  * Cross Origin Resource Sharing (CORS) - files located in one bucket can be allowed to access files in other buckets.
  * Transfer acceleration for multiple uploads.
* Guarantees 99.9% availability (i.e., service uptime), 99.999999999% durability!
* Tiered storage available.
* Lifecycle management (automatic movement between tiers).
* Versioning.
* Encryption.
* Max file size 5 TB!!

Storage Tiers:
* Regular - stored across multiple devices and facilities. Can sustain loss of 2 data centers! Guarantees 99.9% availability (i.e., service uptime), 99.999999999% durability!
* IA (Infrequently Accessed) - for less frequent access but rapid retrieval. Charged per retrieval.
* S3 One Zone IA - less availability than IA. Less cost - 20% less.
* Reduced Redundancy Storage - durability down to 99.99%. Use for data that can be re-created if lost, e.g. thumbnails.
* Glacier - very cheap but archival only. Takes 3-5 hours to restore from Glacier!!
* Intelligent Tiering - for data with unknown or unpredictable access patterns. Auto moves your data to the most cost-effective tier. 2 tiers - frequent and infrequent access. If an object is not accessed for 30 consecutive days it goes to the infrequent tier; when accessed it goes back to the frequent tier. Same durability and reliability as regular.

Charges:
* Storage per GB.
* Requests.
* Storage Management Pricing - e.g. for analytics.
* Data Management Pricing - moving data OUT of S3.
* Transfer Acceleration - extra charges for acceleration.

S3 Security
-----------
* Default - new buckets are PRIVATE.
* Can configure public access if required.
* Use:
  * Bucket policies - bucket level, JSON.
  * Access Control Lists - object level.
* Can create access logs, written to this/another bucket.

S3 Access Policies
------------------
From the AWS S3 console, select a bucket, then select the "Permissions" tab. You can control public accessibility using ACLs and enable/disable on a file-by-file basis.

The "Bucket Policy" tab just exposes a policy editor. Below the text box is a link to the "Policy generator", which can help you build up your bucket policy. In the generator:
* Select type of policy - "S3 Bucket Policy".
* Select effect - "allow" or "deny".
* Add "Principal" - this is the entity to which you are allowing/denying access, i.e. who this policy applies to. Could be an IAM user, another S3 bucket or other AWS resource. If it is a user you must enter the user's ARN, which you get from the Users section in IAM.
* AWS service - "Amazon S3".
* Actions - the actions to allow/deny.
* ARN - enter as displayed at the top of the editor text box.
* Hit "Generate policy", and copy and paste the policy code into the editor.

S3 Encryption
-------------
1. In transit - TLS/SSL.
2. At rest:
   * S3 managed keys (SSE-S3). Each object encrypted using strong multi-factor encryption. The key is also encrypted with a master key. AES-256 bit encryption.
   * AWS Key Management Service managed keys (SSE-KMS). Uses the KMS key management service. Get separate permissions for the "envelope key", which is the key that encrypts your key. Also get an AUDIT TRAIL.
   * Customer provided keys (SSE-C). AWS manages the encryption/decryption but YOU manage the keys, their rotation and lifetime etc.
3. Client side - the user encrypts the file herself before upload.

Example syncing a dir to an encrypted folder:

    aws s3 sync . s3://your/dir --sse

Install & Configure The CLI
---------------------------

    apt install awscli
    aws configure set aws_access_key_id <AWS_ACCESS_KEY> [--profile <PROF_NAME>]
    aws configure set aws_secret_access_key <AWS_SECRET_KEY> [--profile <PROF_NAME>]
    aws configure set region "eu-west-1" --profile ci

The access key and secret access key will have been created previously in the IAM settings. Then you can do things like:

    aws [--profile <PROF_NAME>] s3 ls s3://SOME/DIR/
    ^ NOTE: Trailing "/" is important if you want to list the directory contents and not
      just the directory itself.
    aws [--profile <PROF_NAME>] s3 mv|cp LOCAL_FILE s3://SOME/DIR/
    ^ NOTE: Trailing "/" is important!
    aws [--profile <PROF_NAME>] s3 mv|cp s3://SOME/DIR/FILE LOCAL_FILE
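Another handy one: objects in a private bucket can be shared temporarily with a pre-signed URL, with no policy changes needed. A sketch (the bucket/key are made up):

    # Generate a URL granting GET access to the object for one hour:
    aws [--profile <PROF_NAME>] s3 presign s3://my-bucket/some/file.txt --expires-in 3600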
Elastic Beanstalk
MUST READ: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/concepts.html
TUTORIALS: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/tutorials.html

What Is It?
-----------
Elastic Beanstalk is a service for deploying and scaling web applications developed in many popular languages - Java, .NET, PHP, Node.js, Python, Ruby, Go and Docker - onto widely used application server platforms like Apache Tomcat, Nginx, Passenger, and IIS.

From AWS:

    Amazon Web Services (AWS) comprises over one hundred services, each of which exposes an
    area of functionality. While the variety of services offers flexibility for how you want
    to manage your AWS infrastructure, it can be challenging to figure out which services to
    use and how to provision them. With Elastic Beanstalk, you can quickly deploy and manage
    applications in the AWS Cloud without having to learn about the infrastructure that runs
    those applications. Elastic Beanstalk reduces management complexity without restricting
    choice or control. You simply upload your application, and Elastic Beanstalk
    automatically handles the details of capacity provisioning, load balancing, scaling, and
    application health monitoring.

[https://adrianhall.github.io/cloud/2019/01/31/which-aws-service-for-hosting/]:

    On the surface, you will find a lot of similarities between Amazon Lightsail and AWS
    Elastic Beanstalk with EC2 instances. They both run exactly the same web application sets
    ... However, AWS Elastic Beanstalk can run within a VPC, which means you can run the web
    applications in an "internal only" configuration (with no connection to the Internet
    ...). You also get a whole host of flexibility in terms of settings that you don't get
    with Lightsail. ... Best for: Your most challenging enterprise apps where access to the
    underlying OS is required.

Developers can focus on writing code and don't need to worry about any of the underlying infrastructure needed to run the application. "Throw it some code in a ZIP file and it will figure it out and configure an environment for you". Upload the code and EB handles deployment, capacity provisioning, load balancing, auto-scaling and app health. You retain full control of the underlying AWS resources powering your app, and you pay only for the AWS resources required to store and run your app, e.g. the EC2 instances or S3 buckets etc.

In summary:
- Fastest/simplest way to deploy an app in AWS.
- Scales resources up and down for you, including the web application server platform.
- EC2 instance type is selectable for optimization.
- Managed platform updates auto-apply to the OS and Java/PHP/NodeJS etc etc.
- EB vs. CloudFormation: EB configures the environment through a GUI, CF is scripted config.

NOTE: Free tier eligible instances will not have a load balancer.
Nomenclature
------------
Application - Logical collection of Elastic Beanstalk components, including environments, versions, and environment configurations. Think of it like a CLASS definition.

Environment - Collection of AWS resources running an application version. Each environment runs only 1 app version at a time; however, you can run the same app version, or different application versions, in many environments simultaneously. When an environment is created, EB provisions the resources needed to run the app. Think of it as an INSTANCE of a class.

Environment Configuration - Identifies a collection of parameters and settings that define how an environment and its associated resources behave. Think of it like the PARAMETERS passed to the CLASS CONSTRUCTOR.

Saved Configuration - A template that you can use as a starting point for creating unique environment configurations.

Platform - Combination of an operating system, programming language runtime, web server, application server, and Elastic Beanstalk components.

Beanstalk Update/Deployment Options
-----------------------------------
Beanstalk supports several options for processing deployments (deployment policies):
- All at once:
  - Deploys the new version to all instances simultaneously.
  - All of your instances are out of service while the deployment takes place.
  - You will experience an outage while the deployment is taking place.
  - If the update fails, you need to roll back the changes by re-deploying the original version to all your instances.
- Rolling:
  - Deploys in batches: each batch is taken out of service whilst its deployment takes place.
  - Environment capacity will be reduced by the number of instances in a batch while the deployment takes place.
  - If the update fails, you need to roll back the changes via a rolling update.
  - Reduced capacity during deployment, but not an outage.
- Rolling with Additional Batch:
  - Maintains full capacity during the deployment process.
  - Launches an additional batch of instances before deploying the new version.
- Immutable:
  - Deploys the new version to a fresh group of instances in a new group. On success these are moved to the existing group, and the old instances are killed.
  - Preferred option for mission-critical stuff.

The policy can be chosen in the console, or pinned in config as sketched below.
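One way to pin the deployment policy (rather than clicking it in the console each time) is an option setting in an `.ebextensions` config file (these files are described in "Configuring Elastic Beanstalk" below). A minimal sketch - the `aws:elasticbeanstalk:command` namespace and its `DeploymentPolicy`/`BatchSize*` options are from the EB docs; the values are illustrative:

    option_settings:
      aws:elasticbeanstalk:command:
        DeploymentPolicy: RollingWithAdditionalBatch
        BatchSizeType: Percentage
        BatchSize: 25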
Instance Profiles - "Environment must have instance profile associated with it"
-------------------------------------------------------------------------------
See: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/iam-instanceprofile.html
See: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/environments-create-api.html

    An instance profile is a container for an AWS Identity and Access Management (IAM) role
    that you can use to pass role information to an Amazon EC2 instance when the instance
    starts.

When you launch an environment using the Elastic Beanstalk console or the EB CLI, Elastic Beanstalk creates a default instance profile, called `aws-elasticbeanstalk-ec2-role`, and assigns managed policies with default permissions to it.

If you see the error message "Environment must have instance profile associated with it" when trying to create an environment, you may not have sufficient privileges or could be trying to create the environment in the wrong region. It appears you will require permission to use the role `aws-elasticbeanstalk-ec2-role`.

How To Deploy Apps
------------------
NOTE: You must, generally, specify an Instance Profile for your EC2 instance. From [https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/iam-instanceprofile.html]:

    An instance profile is a container for an AWS Identity and Access Management (IAM) role
    that you can use to pass role information to an Amazon EC2 instance when the instance
    starts.

Compute > Elastic Beanstalk. Click "Get Started".

Create a web app:
- Spec a name and platform and the option "Upload your code" rather than "Sample application", which is the default.
- Click "Create application".
- Select how to load the code - can be a local file (ZIP) or an S3 bucket.
- If you choose to upload a local file, it will create an S3 bucket automatically for you and stash the ZIP there. I was surprised that it doesn't unzip the archive; it just stashes it.
- Once you click "Create Application" an environment will be created for your app. By creating an environment, AWS Elastic Beanstalk is allowed to manage AWS resources and permissions on your behalf. If the app was called HelloWorld then the environment is labeled HelloWorld-env. The creation can take up to 5 or 6 minutes, so be patient.
- When the environment reports itself as successfully launched, in the breadcrumb links select "Applications" to view a list of all applications. Here you should see your new environment, which you can click on to "drill into it".
- From the application env you should be able to see recent events and a URL to the website that was created.
- From the EB environment page you can also select "Configuration" from the side menu. This is how the environment can be configured. From here you can access security, instances, configure load balancers, capacity etc and apply these configurations.

Further Config/Update:
- Use the breadcrumbs to go back to the "all applications" page and select "application versions". It will show you the current source and the date it was deployed.
- Can upload a new version by clicking on the "Upload" button.
- If you are using an S3 bucket you can navigate to the S3 bucket management console and view the contents. Inside you will see all the code ZIPs you ever uploaded as well as a .elasticbeanstalk file and resources directory.
- The new version becomes visible but HAS NOT BEEN DEPLOYED to an environment.
- You can change your deployment policy to one of those described in the previous section.
- Click DEPLOY to deploy your new version.

Where is the app stored on the EB instance? It's in the /tmp/deployment/application folder during deployment and then moved to /var/app/current afterward.

Configuring Elastic Beanstalk
-----------------------------
The environment can be customized using config files - spec packages to install, user groups, shell cmds to run, etc etc. Files are written in YAML or JSON, have a ".config" suffix and must be saved in the folder `.ebextensions`, which must be included in the top-level directory of the app source code bundle. Thus, config files can be placed under source control with the rest of the app code. A small example is sketched below.
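For instance, a sketch of an `.ebextensions/01-setup.config` that installs a package and runs a shell command on deployment - the file name and contents are illustrative; the `packages` and `commands` keys are standard .ebextensions sections:

    packages:
      yum:
        git: []                # install git via yum

    commands:
      01_log_deploy_time:
        command: date >> /tmp/deploy-times.log   # record when deployments ran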
See also:
https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/ebextensions.html
https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/customize-containers-ec2.html
https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/environment-resources.html

Examples of commonly used `.ebextensions`: https://github.com/awsdocs/elastic-beanstalk-samples/tree/master/configuration-files

AWS "Getting Started" Elastic Beanstalk Tutorial
------------------------------------------------
Ref: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/GettingStarted.CreateApp.html

Configuring HTTPS & Certbot .ebextensions File
----------------------------------------------
https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/https-singleinstance-ruby.html
https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/configuring-https.html
https://medium.com/@jameshamann/ssl-certificates-with-rails-aws-elastic-beanstalk-and-lets-encrypt-8b047d527a60
https://docs.aws.amazon.com/acm/latest/userguide/gs-acm-validate-dns.html

Multi-instance Docker, by default, does not have a load balancer. To install one follow this guide: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/using-features-managing-env-types.html#using-features.managing.changetype. In summary: go to Environments > your-eb-environment > Config > Capacity (click Edit). Change environment type to "load balanced", configure the max #instances, and click "Apply".

The Medium.com article works if you are running an Nginx server. In this case, running Smashing in the Alpine container from GitHub, it is just running the "normal" Thin server.

This will use DNS VALIDATION - you must have control over your DNS entry so that you can add a temporary DNS record to, e.g., `_acme-challenge.your.site.address.com`, so that the ACME protocol can verify that you do indeed have control over your domain.

Getting Certificates From AWS Certificate Manager
-------------------------------------------------
https://docs.aws.amazon.com/acm/latest/userguide/gs-acm-validate-dns.html

Deploying Docker Images / Web Apps to AWS Elastic Beanstalk
-----------------------------------------------------------
MUST READ: https://docs.aws.amazon.com/AmazonECS/latest/developerguide/task_definition_parameters.html#standard_container_definition_params
Ref: https://ngeor.com/2017/05/09/how-to-deploy-a-smashing-dashboard-to-aws-elastic-beanstalk-with-docker.html
Ref: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/single-container-docker-configuration.html
Ref: https://www.youtube.com/watch?v=lBu7Ov3Rt-M

TO ACCESS DOCKER VIA EB SSH SESSION: `sudo usermod -aG docker ${USER} && su -s ${USER}`

Any web application that you deploy to Elastic Beanstalk in a single-container Docker environment must include a Dockerfile or a Dockerrun.aws.json file. Docker images can be deployed to EB. EB will set up the EC2 instance for you and can be configured, via a Dockerrun.aws.json JSON file, to map volumes from the EC2 instance into the Docker container as well as mapping EC2 ports too. Thus the web app can run in the Docker container.
vvvvvvvv CAUTION CAUTION CAUTION CAUTION CAUTION CAUTION vvvvvvvv
There is a difference between V1 and V2 of the Dockerrun.aws.json file:
    V1 = Single container - "AWSEBDockerrunVersion": "1",
    V2 = Multi container  - "AWSEBDockerrunVersion": "2",
The examples below are a little confused... because I was confused whilst looking at them. Some are V1 and some are V2, and the two types are not compatible - i.e., you can't use a V1 Dockerrun file on a multicontainer (V2) instance and vice versa.
SEE: https://stackoverflow.com/a/61444054/1517244
SEE: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/create_deploy_docker_v2config.html

The EC2 instances that EB creates can have environments that support only a single Docker container per EC2 instance, or multiple containers.

NOTE: STANDARD AND PRECONFIGURED DOCKER PLATFORMS ON EB SUPPORT ONLY A SINGLE DOCKER CONTAINER PER ELASTIC BEANSTALK ENVIRONMENT.

    In order to get the most out of Docker, Elastic Beanstalk lets you create an environment
    where your Amazon EC2 instances run multiple Docker containers side by side ... Elastic
    Beanstalk uses Amazon Elastic Container Service (Amazon ECS) to coordinate container
    deployments to multicontainer Docker environments.

SEE https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/create_deploy_docker_ecs.html#create_deploy_docker_ecs_platform
^^^^^^^^ CAUTION CAUTION CAUTION CAUTION CAUTION CAUTION ^^^^^^^^

* Ports: to map ports in the V2/ECS task-definition format, the JSON configuration would include:

    "portMappings": [
        {
            "containerPort": integer,
            "hostPort": integer
        }
        ...
    ]

Port mappings allow containers to access ports on the host container instance to send or receive traffic.

containerPort:
- The port number on the container that is bound to the user-specified or automatically assigned host port.
- If using containers in a task with the Fargate launch type, exposed ports should be specified using containerPort.
- If using containers in a task with the EC2 launch type and you specify a container port and not a host port, your container automatically receives a host port in the ephemeral port range.

hostPort:
- The port number on the container instance to reserve for your container.
- If using containers in a task with the Fargate launch type, the hostPort can either be left blank or be the same value as containerPort.
- If using containers in a task with the EC2 launch type, you can specify a non-reserved host port for your container port mapping (this is referred to as static host port mapping), or you can omit the hostPort (or set it to 0) while specifying a containerPort, and your container automatically receives a port (this is referred to as dynamic host port mapping) in the ephemeral port range for your container instance operating system and Docker version.

* Volumes: to map volumes, specify a list of volumes to be passed to the Docker daemon on a container instance. In the V1 format:

    {
        "Volumes": [
            {
                "HostDirectory": "/path/inside/host",
                "ContainerDirectory": "/path/inside/container"
            }
        ]
        ...
    }

The following are the types of data volumes that can be used:
1. Docker Volumes: a Docker-managed volume that is created under /var/lib/docker/volumes on the container instance.
2. Bind Mounts: a file or directory on the host machine is mounted into a container. Bind mount host volumes are supported when using either the EC2 or Fargate launch types.
To use bind mount host volumes, specify a host and optional sourcePath value in your task definition (V2 format). E.g.:

    "volumes": [
        {
            "name": "dashboards",
            "host": {
                "sourcePath": "/var/app/current/dashboards"
            }
        },
        ...

NOTE: /var/app/current is the directory on the host machine that contains the deployment artifact, i.e. the .zip file unzipped.

And so on. See the reference link at the top of the sub-section for details. To tie the V1 fragments together, a complete minimal V1 file is sketched below.
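A minimal but complete V1 (single container) Dockerrun.aws.json sketch - the image name, port and directories are made up for illustration; the field names (Image, Ports, Volumes, Logging) are from the single-container docs:

    {
      "AWSEBDockerrunVersion": "1",
      "Image": {
        "Name": "someorg/smashing-dashboard",
        "Update": "true"
      },
      "Ports": [
        {
          "ContainerPort": "3030"
        }
      ],
      "Volumes": [
        {
          "HostDirectory": "/var/app/current/dashboards",
          "ContainerDirectory": "/dashboards"
        }
      ],
      "Logging": "/var/log/thin"
    }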
To deploy:
- Go to EB on the AWS web console.
- Select "Create New Application".
- Give the app a name and description.
- Environment tier = Web Server. Predefined config = Docker. Environment type = Load balancing, autoscaling.
- Select "Upload Your Own" and upload the `Dockerrun.aws.json` file.

Terminating HTTPS on EC2 instances running Docker
- - - - - - - - - - - - - - - - - - - - - - - - -
See: https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/https-singleinstance-docker.html
See: above section "Configuring Elastic Beanstalk"

Create a file `.ebextensions/https-instance.config` to configure the NGINX instance. The config will create the following files, which are required by NGINX et al.:
1. /etc/nginx/conf.d/https.conf - configures the nginx server. This file is loaded when the nginx service starts.
2. /etc/pki/tls/certs/server.crt - creates the certificate file on the instance.
3. /etc/pki/tls/certs/server.key - creates the private key file on the instance.

!! CAUTION !! Avoid committing a configuration file that contains your private key to source control. After you have tested the configuration and confirmed that it works, store your private key in Amazon S3 and modify the configuration to download it during deployment. For instructions, see Storing private keys securely in Amazon S3.

To configure EB to download the private keys from S3 during deployment you will need to create `.ebextensions/privatekey.config`. For example:

    Resources:
      AWSEBAutoScalingGroup:
        Metadata:
          AWS::CloudFormation::Authentication:
            S3Auth:
              type: "s3"
              buckets: ["MY_BUCKET_NAME"]
              roleName:
                "Fn::GetOptionSetting":
                  Namespace: "aws:autoscaling:launchconfiguration"
                  OptionName: "IamInstanceProfile"
                  DefaultValue: "aws-elasticbeanstalk-ec2-role"

    files:
      # Private key
      "/etc/pki/tls/certs/server.key":
        mode: "000400"
        owner: root
        group: root
        authentication: "S3Auth"
        source: MY_BUCKET_URL/server.key

Private Repositories
- - - - - - - - - - -
You don't have to use just DockerHub images. You can configure it to deploy images from your own private repos.

[https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/create_deploy_docker.container.console.html]

    The Docker and Multicontainer Docker platforms for Elastic Beanstalk support the use of
    Docker images stored in a public OR PRIVATE online image repository. Specify images by
    name in Dockerrun.aws.json. Note these conventions:
    - Images in official repositories on Docker Hub use a single name (for example, ubuntu
      or mongo).
    - Images in other repositories on Docker Hub are qualified with an organization name
      (for example, amazon/amazon-ecs-agent).
    - Images in other online repositories are qualified further by a domain name (for
      example, quay.io/assemblyline/ubuntu or
      account-id.dkr.ecr.us-east-2.amazonaws.com/ubuntu:trusty).

[https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/single-container-docker-configuration.html]

    Add the information about the Amazon S3 bucket that contains the authentication file in
    the Authentication parameter of the Dockerrun.aws.json file. Make sure that the
    Authentication parameter contains a valid Amazon S3 bucket and key. The Amazon S3 bucket
    must be hosted in the same AWS Region as the environment that is using it.

For V1 (single Docker container):

    {
      "AWSEBDockerrunVersion": "1",
      ...
      "Authentication": {
        "Bucket": "my-bucket",
        "Key": "mydockercfg"
      },
      "Image": {
        "Name": "quay.io/johndoe/private-image",
        "Update": "true"
      },
      ...
    }

For V2 (multi Docker):

    {
      "AWSEBDockerrunVersion": 2,
      ...
      "authentication": {
        "bucket": "my-bucket",
        "key": "mydockercfg"
      },
      ...
    }

[https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/create_deploy_docker.container.console.html#docker-images-private]

    To use a Docker image in a private repository hosted by an online registry, you must
    provide an authentication file that contains information required to authenticate with
    the registry.

Generate an authentication file with the docker login command. For repositories on Docker Hub, run docker login:

    docker login registry-server-url

Upload a copy of the authentication file, named .dockercfg, to a secure Amazon S3 bucket.

Build Docker Image On Deployment
- - - - - - - - - - - - - - - - -
You can also get the deployment to BUILD your Docker image for you, rather than pull it from a repository:

[https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/single-container-docker-configuration.html]

    Create a Dockerfile when you don't already have an existing image hosted in a repository.

Deploying Ruby Applications to AWS Elastic Beanstalk with Git
-------------------------------------------------------------
Ref: https://aws.amazon.com/blogs/developer/deploying-ruby-applications-to-aws-elastic-beanstalk-with-git/

^^^ WARNING/NOTE: I think the article is out of date. It mentions using the command `eb start`. I have tried this but it does not appear to be a command supported by `eb`:

    eb: error: unrecognized arguments: start

The `eb` help says... to get started type "eb init", then type "eb create" and "eb open". I have followed its advice and replaced the article's second half with these suggested steps!

WARNING/NOTE: I also found that after successful setup Puma was having problems. I saw the following error message repeated many times in the Puma logs:

    connect() to unix:///var/run/puma/my_app.sock failed

Nginx, the web server, is failing to communicate with Puma, the application server. Why do we need both?
See - https://stackoverflow.com/a/50517532/1517244
See - https://www.justinweiss.com/articles/a-web-server-vs-an-app-server/

    A web server is a program that takes a request to your website from a user and does some
    processing on it. Then, it might give the request to your Rails app. Nginx and Apache are
    the two big web servers you'll run into.

    An app server is the thing that actually runs your Rails app. Your app server loads your
    code and keeps your app in memory. When your app server gets a request from your web
    server, it tells your Rails app about it. After your app is done handling the request,
    the app server sends the response back to the web server (and eventually to the user).

It seems routing to the app is:

    NGINX ----[pass CGI req]----> PUMA ----[query app]---->
        ----> APP ----[use Sinatra to build/route REST end points]----> [App API endpoint called]
        ----> [return CGI response] ----> PUMA ----> [return CGI response] ----> NGINX

The above should help picture what's going on and what the various names like "Puma", "Sinatra" etc etc mean.

See this SO thread for solutions: https://stackoverflow.com/q/30355569/1517244.
In short, the solution seems to be adding the following 2 lines to
`config/puma.rb`:

    bind "unix:///var/run/puma/my_app.sock"
    pidfile "/var/run/puma/my_app.pid"

(Note: the pidfile should point at a separate .pid file, not at the .sock file -
the socket and the PID file are two different things.)

See also: https://github.com/puma/puma/blob/master/README.md#configuration-file
Also, per the Puma README, the config file may be `config/puma/${RAILS_ENV}.rb` if
RAILS_ENV is defined.

Where to find the config? The file is stored inside the app's config directory.
The app is a Gem, so I assume we follow this: use `gem environment` to find out
about your gem environment. Look for one of the two lines:

- INSTALLATION DIRECTORY: /var/lib/gems/<version a.b.c>
- USER INSTALLATION DIRECTORY: /home/<user>/.gem/ruby/<version a.b.c>

Then, probably under the global installation directory, you will find, for example:

    /var/lib/gems/2.5.0/gems/puma-4.3.5

Can also use `bundle info <gem name>` to see where the Gem is installed.

To add these you will need to SSH into your EBS instance:

    eb ssh

URG... FIDDLY AND TIME CONSUMING JUST TO SET UP A DASH... ABANDONING - WILL USE
DOCKER IMAGE.

EBS is clever - Gems listed in the Gemfile are AUTOMATICALLY INSTALLED FOR YOU
(almost, anyway!).

To deploy to EBS you must download & install the AWS Elastic Beanstalk command
line tool. Get the install scripts from GitHub:

1. Make sure the following apps are installed:

       sudo apt install -y build-essential zlib1g-dev libssl-dev libncurses-dev libffi-dev \
           libsqlite3-dev libreadline-dev libbz2-dev

2. Clone https://github.com/aws/aws-elastic-beanstalk-cli-setup

       git clone https://github.com/aws/aws-elastic-beanstalk-cli-setup.git
       sudo ./aws-elastic-beanstalk-cli-setup/scripts/bundled_installer

   Running the install script takes a-g-e-s to install Python, so be patient. It
   doesn't give you any indication of progress :/ Go make a cup of tea and come
   back later!

3. Follow the "Finishing up" instructions at the end of the EB install to add EB
   to your PATH!

       echo 'export PATH="/home/jh/.ebcli-virtual-env/executables:$PATH"' >> ~/.bash_profile && source ~/.bash_profile

Then:

1. Initialize the application with the CLI tool by typing:

       eb init

   You will need your AWS access key ID and secret access key to log in.

   This creates a .elasticbeanstalk directory in the project root. Add it to
   .gitignore.

   You will also be asked if you want to continue with "CodeCommit". This option
   will allow you to create and manage Git repositories hosted by AWS CodeCommit
   with Elastic Beanstalk:

       AWS CodeCommit is a fully-managed source control service that makes it
       easy ... to host ... private Git repositories. ... Get 5 active users per
       month for free (within limits), after which you pay $1 per additional
       active user per month.

   This also creates a Git subcommand - `aws.push`. To use this you must have
   initialised eb to use CodeCommit! To deploy the app you can now just type
   `git aws.push` (but don't use it yet - wait until the environment has been
   created).

   If you don't want to use CodeCommit you can upload the code from your local
   machine, or an S3 bucket, like you would via the web console, by using
   `eb create` with the `--source` option. Type `eb create --help` for further
   information.

   Try also `eb init --help` for more init options. For example you can use
   `eb init --region eu-west-2` to specify the region on the command line. Seems
   like which application you use is not specifiable from the CLI... need to
   enter this :/
2. Create a new environment:

       eb create

   Choose the "application" load balancer. Do not enable "Spot Fleet Requests".

   An example of the command output is useful to look at to see what EBS is doing
   behind the scenes for us in terms of setting up load balancers, EC2 instances,
   S3 buckets, security groups etc:

       INFO createEnvironment is starting.
       INFO Using elasticbeanstalk-XXXX as Amazon S3 storage bucket for environment data.
       INFO Created target group named: XXXX
       INFO Created security group named: XXXX
       INFO Created security group named: XXXX
       INFO Created Auto Scaling launch configuration named: XXXX
       INFO Created Auto Scaling group named: XXXX
       INFO Waiting for EC2 instances to launch. This may take a few minutes.
       INFO Created Auto Scaling group policy named: XXXX
       INFO Created Auto Scaling group policy named: XXXX
       INFO Created CloudWatch alarm named: XXXX
       INFO Created CloudWatch alarm named: XXXX
       INFO Created load balancer named: XXXX
       INFO Created Load Balancer listener named: XXXX
       INFO Application available at XXXX.elasticbeanstalk.com.
       INFO Successfully launched environment: XXXX

   Type `eb create --help` for further information. Some of the available options
   include:

       --source SOURCE        - source of code to create from directly; example source_location/repo/branch
       --region REGION        - use a specific region
       --profile PROFILE      - use a specific profile from your credential file
       --elb-type ELB_TYPE    - load balancer type
       --instance_profile EP  - EC2 Instance profile, if you don't want to use the default.

   By DEFAULT, the EBS CLI will AUTOMATICALLY PACKAGE YOUR PROJECT DIRECTORY AS A
   ZIP FILE AND UPLOAD THIS TO A NEW S3 BUCKET THAT IT CREATES ON YOUR BEHALF.
   I.e., it creates an app archive from the application source code in the local
   project directory.

   To DEPLOY AN ARTIFACT INSTEAD OF THE PROJECT FOLDER:
   See https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/eb-cli3-configuration.html#eb-cli3-artifact
   The CLI doesn't by default allow you to use `--source` to specify a local ZIP
   file. You can tell the EB CLI to deploy a ZIP file or WAR file that you
   generate as part of a separate build process by adding the following lines to
   .elasticbeanstalk/config.yml in your project folder:

       deploy:
         artifact: path/to/buildartifact.zip

3. To debug things: under the environment created above, accessed through the web
   console, you can download server logs.

Beanstalk With RDS
------------------
RDS = Relational Database Service, which is a web service that makes it easier to
set up, operate, and scale a relational database in the AWS Cloud.

Can launch RDS within the EBS console:
- Good for development.
- Bad for production:
  - DB lifecycle is tied to the app lifecycle - termination of the env terminates
    the DB too.
- Prefer decoupling the RDS instance from the EBS env, i.e., launch it outside of
  EBS:
  - Provides more flexibility.
  - Can then connect multiple environments to the same DB.
  - Wider choice of DB types.
  - Can tear down the app without interfering with the DB.

To connect EBS to an outside RDS:
- Requires an additional Security Group - add it to the environment's Auto Scaling
  group.
- Provide connection string config info to the app servers (endpoint, pwd etc)
  using EBS environment properties.

Configuring HTTPS for your Elastic Beanstalk environment
--------------------------------------------------------
Simplest method = assign a server certificate to your environment's load balancer.
Client to load balancer is secure. Balancer to EC2 instance is not, but the
instance is hidden behind the balancer, so this should be secure enough?
For end-to-end HTTPS in a load balanced environment, you can combine instance and
load balancer termination to encrypt both connections.

With AWS Certificate Manager (ACM), you can create a trusted certificate for your
domain names for free. BUT domain names are pay-to-own.

SEE https://aws.amazon.com/premiumsupport/knowledge-center/elastic-beanstalk-https-configuration/

NOTE - TO TERMINATE AT LOAD BALANCER: when configuring the load balancer, make
sure it listens on port 443 using HTTPS but *forwards* traffic to the backend
using port 80, HTTP, if you wish to terminate the HTTPS connection at the load
balancer.

Terminate HTTPS on the load balancer:
1. Open the Elastic Beanstalk console, and then select your environment.
2. In the navigation pane, choose Configuration.
3. In the Load balancer category, choose Modify.
4. To add the listener for port 443, choose one of the following sets of steps
   based on the type of load balancer that your Elastic Beanstalk environment has.

To add a listener for a Classic Load Balancer:
1. Choose Add Listener.
2. For Port, enter the incoming traffic port (typically 443).
3. For Protocol, choose HTTPS.
4. For Instance Port, enter 80.
5. For Instance Protocol, choose HTTP.
6. For SSL certificate, choose your certificate, and then choose the SSL policy
   that you want to use from the drop-down menu.
7. Choose Add, and then choose Apply.

To add a listener for an Application Load Balancer:
1. Choose Add Listener.
2. For Port, enter the incoming traffic port (typically 443).
3. For Protocol, choose HTTPS.
4. For SSL certificate, choose your certificate, and then choose the SSL policy
   that you want to use from the drop-down menu.
5. Choose Add, and then choose Apply.

The same listener can also be set up from a config file - see the sketch below.
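For reference, the Classic Load Balancer console steps above can also be expressed
as an `.ebextensions` config file. A minimal sketch, assuming a Classic Load
Balancer and an ACM certificate (the file name and ARN are placeholders you would
replace with your own):

    # .ebextensions/https-lb.config (hypothetical file name)
    option_settings:
      # Listen on 443 with HTTPS and forward to the instances on port 80 (HTTP),
      # i.e. terminate HTTPS at the load balancer.
      aws:elb:listener:443:
        ListenerProtocol: HTTPS
        SSLCertificateId: arn:aws:acm:eu-west-2:123456789012:certificate/REPLACE_ME
        InstancePort: 80
        InstanceProtocol: HTTP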
Cloudfront
See: https://aws.amazon.com/cloudfront/#:~:text=Amazon%20CloudFront%20is%20a%20fast,within%20a%20developer%2Dfriendly%20environment.

From the AWS website:

    Amazon CloudFront is a fast content delivery network (CDN) service that
    securely delivers data, videos, applications, and APIs to customers globally
    with low latency, high transfer speeds, all within a developer-friendly
    environment.

Another AWS page:

    Amazon CloudFront is a web service that speeds up distribution of your static
    and dynamic web content, for example, .html, .css, .php, image, and media
    files, to end users. CloudFront delivers your content through a worldwide
    network of edge locations. When an end user requests content that you're
    serving with CloudFront, the user is routed to the edge location that provides
    the lowest latency, so content is delivered with the best possible
    performance. If the content is already in that edge location, CloudFront
    delivers it immediately. If the content is not currently in that edge
    location, CloudFront retrieves it from an Amazon S3 bucket or an HTTP server
    (for example, a web server) that you have identified as the source for the
    definitive version of your content.

[https://adrianhall.github.io/cloud/2019/01/31/which-aws-service-for-hosting/]:

    If your web site is just a collection of HTML, CSS, and JavaScript with no
    other dependencies, and you receive this from a marketing or design firm, you
    can just drag and drop the files onto your S3 bucket and get them deployed
    easily. This option may be for you. It's simple to understand - you just
    publish your files to the S3 bucket and CloudFront picks them up and
    distributes them worldwide, also handling things like HTTPS for you. ...
    Best for: Pre-packaged static sites provided by marketing organizations that
    are deployed via drag and drop.

CloudFront may allow FREE encryption using TLS via the AWS Certificate Manager.

- Global content delivery network (CDN).
- Application acceleration and optimisation.
- Distributed, scalable, integrated security controls.
- Optimised for delivery use cases with intelligent caching.
- On-demand, full user control. Cost effective.
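As a sketch of that S3 + CloudFront publishing flow from the CLI (the bucket name
and distribution ID are hypothetical placeholders):

    # Copy the static site files to the S3 bucket serving as the CloudFront origin.
    aws s3 sync ./site s3://my-static-site-bucket --delete

    # Invalidate cached copies at the edge locations so the new files are served.
    aws cloudfront create-invalidation --distribution-id E1ABCDEFGHIJKL --paths "/*"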
Amplify Framework
From the AWS website:

    Provides a set of libraries and UI components and a command line interface to
    build mobile backends and integrate with your iOS, Android, Web, and React
    Native apps.

[https://adrianhall.github.io/cloud/2019/01/31/which-aws-service-for-hosting/]:

    an awesome continuous deployment (CD) platform for your web site. It has built
    in support ... for JavaScript single page applications written in a variety of
    frameworks like React ...
    Best for: Static site generators, JavaScript based single page applications.
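A rough sketch of the Amplify CLI flow for publishing a single page app (assuming
hosting via the Amplify CLI; the project itself is whatever your framework
generates):

    # Install the Amplify CLI and initialise the project.
    npm install -g @aws-amplify/cli
    amplify init

    # Add hosting to the project, then build and deploy it.
    amplify add hosting
    amplify publish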
Lightsail
Lightsail is an easy-to-use cloud platform that offers everything needed to build
an app or website, plus a cost-effective monthly plan. Ideal for simpler
workloads, quick deployments, and getting started on AWS.

The pricing scheme is way simpler than, for example, EBS. It seems to abstract
even further away, so that we don't even have to worry about which EC2 and S3 cost
models we'd use etc.

[https://adrianhall.github.io/cloud/2019/01/31/which-aws-service-for-hosting/]:

    If your app relies on a web language backend (like Ruby or PHP) or you use a
    common web site platform (like WordPress or Magento), then you might want to
    choose Amazon Lightsail. ...
    Best for: Common web stacks like LAMP, MEAN, and PHP, or common web
    applications like WordPress

SSL/TLS Certificates In LightSail
---------------------------------
Ref: https://lightsail.aws.amazon.com/ls/docs/en_us/articles/understanding-tls-ssl-certificates-in-lightsail-https

Lightsail uses SSL/TLS certificates to handle encrypted web traffic (HTTPS
requests). You can create certificates, verify domain ownership, and then attach
the validated certificates to your Lightsail load balancer.
Dynamo Database
NoSQL databases (aka "not only SQL") are non-tabular, and store data differently
than relational tables. NoSQL databases come in a variety of types based on their
data model. The main types are document, key-value, ... They provide flexible
schemas and scale easily with large amounts of data and high user loads.

Dynamo is a NoSQL database. For apps that need single digit millisecond latency at
any scale. Supports document (JSON, XML, HTML) and key-value data models.

DOCUMENTS:
- See https://aws.amazon.com/nosql/document/
- See https://developer.couchbase.com/documentation/server/3.x/developer/dev-guide-3.0/compare-docs-vs-relational.html
- Class of non-relational (NoSQL) database.
- Store data in documents similar to JSON (JavaScript Object Notation) objects.
  - Contain pairs of fields & values.
  - Values can be many things: strings, numbers, arrays, objects etc.
- Structure typically aligns with the objects developers work with - think objects
  serialised to JSON.
- Powerful query languages.
- Unlike relational DBs, does not need a pre-defined schema, and does less data
  normalisation, generally.

TABLES:
Class of non-relational (NoSQL) database. Table rows are called ITEMS. Rows are
broken up into columns called ATTRIBUTES.

              TABLE
           +--------+---------+-----+---------+
           |Attrib1 | Attrib2 | ... | AttribN |
           +--------+---------+-----+---------+
    Item 1 |        |         |     |         |
           +--------+---------+-----+---------+
    Item 2 |        |         |     |         |
           +--------+---------+-----+---------+
    ....   |        |         |     |         |
           +--------+---------+-----+---------+
    Item M |        |         |     |         |
           +--------+---------+-----+---------+

Each item is basically a set of key-value pairs. Think of each item as a JS object
with keys and values, or a JSON spec.

Data is stored/retrieved based on a PRIMARY KEY. Two types of primary keys:

1. PARTITION KEY
   A unique attribute. The key is hashed, which determines the partition/physical
   location where the data is stored. No two items can have the same partition
   key.
2. COMPOSITE KEY (PARTITION KEY + SORT KEY)
   E.g., the same user posting multiple times to a forum: the partition key (the
   user ID) is the same for each post, so a sort key (e.g. a timestamp)
   distinguishes the items.

IAM Conditions
--------------
They are CONDITIONS which are added to an IAM POLICY. These can be used to
restrict a user's access to only certain items (rows) in a table! IAM conditions
contain the following important keys (see the sketch policy at the end of this
section):

* "Sid" - STATEMENT IDENTIFIER - A unique identifying name for the policy.
* "Effect" - Are we allowing/denying etc.
* "Action" - The actions that this policy allows/denies (see "Effect"). For
  example "dynamodb:GetItem", "dynamodb:PutItem", "dynamodb:UpdateItem".
* "Resource" - The unique Amazon resource name for your DynamoDB table.
* "Condition" - This is where you specify the conditions on which the policy
  applies, e.g., allowing users to access only the items where the Partition Key
  value matches their user ID.
  * "ForAllValues:StringEquals"
  * "dynamodb:Attributes"

Consistency Models
------------------
1. Strongly consistent - Any writes that occurred before the read are reflected in
   the read, consistently across all locations.
2. Eventually consistent - Can take up to 1 second for new writes to be reflected
   in a read across all locations.
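A sketch of such a fine-grained access policy, based on the pattern in the AWS
docs (the table ARN is a placeholder, and the Cognito identity variable assumes
your users authenticate via a Cognito identity pool):

    {
      "Version": "2012-10-17",
      "Statement": [
        {
          "Sid": "AllowAccessToOnlyItemsMatchingUserID",
          "Effect": "Allow",
          "Action": [
            "dynamodb:GetItem",
            "dynamodb:Query",
            "dynamodb:PutItem",
            "dynamodb:UpdateItem"
          ],
          "Resource": "arn:aws:dynamodb:eu-west-2:123456789012:table/MyTable",
          "Condition": {
            "ForAllValues:StringEquals": {
              "dynamodb:LeadingKeys": ["${cognito-identity.amazonaws.com:sub}"]
            }
          }
        }
      ]
    }

Here "dynamodb:LeadingKeys" restricts access to items whose partition key matches
the caller's Cognito identity.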
Amazon Athena
Serverless interactive query service that makes it easy to analyze data in Amazon
S3 using standard SQL. Easy to use: point to your data in Amazon S3, define the
schema, and start querying using standard SQL. I.e., query non-database data
contained in things like CSV and JSON, with a shim on top to make it look like a
normal database that can be queried with standard SQL.

Why Athena? - challenges customers faced - they had to do work to get to S3 data,
e.g. complex transformations etc:
- Significant amount of work required to analyze data in Amazon S3.
- Users often only have access to aggregated data sets.
- Managing a Hadoop cluster or data warehouse requires expertise.

What Athena gives: Athena is an INTERACTIVE QUERY SERVICE that makes it EASY to
analyse data DIRECTLY FROM AWS S3 using STANDARD SQL. Results come back in
seconds. It is NOT a database or a data warehouse - it is designed to enable you
to write fast SQL queries with nothing to manage.
- No infrastructure or administration.
- Zero spin up time.
- Transparent upgrades.

Easy to use (see the example DDL and query below):
- Log into the console.
- Create a table:
  - Type in a Hive Data Definition Language (DDL) statement, or
  - Use the console Add Table wizard.
- Start querying :)
  - Query data in its RAW FORMAT!!
    - Text, CSV, JSON, weblogs, AWS service logs etc.
    - Stream data directly from S3.
  - Query using ANSI SQL - a well-known language most devs are familiar with.

Highly available:
- Takes advantage of S3 durability and availability.
- Service endpoint or log into console.
- 99.999999999% durability.
- Uses warm compute pools across multiple availability zones.
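As a small illustration (the bucket name, table name and columns are made up), you
could define a table over CSV files in S3 and query it like this:

    -- Hive DDL: define a schema over raw CSV files sitting in S3.
    CREATE EXTERNAL TABLE IF NOT EXISTS weblogs (
      request_ip   STRING,
      request_time STRING,
      status       INT
    )
    ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
    LOCATION 's3://my-log-bucket/weblogs/';

    -- Standard SQL against the raw files - no data is loaded anywhere.
    SELECT status, COUNT(*) AS hits
    FROM weblogs
    GROUP BY status;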
Codecommit / Code Deploy / Code Pipeline
Code Deploy
-----------
Can deploy to EC2 or on-premises systems! 2 approaches:

1. IN PLACE
   - The app is stopped on each instance and the new release is installed,
   - AKA ROLLING UPDATE,
   - Capacity is reduced during deployment,
   - Lambda not supported,
   - Roll-back involves a re-deploy,
   - Good for 1st time deployment.
2. BLUE / GREEN
   - New instances are provisioned and the new release is installed on the new
     instances. BLUE == active deployment, GREEN == new release.
     The ELB will be configured to route traffic to the green environment, but
     importantly the blue environment is left intact. Therefore, rolling back to
     the previous version is super easy and no down-time is experienced at either
     step. When green is validated, the blue is just deleted and the green becomes
     blue,
   - No capacity reduction during deployment,
   - You pay for 2 environments until you terminate the old servers,
   - Safest option!

AppSpec File - CodeDeploy Config
- - - - - - - - - - - - - - - - -
The AppSpec file configures the CodeDeploy environment by defining the parameters
to be used during deployment. The file specifies 4 main topics (see the sketch
appspec.yml at the end of this section):

1. Version - reserved for future use.
2. OS - which OS the instances run.
3. Files - the location of the application files to be copied and where to copy
   them to.
4. HOOKS - lifecycle event hooks specify scripts that need to be run at set points
   in the deployment lifecycle. They have a very specific order. Hooks include the
   following, shown in RUN ORDER, and are used to run scripts:
   - BeforeInstall
   - AfterInstall
   - ApplicationStart
   - ValidateService

Doing A Deployment
- - - - - - - - - -

    # Install the CodeDeploy agent on your EC2 instance:
    sudo yum update
    sudo yum install ruby
    sudo yum install wget
    cd /home/ec2-user
    wget https://aws-codedeploy-eu-central-1.s3.amazonaws.com/latest/install
    chmod +x ./install
    sudo ./install auto
    sudo service codedeploy-agent status

    # Create your application.zip and load it into CodeDeploy:
    aws deploy create-application --application-name mywebapp
    aws deploy push --application-name mywebapp --s3-location s3://<MY_BUCKET_NAME>/webapp.zip --ignore-hidden-files
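A sketch of an appspec.yml tying the 4 topics together (the paths and script names
are made up for illustration):

    version: 0.0
    os: linux
    files:
      # Copy everything in the revision to the web root.
      - source: /
        destination: /var/www/html
    hooks:
      BeforeInstall:
        - location: scripts/install_dependencies.sh
          timeout: 300
          runas: root
      ApplicationStart:
        - location: scripts/start_server.sh
          timeout: 300
          runas: root
      ValidateService:
        - location: scripts/check_health.sh
          timeout: 300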
Elastic Container Service (Ecs) / Fargate & Ec2
Lets you run containerised workloads in AWS. Has deep integration with AWS
services like Route53, IAM, VPC etc. ECS can run containers in 3 different
configurations:

1. ECS - Run containers on a cluster of virtual machines.
2. FARGATE - This is the serverless option - no need to worry about the underlying
   EC2 instances.
3. EC2 - Used for fine-grained control - control the installation, config and
   management of the compute environment.

Elastic Container Registry (ECR)
--------------------------------
To run Docker services you must create an ECR - Elastic Container Registry. ECR is
just a place where Docker images are stored. ECS connects to the ECR and uses the
images to deploy Docker containers.

Startup:
    aws configure

Login:
    eval $(aws ecr get-login --no-include-email --region eu-west-2)

Create repo:
    aws ecr create-repository --repository-name my-repo-name --region eu-west-2

The command will return the REPOSITORY_URL. E.g. (note that ECR repository names
must be lowercase):

    jh@jehtech$ aws ecr create-repository --repository-name jehtech --region eu-west-2
    {
        "repository": {
            "repositoryArn": "arn:aws:ecr:eu-west-2:300012345678:repository/jehtech",
            "registryId": "300012345678",
            "repositoryName": "jehtech",
            "repositoryUri": "300012345678.dkr.ecr.eu-west-2.amazonaws.com/jehtech", << THIS IS REPOSITORY_URL
            "createdAt": 1593780814.0,
            "imageTagMutability": "MUTABLE",
            "imageScanningConfiguration": {
                "scanOnPush": false
            }
        }
    }

Upload:
    docker tag image-i-want-to-upload:version-tag REPOSITORY_URL/image-i-want-to-upload:version-tag
    docker push REPOSITORY_URL/image-i-want-to-upload:version-tag

See
---
https://docs.aws.amazon.com/AmazonECS/latest/developerguide/docker-basics.html
https://docs.aws.amazon.com/elasticbeanstalk/latest/dg/single-container-docker-configuration.html

Example Using Smashing Docker Image On ECS
------------------------------------------
https://ngeor.com/2017/05/09/how-to-deploy-a-smashing-dashboard-to-aws-elastic-beanstalk-with-docker.html

Micro Services
--------------
Monolithic apps - many distinct parts of an application on one server make it hard
to scale, because the demands on that one server increase rapidly and the
application either has to be duplicated on another server or split up into
components which can expand on their own servers. All of this is hard with a
monolithic app: you have to SCALE THE ENTIRE APPLICATION. E.g. user profile data,
photo uploading and editing, thumbnails, photo viewer etc. in one app on one
server. A failure in any one component of the app would probably bring down the
whole application.

With microservices you ONLY HAVE TO SCALE THE PARTS THAT NEED IT. Separates the
jobs of an independent server into INDEPENDENT SMALLER services. Gives the ability
to use smaller resources to handle large tasks. Each service does just its
workload. So now, e.g., requests for auth go to the auth microservice, photo
uploads are processed by the upload microservice, the editing by the edit service,
and so on. Containers are an excellent way to serve microservices.

Fargate
-------
Running containers on EC2 is good, but you need to keep track of all the pieces
you need to manage. You have to manage:
1. The EC2 instances (your cluster) on which the containers sit.
   - For example you might have to commission several EC2 instances, have them in
     different subnets, and then connect the subnets together in a Virtual Private
     Cloud (VPC).
   - Have to continually patch/update the instances and evaluate security
     robustness.
   - Have to set up autoscaling.
   - Have to have Docker installed on each instance.
   - You will interact with the Docker instances using ECS - Elastic Container
     Service.
2. The containers themselves.
   - Each EC2 instance will have multiple Docker containers running in it.

Fargate answers the question: HOW CAN YOU RUN CONTAINERS WITHOUT HAVING TO WORRY
ABOUT SETTING UP ALL THIS INFRASTRUCTURE?

AWS Fargate is a serverless computing engine and hosting option for
container-based workloads. Fargate abstracts away the infrastructure management
issue. You no longer have to "see" the EC2 instance details. You just see Fargate
clusters and ECS. You only pay when your containers are running.
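A rough sketch of launching a container on Fargate from the CLI (the cluster name,
task definition and subnet ID are hypothetical):

    # Create a cluster - no EC2 instances are provisioned for Fargate tasks.
    aws ecs create-cluster --cluster-name my-fargate-cluster

    # Register a task definition (a JSON file describing the container image,
    # CPU and memory), then run it with the FARGATE launch type.
    aws ecs register-task-definition --cli-input-json file://taskdef.json
    aws ecs run-task \
        --cluster my-fargate-cluster \
        --launch-type FARGATE \
        --task-definition my-task:1 \
        --network-configuration "awsvpcConfiguration={subnets=[subnet-0123456789abcdef0],assignPublicIp=ENABLED}"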
Kinesis
Which Aws Service To Publish A Website?
Ref: https://adrianhall.github.io/cloud/2019/01/31/which-aws-service-for-hosting/

Brilliant article, from which the quotes in the sections above have been taken.
Summary here. Options include:

- Amazon S3 + Amazon CloudFront
  Best for: Pre-packaged static sites provided by marketing organizations that are
  deployed via drag and drop.
- AWS Amplify Console
  Best for: Static site generators, JavaScript based single page applications.
- Amazon Lightsail
  Best for: Common web stacks like LAMP, MEAN, and PHP, or common web applications
  like WordPress, MediaWiki, and Magento.
- AWS Elastic Beanstalk
  Best for: Your most challenging enterprise apps where access to the underlying
  OS is required.
- AWS EC2 - Do-it-yourself compute / storage / network stack
  Best for: Your most challenging enterprise apps where you can use a variety of
  AWS services to augment your service offering.