550 likes | 607 Views
This Amazon Redshift presentation will cover most of the basic concepts of Redshift. In this video we will be looking into what is AWS, why Amazon Redshift, what is Amazon Redshift, and the advantages of Amazon Redshift. We will also be looking into the architecture of Amazon Redshift, Some additional concepts in Amazon Redshift, the companies using Amazon Redshift and finally a demo on Amazon Redshift. Amazon Redshift is a service provided by AWS. Amazon Redshift is a cloud-based service or a data warehouse service that is used for collecting and storing data. Also, it enables a user to analyze the data using BI tools and simplifies the process of handling large scale data sets. Amazon redshift is the fastest data warehouse available. <br><br>Below topics in this Amazon Redshift presentation:<br>1. What is AWS?<br>2. Why Amazon Redshift?<br>3. What is Amazon Redshift?<br>4. Advantages of Amazon Redshift<br>5. Architecture of Amazon Redshift<br>6. Additional concepts in Amazon Redshift<br>7. Companies using Amazon Redshift<br>8. Demo<br><br>This AWS certification training is designed to help you gain an in-depth understanding of Amazon Web Services (AWS) architectural principles and services. You will learn how cloud computing is redefining the rules of IT architecture and how to design, plan, and scale AWS Cloud implementations with best practices recommended by Amazon. The AWS Cloud platform powers hundreds of thousands of businesses in 190 countries, and AWS certified solution architects take home about $126,000 per year.<br><br>This AWS certification course will help you learn the key concepts, latest trends, and best practices for working with the AWS architecture u2013 and become industry-ready aws certified solutions architect to help you qualify for a position as a high-quality AWS professional.<br> <br>The course begins with an overview of the AWS platform before diving into its individual elements: IAM, VPC, EC2, EBS, ELB, CDN, S3, EIP, KMS, Route 53, RDS, Glacier, Snowball, Cloudfront, Dynamo DB, Redshift, Auto Scaling, Cloudwatch, Elastic Cache, CloudTrail, and Security. Those who complete the course will be able to: <br><br>1. Formulate solution plans and provide guidance on AWS architectural best practices<br>2. Design and deploy scalable, highly available, and fault tolerant systems on AWS<br>3. Identify the lift and shift of an existing on-premises application to AWS<br>4. Decipher the ingress and egress of data to and from AWS<br>5. Select the appropriate AWS service based on data, compute, database, or security requirements<br>6. Estimate AWS costs and identify cost control mechanisms<br><br>This AWS course is recommended for for professionals who want to pursue a career in Cloud computing or develop Cloud applications with AWS. Youu2019ll become an asset to any organization, helping leverage best practices around advanced cloud-based solutions and migrate existing workloads to the cloud.<br><br>Learn more at https://www.simplilearn.com/cloud-computing/aws-solution-architect-associate-training
E N D
What’s in it for you? What is AWS? Why Amazon Redshift? What is Amazon Redshift? Advantages of Amazon Redshift Architecture of Amazon Redshift Additional concepts in Amazon Redshift Companies using Amazon Redshift Demo
What is AWS? AWS or Amazon Web Services is a secure cloud service platform from Amazon
What is AWS? AWS services can be used to create and deploy any application in the cloud AWS or Amazon Web Services is a secure cloud service platform from Amazon
What is AWS? AWS services can be used to create and deploy any application in the cloud It provides services over the internet AWS or Amazon Web Services is a secure cloud service platform from Amazon
What is AWS? AWS services can be used to create and deploy any application in the cloud It provides services over the internet AWS or Amazon Web Services is a secure cloud service platform from Amazon With AWS, you pay only for what you use
Why Amazon Redshift? Before Amazon Redshift Fetching data from data warehouse Note: Data warehouse is a repository where the data is stored
Why Amazon Redshift? Before Amazon Redshift GIF credits: tenure.com
Why Amazon Redshift? Before Amazon Redshift Cons of traditional data warehouse service • Time consuming • Maintenance costs outweigh the benefits • Loss of information • Data rigidity
Why Amazon Redshift? Before Amazon Redshift Cons of traditional data warehouse service these problems could have been solved With the help of Amazon Redshift • Time consuming preparation • Maintenance costs outweigh the benefits • Loss of information • Data rigidity
Why Amazon Redshift? But what is Amazon redshift? Before Amazon Redshift Cons of traditional data warehouse service these problems could have been solved With the help of Amazon Redshift • Time consuming preparation • Maintenance costs outweigh the benefits • Loss of information • Data rigidity
What is Amazon Redshift? Amazon Redshift is a cloud-based service or a data warehouse service that is used for collecting and storing data. Also, it enables a user to analyze the data using BI tools and simplifies the process of handling large scale data sets
USECASE • DNA (Telecommunications company) had an issue with managing website data and Amazon S3 data which lead down to slow process of their application
USECASE • DNA (Telecommunications company) had an issue with managing website data and Amazon S3 data which lead down to slow process of their application • To overcome this issue, Amazon Redshift was used, and the company has noticed 52% increase in the application performance
Advantages of Amazon Redshift High performance
Advantages of Amazon Redshift Low Cost High performance
Advantages of Amazon Redshift Scalability Low Cost High performance
Advantages of Amazon Redshift Availability Scalability Low Cost High performance
Advantages of Amazon Redshift Availability Scalability Low Cost High performance Security
Advantages of Amazon Redshift Availability Scalability Low Cost High performance Flexibility Security
Advantages of Amazon Redshift Availability Scalability Low Cost High performance Flexibility Security Simple database migration
Architecture of Amazon Redshift Client application JDBC ODBC Leader node AMAZON REDSHIFT Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application • The client applications of Amazon redshift interacts using two drivers. They are: • JDBC • ODBC JDBC ODBC Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application Amazon Redshift service can monitor connections from other applications using JDBC connections JDBC ODBC Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application ODBC allows a user (directly from any application) to interact with live data of Amazon Redshift JDBC ODBC Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application Amazon Redshift has a set of computing resources called nodes,which is gathered into a group called a data warehouse cluster JDBC ODBC Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application Amazon Redshift has a set of computing resources called nodes, which is gathered into a group called a data warehouse cluster JDBC ODBC Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application JDBC ODBC Leader node Compute node n Compute node 1 Each cluster has one or more databases Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application This node manages the interaction between the client application and compute nodes. It analyses and develops designs in order to carry out database operations JDBC ODBC Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application JDBC ODBC The leader node runs the program and assigns the code to individual compute nodes The leader node runs program and assigns the code to individual compute nodes Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application JDBC ODBC The compute node executes the program and shares the result back to the leader node for final aggregation Leader node Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application JDBC ODBC Leader node Compute nodes are categorized into slices. Each node slice is allotted with specific memory space, where it processes its workload Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Architecture of Amazon Redshift Client application JDBC ODBC Leader node These node slices work in parallel in order to finish their work Compute node n Compute node 1 Node slices Node slices … … … … Datawarehouse Cluster
Additional concepts in Amazon Redshift Compression Column storage
Column Storage Column storage is an essential factor in optimizing query performance and resulting in quicker outputs Below example shows how database tables stores record into disk blocks by row: 201259797 | SAM | 20 | 18TH ST | CHICAGO | 987259797| ANA | 20 | 16th ST | HOUSTON | 777259712 | CHIN | 40 | 20TH ST | PHOENIX
Compression • Compression is a column-level operation which decreases storage requirements • It eventually improves query performance The syntax for column compression is: CREATE TABLE table_name (column_namedata_type ENCODE encoding-type)[, ...]