22 Jul


Choosing a cloud data warehouse: Comparing some of the top CDW vendors

There are many popular cloud-based data warehouse platforms to choose from, including Amazon Redshift, Google BigQuery, Microsoft Azure Synapse Analytics, Snowflake, and others, and there are just as many important considerations when deciding on the right solution for your organization.

While many of the popular cloud data platforms offer similar capabilities, you'll find many differences in pricing, scalability, architecture, security features, speed, and other factors.

Here we compare the four top vendors for the enterprise:
Amazon vs. Azure vs. Google vs. Snowflake

Microsoft Azure Synapse Analytics: Taking SQL beyond data warehousing.
Azure Synapse Analytics is a newer analytics service that brings together enterprise data warehousing and big data analytics. It gives you the freedom to query data using either serverless on-demand or provisioned resources. Azure Synapse offers a unified experience to ingest, prepare, manage, and serve data for your business intelligence (BI) and machine learning (ML) needs.

At the heart of Azure Synapse is a cloud-native, distributed SQL processing engine. It's built on the foundation of SQL Server to drive your most demanding enterprise data warehousing workloads. Similar to other cloud MPP (massively parallel processing) solutions, Azure Synapse, formerly Azure SQL Data Warehouse (SQL DW), separates storage and compute and bills for each separately. It stores relational table data in columnar format and abstracts physical machines by representing compute power in the form of data warehouse units (DWUs). This allows your users to scale compute resources up or down at will.
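To make the separate billing of storage and compute concrete, here is a minimal sketch of a cost estimate. The rates are hypothetical placeholders, not real Azure prices, and the linear "DWUs x hours x rate" formula is a simplifying assumption rather than Azure's actual billing model. The point it illustrates is that running compute for fewer hours lowers the compute charge while the storage charge stays the same.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Hypothetical prices for illustration only; NOT real Azure pricing."""
    per_dwu_hour: float   # cost of one DWU running for one hour
    per_tb_month: float   # cost of storing one terabyte for one month


def monthly_cost(rates, dwus, compute_hours, storage_tb):
    """Return (compute, storage, total) for one month.

    Compute and storage are calculated independently, mirroring the
    idea that the two are billed separately.
    """
    compute = dwus * compute_hours * rates.per_dwu_hour
    storage = storage_tb * rates.per_tb_month
    return compute, storage, compute + storage


# Assumed example rates, chosen only to keep the arithmetic easy to follow.
rates = Rates(per_dwu_hour=0.25, per_tb_month=20.0)

# Same 5 TB of data, same 100 DWUs, but different running hours.
always_on = monthly_cost(rates, dwus=100, compute_hours=720, storage_tb=5)
business_hours = monthly_cost(rates, dwus=100, compute_hours=160, storage_tb=5)

print("always on:      ", always_on)
print("business hours: ", business_hours)
```

In this toy model the storage figure is identical in both scenarios; only the compute portion changes with the number of hours the warehouse runs.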

Synapse Analytics aims to unify a range of analytics workloads, such as data warehouses, data lakes, and ML, in a single user interface (UI). The combination of a SQL engine, Apache Spark with Azure Data Lake Storage (ADLS), and Azure Data Factory gives users the option to control both the data warehouse or data lake and data preparation for ML tasks. Azure Synapse allows for both vertical and horizontal scaling of the data warehouse: vertically by changing the service tier or placing the database in an elastic pool, and horizontally by adding more data warehouse units.

Google BigQuery: A serverless solution.
BigQuery is a fully managed, serverless data warehouse that automatically scales to match storage and computing power needs. Google doesn't expect you to manage your data warehouse infrastructure, which is why BigQuery hides many of the underlying hardware, database, node, and configuration details. Its elasticity works automatically out of the box. Getting started is simply a matter of creating an account with Google Cloud Platform (GCP), loading a table, and running a query. Google takes care of the rest.

With BigQuery, you get a columnar database with ANSI SQL support that can analyze terabytes to petabytes of data at high speed. BigQuery also lets you do spatial analysis using familiar SQL with BigQuery GIS. In addition, you can quickly build and operationalize ML models on large-scale structured or semi-structured data using simple SQL with BigQuery ML. And you can support real-time interactive dashboarding with BigQuery BI Engine.
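As a rough illustration of the "familiar SQL" claims above, the sketch below holds two BigQuery statements as Python strings: one uses BigQuery GIS functions and the other trains a BigQuery ML model. The dataset, table, and column names (`demo.stores`, `demo.sales`, `ad_spend`, and so on) are hypothetical. The `run` helper assumes the `google-cloud-bigquery` package is installed and GCP credentials are configured. None of this comes from the original article.

```python
# All dataset, table, and column names below are hypothetical.

# BigQuery GIS: distance in meters from each store to a fixed reference point.
# ST_GEOGPOINT takes longitude first, then latitude.
STORE_DISTANCE_SQL = """
SELECT
  store_id,
  ST_DISTANCE(
    ST_GEOGPOINT(longitude, latitude),
    ST_GEOGPOINT(-79.38, 43.65)
  ) AS meters_from_reference
FROM `demo.stores`
ORDER BY meters_from_reference
"""

# BigQuery ML: train a linear regression model with a single SQL statement.
TRAIN_MODEL_SQL = """
CREATE OR REPLACE MODEL `demo.sales_forecast`
OPTIONS (model_type = 'linear_reg', input_label_cols = ['revenue']) AS
SELECT ad_spend, store_size, revenue
FROM `demo.sales`
"""


def run(sql):
    """Submit a statement to BigQuery and return the result rows as a list."""
    # Imported lazily so the strings above can be inspected without the package.
    from google.cloud import bigquery

    client = bigquery.Client()  # uses the default project and credentials
    return list(client.query(sql).result())
```

Calling `run(TRAIN_MODEL_SQL)` would submit the training job, and `run(STORE_DISTANCE_SQL)` would return one row per store. Both calls need a real project containing tables of that shape.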

The BigQuery architecture is composed of several components. Borg is the compute. Colossus is the distributed storage. Jupiter is the network. And Dremel is the execution engine.

Amazon Redshift: The first widely adopted cloud data warehouse.
For many years, data warehousing was only available as an on-premises solution. Then in November 2012, Amazon Web Services (AWS) launched Redshift, a fully managed, petabyte-scale data warehouse service in the cloud. Although not the first cloud-based data warehouse, it was the first to gain market share through adoption. Redshift's SQL dialect is based on PostgreSQL, which is well understood by analysts worldwide, and it uses an architecture familiar to many on-premises data warehouse users.

You can start with as little as a few gigabytes of data and scale to petabytes. This empowers you to acquire new insights from your business and customer data.

The first step to creating a Redshift data warehouse is to launch a set of nodes, called an Amazon Redshift cluster. After you provision your cluster, you upload your data set and then perform data analysis queries. Regardless of the size of your data set, Amazon Redshift delivers fast query performance using familiar SQL-based tools and business intelligence applications.

Snowflake Cloud Data Warehouse: The first multi-cloud data warehouse.
Snowflake is a fully managed, MPP, cloud-based data warehouse that runs on AWS, GCP, and Azure. It is the only solution profiled here that doesn't run on its own cloud. With a common and interchangeable code base, Snowflake features global data replication, which means you can move your data to any cloud, in any region, without having to re-code your applications or learn new skills.

When you're a Snowflake user, you can spin up as many virtual warehouses as you need to parallelize and isolate the performance of individual queries. Snowflake enables very high concurrency by separating storage and compute to ensure that many warehouses can simultaneously access the same data source.
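The idea of many virtual warehouses sharing one data source can be pictured with a small toy model. This is not Snowflake's API: the class names `SharedStorage` and `VirtualWarehouse`, and the sample `orders` table, are invented for illustration. The sketch only shows that each warehouse keeps its own workload state while reading the same single copy of the data.

```python
from dataclasses import dataclass, field


@dataclass
class SharedStorage:
    """Toy stand-in for the storage layer: one copy of every table."""
    tables: dict = field(default_factory=dict)


@dataclass
class VirtualWarehouse:
    """Toy stand-in for a compute cluster attached to shared storage."""
    name: str
    storage: SharedStorage
    queries_run: int = 0  # per-warehouse state, isolated from other warehouses

    def total(self, table, column):
        """Sum one column of a table, counting the query against this warehouse."""
        self.queries_run += 1
        return sum(row[column] for row in self.storage.tables[table])


# One shared data set (hypothetical sample rows).
storage = SharedStorage(tables={
    "orders": [{"amount": 10}, {"amount": 20}, {"amount": 30}],
})

# Two independent warehouses pointed at the same storage.
etl = VirtualWarehouse("etl", storage)
bi = VirtualWarehouse("bi", storage)

bi_total = bi.total("orders", "amount")
bi_total_again = bi.total("orders", "amount")
etl_total = etl.total("orders", "amount")

print(bi_total, etl_total)               # both read the same rows
print(bi.queries_run, etl.queries_run)   # workload counters stay separate
```

Here the BI warehouse has run two queries and the ETL warehouse one, yet neither needed its own copy of the `orders` data.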

You interact with Snowflake's data warehouse through a web browser, the command line, an analytics platform, or via Snowflake's ODBC, JDBC, or other supported drivers. The platform supports ACID-compliant relational processing and has native support for semi-structured data formats such as JSON, Avro, ORC (Optimized Row Columnar), Parquet, and XML.

Taken directly from: https://www.qlik.com/us/cloud-data-migration/cloud-data-warehouse

We're not recommending or suggesting any of these services. Innovoco can help you evaluate the different CDW options available and suggest the best fit for your needs. Learn more at Innovoco.com and book a free consultation to evaluate your CDW needs.

Learn more about how Innovoco facilitates Cloud Data Warehouse projects by starting small and simple.

Pick a day here. We'll send you a Skip the Dishes/Uber Eats e-gift card, and we can chat over Teams.

Sneak preview:

We will NOT bore you with the commonly cited benefits of cloud data warehouses.
We will present 2 specific paths to move from legacy data warehouses to the cloud:
MS Azure: recent developments in Synapse and Databricks that make it very easy to build a data warehouse by starting off small and building up.
Google BigQuery: Google has been trying to steal market share by offering rock-bottom pricing and interesting features.

Pick a day here.
