{"id":15831,"date":"2018-12-18T14:37:10","date_gmt":"2018-12-18T22:37:10","guid":{"rendered":"https:\/\/devwww.3cloudsolutions.com\/post\/translating-your-current-skillset-to-a-modern-cloud-data-warehouse-2\/"},"modified":"2023-07-28T16:08:42","modified_gmt":"2023-07-28T23:08:42","slug":"translating-your-current-skillset-to-a-modern-cloud-data-warehouse","status":"publish","type":"post","link":"https:\/\/3cloudsolutions.com\/resources\/translating-your-current-skillset-to-a-modern-cloud-data-warehouse\/","title":{"rendered":"Translating your Current Skillset to a Modern Cloud Data Warehouse"},"content":{"rendered":"<p>If you are like many data professionals, you may have years of experience building and supporting on-premises data warehouse implementations, but have become overwhelmed by the blisteringly fast rate of change in the industry over the past few years. Business Intelligence tools and platforms have never required more agility and adaptability than they do today, and never have the choices and technologies been so plentiful. With this constant change can come much anxiety and confusion, but also great innovation and opportunity.<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/Modern-Cloud-Data-Warehouse-1.jpg\" alt=\"Modern Cloud Data Warehouse\" width=\"805\" style=\"width: 805px; display: block; margin-left: auto; margin-right: auto;\"><\/p>\n<h2><strong>You Got This!<\/strong><\/h2>\n<p>My hope with this post is two-fold. Primarily, I\u2019d like to set your mind at ease. The more things change, the more they stay the same. You may not know how to program and build an autonomous highway vehicle, but you are not as far away from BI relevance as you might fear. I also intend to introduce you to a modern Business Intelligence architecture stack that should feel familiar to you as you explore the vast array of Microsoft Cloud offerings. To that end, I would like to share with you one possible arrangement of Azure tools which might begin to help you build an understanding of possible roadmaps for moving your data warehouse into the cloud.<!--more--><span style=\"background-color: transparent;\"><\/span><\/p>\n<p><span style=\"background-color: transparent;\">Let&#8217;s start with an overview of some key Azure services:<\/span><\/p>\n<p style=\"padding-left: 30px;\"><strong>Azure SQL Database<br \/><\/strong><span style=\"background-color: transparent;\">Azure <\/span><a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/sql-database\/\" style=\"background-color: transparent;\">SQL Database<\/a><span style=\"background-color: transparent;\"> (DB) can simply be your traditional SQL Server database, except it\u2019s in the cloud. That\u2019s the simplest way to think about it. Developers and administrators with an on-premises SQL Server background will find a low learning curve when moving to Azure SQL DB. Your data will have a comfy new home in the cloud and you will reap the performance benefits of managed services. Along with great flexibility and scalability, you will pick up many of the features previously only available by ponying up for an Enterprise license. With two deployment options, Azure SQL DB can be a simple cloud replacement of on-premises databases, but it can also extend and expand upon your traditional relational data through support of many other data types like JSON, XML, spatial data, and much more.<\/span><\/p>\n<p style=\"padding-left: 30px;\"><strong>Azure SQL Data Warehouse<br \/><\/strong><a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/sql-data-warehouse\/?&#038;OCID=AID719825_SEM_vyUtQ7KI&#038;lnkd=Google_Azure_Brand&#038;gclid=EAIaIQobChMIjIeVraiJ3wIVh5-zCh3JnAkGEAAYASAAEgLfYfD_BwE\" style=\"background-color: transparent;\">Azure SQL Data Warehouse<\/a><span style=\"background-color: transparent;\"> is the Azure service for processing and storing massive volumes of data, and for consolidating disparate data into a single location using simple SQL constructs. It offers flexibility and cost savings by separating compute activities from storage, so you only pay for what you are using. Spin down your compute capacity when you don&#8217;t need it without losing access to your data.<\/span><\/p>\n<p style=\"padding-left: 30px;\">SQL DW scales easily by distributing compute throughput which is a major component to this service\u2019s performance advantage through Massively Parallel Processing (MPP). For the sake of keeping this article digestible, we\u2019ll forego the technical details under the hood of MPP. Just know that it is the secret sauce that brings to Azure SQL DW the ability to ingest and analyze petabytes of data with unbelievable performance. Azure SQL DW is not the service for processing small datasets. If you are comfortable with relational data warehouse structures, think bigger, more diverse and performant, and you\u2019ll begin to see the power of Azure SQL DW.<\/p>\n<p style=\"padding-left: 30px;\"><strong>Azure Databricks<br \/><\/strong><a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/databricks\/\" style=\"background-color: transparent;\">Azure Databricks<\/a><span style=\"background-color: transparent;\"> is an interactive platform built to run and manage Apache Spark in the Azure cloud. Spark is a data processing engine designed to be open and versatile, and serve the needs of everyone from data engineers to data scientists. Use it for data movement activities, to collect and process live streams of data, even IoT sensor feeds. Additionally, Azure Databricks supports a variety of languages: R, Python, SQL, Scala.<\/span><\/p>\n<p style=\"padding-left: 30px;\">Azure Databricks provides a friendly workspace to manage Spark clusters, and manipulate and visualize data. The marketing material really gravitates to the phrase \u201cunified analytics platform\u201d which, once you really dig in and start to understand it, becomes less marketing noise and more \u201cYeah, that actually describes it quite nicely.\u201d There is not much to offer here in the way of drawing a parallel between Azure Databricks and something you are already familiar with in a traditional data warehouse environment. There is a steeper learning curve with Azure Databricks, but this service offers excellent opportunity for innovation.<\/p>\n<p style=\"padding-left: 30px;\"><strong>Azure Data Factory V2<br \/><\/strong><span style=\"background-color: transparent;\">You can use <\/span><a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/data-factory\/\" style=\"background-color: transparent;\">Azure Data Factory<\/a><span style=\"background-color: transparent;\"> (ADF) V2 as the cloud incarnation of <\/span><a href=\"https:\/\/docs.microsoft.com\/en-us\/sql\/integration-services\/sql-server-integration-services?view=sql-server-2017\" style=\"background-color: transparent;\">SQL Server Integration Services<\/a><span style=\"background-color: transparent;\">. As is par for this post, this statement massively understates the capacity of ADF. It provides a hybrid \u201cpipeline model\u201d so you can group logically related units of work and seamlessly span your cloud and on-premises data sources while leveraging advanced compute services (think HDInsight, Hadoop, Spark, etc.) ADF provides 75+ connectors and can be used to lift and shift your existing SSIS packages from on-premises into your Azure environment. You can also automate and schedule workflows with data-driven activities. If you are familiar with visual methods of data integration, you\u2019ll pick up on ADF like a fish to water.<\/span><\/p>\n<p style=\"padding-left: 30px;\"><strong>Azure Analysis Services<br \/><\/strong><span style=\"background-color: transparent;\">Perhaps the lowest learning curve in this transfer is to <\/span><a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/analysis-services\/\" style=\"background-color: transparent;\">Azure Analysis Services<\/a><span style=\"background-color: transparent;\"> (AAS). This Platform as a Service (PaaS) is your analytics engine and all-important semantic layer for your data warehouse. You can still access your on-premises data through the gateway while taking advantage of the scalability benefits of managed services, or you can move your data assets to the cloud using the previously mentioned services to provide an end-to-end cloud data solution. AAS is fully compatible with all your current known and trusted tools (SSMS, SSDT, Power BI, etc.) so you\u2019ll be comfortable working with it from day one.<\/span><\/p>\n<h2><strong>Putting It All Together<\/strong> (batch marries stream)<\/h2>\n<p>Now that we\u2019ve covered some basic components, lets pull them all together into a possible use case to envision our \u201cModern BI Architecture.\u201d Let\u2019s suppose you have IoT sensors capturing measurements from a manufacturing machine. This could be in the form of temperature, rotation speed, tension, amperage, etc. Those sensors could produce a staggering amount of data, but how do you measure the efficiency of that machine, or even the machine\u2019s task at hand? You may need real-time on-the-ground operator feedback. You may need to turn that data into efficiency metrics monitored not only by line operators and foremen, but by managers and corporate decision-makers who all have a different stake in and opinion of the timing of inbound data. Further suppose that you have finished good quality data which is generated and stored separately by independent systems, but that these systems are subject to less frequent batch processing. Your data scientist may need to pull data from both your batch and your streaming data sources in order to make meaningful prescriptive and predictive recommendations and to train machine learning models. Landing your sensor data and your batch data in a semi-structured blob store, you can use Azure Databricks and Azure Data Factory to stage and eventually land in an Azure SQL Data Warehouse. Once there, it will be ready to be hooked by Azure Analysis Services for presentation.<\/p>\n<p style=\"text-align: center;\"><a href=\"https:\/\/azure.microsoft.com\/en-us\/solutions\/architecture\/modern-data-warehouse\/\" rel=\"noopener\" target=\"_blank\"><img decoding=\"async\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/Modern-Data-Warehouse-Architecture-1.png\" alt=\"Modern Data Warehouse Architecture\" width=\"805\" style=\"width: 805px; display: block; margin-left: auto; margin-right: auto;\"><\/a><em style=\"background-color: transparent; text-align: center;\"><a href=\"https:\/\/azure.microsoft.com\/en-us\/solutions\/architecture\/modern-data-warehouse\/\" rel=\"noopener\" target=\"_blank\">Azure Reference Architecture Diagram<\/a><\/em><\/p>\n<p>Maybe you have never dealt with any of these services or technologies. I made a claim earlier in this post: The more things change, the more they stay the same. When you boil the above scenario down to its simplest form you have what you should find a familiar formula. Regardless of the source or the target, you must extract data and load data. Somewhere in between your source and target you may need to do some transformations \u2013 extract, transform, load (ETL). If you are familiar with data warehouse modeling, SQL and the concepts of ETL or ELT, you should be right at home with modern BI architecture.<\/p>\n<p>If you\u2019re looking to advance your organization\u2019s Business Intelligence skills, <a href=\"https:\/\/www.blue-granite.com\/contact-us\">contact us<\/a> today. We offer hands-on, instructor-led training in the latest analytics technologies, and design custom solutions that transform your enterprise through data.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>If you are like many data professionals, you may have years of experience building and supporting on-premises data warehouse implementations, but have become overwhelmed by the fast rate of change over the past few years. With this change can come much confusion, but also innovation and opportunity.<\/p>\n","protected":false},"author":21,"featured_media":14141,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[260],"tags":[304],"class_list":["post-15831","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-ai","tag-modern-data-platform","topics-blog"],"acf":[],"_links":{"self":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15831","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/users\/21"}],"replies":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/comments?post=15831"}],"version-history":[{"count":0,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15831\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media\/14141"}],"wp:attachment":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media?parent=15831"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/categories?post=15831"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/tags?post=15831"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}