{"id":15970,"date":"2016-12-06T17:47:17","date_gmt":"2016-12-07T01:47:17","guid":{"rendered":"https:\/\/devwww.3cloudsolutions.com\/post\/options-to-scale-your-sql-server-data-warehouse-2\/"},"modified":"2024-01-08T10:48:27","modified_gmt":"2024-01-08T18:48:27","slug":"options-to-scale-your-sql-server-data-warehouse","status":"publish","type":"post","link":"https:\/\/3cloudsolutions.com\/resources\/options-to-scale-your-sql-server-data-warehouse\/","title":{"rendered":"Options to Scale Your SQL Server Data Warehouse"},"content":{"rendered":"<p>The Data Warehouse has been critical to business decision making for over two decades. As a Microsoft business intelligence expert and systems integrator, 3Cloud frequently encounters customers looking to maximize their existing SQL Server data warehouse&#8217;s performance, or who want help planning a migration to a new version of SQL Server.<\/p>\n<p><!--more--><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" style=\"display: block; margin-left: auto; margin-right: auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/iStock-495333816edited.png\" alt=\"iStock-495333816edited.png\" width=\"805\" height=\"509\" \/><\/p>\n<p>One of the common problems that 3Cloud sees working with customers in many different industries is an increase in data sources, data volumes, and lengthening ETL load times. Additionally, end users expect more functionality and split-second report\/dashboard response times. The days of waiting two minutes for a report are over. In most cases, customers are looking for features, functionality, techniques, and even hardware that will allow them to meet increasing performance requirements.<\/p>\n<p>As a nine-year leader in the Gartner Magic Quadrant for BI, Microsoft has a vast array of solutions and products to meet the growing needs of the data warehouse. While many customers are aware of <a href=\"https:\/\/www.microsoft.com\/en-us\/sql-server\/sql-server-2016\">SQL Server 2016<\/a> and its earlier versions, Microsoft also has three additional product offerings for data warehouse solutions: <a href=\"https:\/\/www.microsoft.com\/en-us\/cloud-platform\/data-warehouse-fast-track\">Data Warehouse Fast Track<\/a>, <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/sql-data-warehouse\/\">Azure SQL Data Warehouse<\/a>, and the <a href=\"https:\/\/www.microsoft.com\/en-us\/sql-server\/analytics-platform-system\">Analytics Platform System<\/a> (details below). While SQL Server contains many BI features like Integration Services, Analysis Services, and Reporting Services in a single installation of the product, here we are focusing on the Relational database being used for data warehouse solutions.<\/p>\n<p><img decoding=\"async\" style=\"display: block; margin-left: auto; margin-right: auto; width: 768px;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/Microsoft-Data-Warehouse-1.png\" alt=\"Microsoft Data Warehouse.png\" width=\"768\" \/><\/p>\n<h2><span style=\"font-size: 36px;\"><strong>SQL Server<\/strong><\/span><\/h2>\n<h2><span style=\"font-size: 30px;\">What is It: SQL Server 2016 \u2013 Best BI Platform for over a decade<\/span><\/h2>\n<p>Microsoft SQL Server Enterprise is the most common data warehouse solution of the four listed above. It is asymmetric multiprocessing (SMP) database engine that is NUMA aware, which means it relies on the CPUs, RAM, and storage shared within a single server. When using SQL Server for a data warehouse, the customer must build their own hardware solution, then install and \u201cconfigure\u201d SQL Server on top of the hardware. It has industry-leading features including <a href=\"https:\/\/msdn.microsoft.com\/en-us\/library\/gg492088(v=sql.130).aspx\">Clustered Columnstore Index<\/a> for extremely fast, low IO, in-memory query processing.<\/p>\n<h2><span style=\"font-size: 30px;\">Key Considerations: Build Your Own Deployment \u2013 Lots of Options<\/span><\/h2>\n<p>Customers can use a mixture of physical or virtual hardware and direct attached or shared storage via a storage area network. Most customers will configure SQL Server for data warehouse workloads using specific considerations and configurations for scan centric workloads (opposite of OLTP). While SQL Server has traditionally been installed and run on premises, customers can also run SQL Server in the cloud using Infrastructure as a Service (IaaS). SQL Server Enterprise comes with a plethora of BI features that can be used to create a cohesive solution. The speed of storage available to the server is vital to any SQL Server warehouse deployment, so many built SQL Server solutions take advantage of current PCI and flash-based storage technologies. Any design should factor in not only the capacity of storage but also its IO performance to deliver results at the required speed.<\/p>\n<h2><span style=\"font-size: 30px;\">BI Features: All in One or One for All<\/span><\/h2>\n<p>SQL Server doesn\u2019t put constraints on a customer\u2019s ability to install and use multiple features on one server; this is a popular practice due to Microsoft only imposing licensing on a server basis (per core pairs) and not by feature or instance.<\/p>\n<h2><span style=\"font-size: 30px;\">Programming: Regular T-SQL<\/span><\/h2>\n<p>Customers program their data warehouse solution using the robust T-SQL programming language that is expanded and made more robust with each version release of SQL Server.<\/p>\n<h2><span style=\"font-size: 30px;\">Scale Options: Scale Up Yourself<\/span><\/h2>\n<p>Customers can \u201cscale up\u201d their SQL Server data warehouse by adding more horsepower (RAM, CPU, and faster storage). By using SQL Server in a scaled up environment, customers can expect to grow their data warehouse up to 10TB before needing to analyze other solutions.<\/p>\n<h2><span style=\"font-size: 30px;\">Key Features: CCIs &amp; Much More<\/span><\/h2>\n<p>SQL Server has page and row compression which allow the data to be compressed on disk so data retrieval is faster overall. Starting with SQL Server 2014, customers can also create Clustered Columnstore Indexes (CCIs) which convert data to columnar storage from row storage allowing even greater compression. CCIs have the potential to speed up data warehousing queries 100x. In-memory OLTP tables can be used for Staging tables to speed up ETL workloads. Batch mode also helps with performance as the engine can iterate on \u201cbatches\u201d of rows at a time.<\/p>\n<p>For information on SQL Server 2016 performance features, <a href=\"https:\/\/msdn.microsoft.com\/en-us\/library\/bb510411.aspx\">Microsoft\u2019s library of resources<\/a> is a great starting point.<\/p>\n<h2><span style=\"font-size: 36px;\"><strong>SQL Server Fast Track<\/strong><\/span><\/h2>\n<h2><span style=\"font-size: 30px;\">What is It: High Performance SMP SQL Server Appliance<\/span><\/h2>\n<p>SQL Server Fast Track is a reference architecture methodology for data warehouse solutions where performance is balanced between all the components of software, hardware, and storage to remove bottlenecks. Microsoft and various hardware vendors have teamed up to use the architecture to create pre-configured, tested, and shipped appliance solutions for SQL Server data warehousing. This can eliminate most of the effort and expertise needed to engineer an optimal server solution from basic parts.<\/p>\n<h2><span style=\"font-size: 30px;\">Key Considerations: Time Saver \u2013 Off the Shelf Hardware &amp; Software Deployment<\/span><\/h2>\n<p>The reference architecture can be manually applied to a plain SMP SQL Server data warehouse solution, but customers get the most value in being able to pick an off-the-shelf solution for their desired performance level and price, as this can speed up solution deployments. When buying a Fast Track appliance, customers can expect the vendor to ship and configure the hardware and software to the customer\u2019s on-premises data center.<\/p>\n<h2><span style=\"font-size: 30px;\">BI Features: Only SQL on the Appliance<\/span><\/h2>\n<p>Because Fast Track is an appliance-based approach, there are some underlying restrictions. First, Fast Track appliances are only currently available on-premises. Second, while SMP SQL Server allows the customer to install features for BI, this is typically not supported on the appliance.<\/p>\n<h2><span style=\"font-size: 30px;\">Programming: Regular T-SQL Compatible<\/span><\/h2>\n<p>Since Fast Track is just SQL Server Enterprise SMP, you can use regular T-SQL.<\/p>\n<h2><span style=\"font-size: 30px;\">Scale Options: Scale Up \u2013 Predetermined Scale Size Limit (5TB to 145TB)<\/span><\/h2>\n<p>Fast Track is a scale up solution. Appliances are available in many different sizes (5TB to 145T) from many vendors. Since the appliance is already scaled up, they typically can\u2019t be scaled up any further. If a customer exceeds their Fast Track size threshold, they typically must buy another one.<\/p>\n<h2><span style=\"font-size: 30px;\">Key Features: Fast Storage, All SQL Server Features Available<\/span><\/h2>\n<p>The main benefits of Fast Track are screaming-fast performance (up to 10 GB\/second read\/write throughput), usage of plain T-SQL, and access to all the performance enhancements to SQL Server like CCIs. This means that customers can migrate their data warehouse solutions to Fast Track, leave their code unchanged, and obtain tremendous performance benefits.<\/p>\n<p>For more information on Fast Track, try checking out <a href=\"https:\/\/www.microsoft.com\/en-us\/cloud-platform\/data-warehouse-fast-track\">Microsoft\u2019s website<\/a>.<\/p>\n<h2><span style=\"font-size: 36px;\"><strong>Analytics Platform System<\/strong><\/span><\/h2>\n<h2><span style=\"font-size: 30px;\">What is It: SQL Server MPP Appliance<\/span><\/h2>\n<p>The Microsoft Analytics Platform System (APS) is a SQL Server Enterprise-based data warehouse appliance created by Microsoft and hardware vendors that can scale to up to 6PB of data. APS is an Massively Parallel Processing (MPP) database engine, meaning that it is a set of multiple SQL Servers that are running in concert together.<\/p>\n<h2><span style=\"font-size: 30px;\">Key Considerations: Scale Out &amp; Distribute Your Data<\/span><\/h2>\n<p>APS is a scale out solution. APS fundamentally works by taking a big problem and breaking it up into smaller pieces. APS does this by implementing a control\/compute topology, where one SQL Server functions as the brains of the solution (control), and other SQL Servers function as the brawn (compute) to perform parallel operations on data that has been distributed among the compute SQL Servers.<\/p>\n<h2><span style=\"font-size: 30px;\">BI Features: Only SQL on the Appliance<\/span><\/h2>\n<p>Similar to Fast Track, APS does have some limitations, and it\u2019s only available on-premises. APS is only on SQL Server; additional BI features cannot be installed on the appliance, though all BI features can use the appliance.<\/p>\n<h2><span style=\"font-size: 30px;\">Programming: DSQL is a little different and may require conversion<\/span><\/h2>\n<p>APS uses Distributed SQL (DSQL). DSQL is slightly different than T-SQL and customers typically have some code conversion (DDL &amp; DML) that is needed before they can move their databases to APS. This conversion can be minimal or extensive depending on the customer\u2019s coding practices.<\/p>\n<h2><span style=\"font-size: 30px;\">Scale Options: Scale Out in Scale Units<\/span><\/h2>\n<p>Customers should look at APS when they have around 10TB of data and expect data growth. APS can scale out to as large as 6PB. APS scales with linear cost and performance because it can be expanded by adding additional compute nodes to the appliance (called adding Scale Units).<\/p>\n<h2><span style=\"font-size: 30px;\">Key Features: It\u2019s Extremely Fast<\/span><\/h2>\n<p>When using APS, customers can expect significant performance improvements in querying and loading data. Customer reporting queries can often be sped up by a factor of 100 and data can be loaded as fast as several TBs an hour. Because APS is SQL Server under the covers, there is nothing special needed to connect to it, so customers can expect a seamless transition of their reporting and semantic layer tools.<\/p>\n<p>For more information on APS, see Microsoft\u2019s <a href=\"https:\/\/www.microsoft.com\/en-us\/sql-server\/analytics-platform-system\">website<\/a>.<\/p>\n<h2><span style=\"font-size: 36px;\"><strong>Azure SQL Data Warehouse<\/strong><\/span><\/h2>\n<h2><span style=\"font-size: 30px;\">What is It: SQL Server MPP in the Cloud<\/span><\/h2>\n<p>Azure SQL Data Warehouse (SQL DW) is a Platform as a Service (PaaS) data warehouse service in the Microsoft Azure cloud. SQL DW is an MPP database engine relying on a control\/compute topology using Azure SQL Database and Azure Blog Storage. SQL DW is not APS, but it is very similar.<\/p>\n<h2><span style=\"font-size: 30px;\">Key Considerations: SQL DW is Scale Out PaaS \u2013 Infrastructure is managed by Microsoft<\/span><\/h2>\n<p>SQL DW is a scale out solution like APS, and like APS, data is distributed to compute nodes for faster performance and scale. Since SQL DW is an Azure offering, it only runs in Azure and is not available on-premises. The offering is also PaaS, so customers don\u2019t have to worry about on-premises or even cloud infrastructure as it is all managed by Microsoft.<\/p>\n<h2><span style=\"font-size: 30px;\">BI Features: Only SQL On the Appliance<\/span><\/h2>\n<p>SQL DW is only SQL Server like Fast Track and APS. While you can\u2019t add BI features to the cluster, SQL DW can be used with all the SQL Server and Azure BI features.<\/p>\n<h2><span style=\"font-size: 30px;\">Programming: DSQL is a little different and may require conversion<\/span><\/h2>\n<p>Like APS, SQL DW uses DSQL and has similar programming limitations.<\/p>\n<h2><span style=\"font-size: 30px;\">Scale Options: Elastically Scale in Minutes<\/span><\/h2>\n<p>Customers should look at SQL DW when they have TBs of data. Using SQL DW, they can expect to scale out to as large as PBs. SQL DW can be elastically scaled out in seconds by changing the Data Warehouse Units (DWU \u2013 unit of scale performance) setting in the Azure portal.<\/p>\n<h2><span style=\"font-size: 30px;\">Key Features: Separation of Compute &amp; Storage<\/span><\/h2>\n<p>Customers can create a SQL DW cluster in minutes. Just like other Azure offerings, it is a pay-as-you-go pricing model so you are charged by the hour. Not only can SQL DW be elastically scaled out, Customers can even pause their instance which eliminates compute charges. Storage and compute are completely decoupled so when customers pause their cluster, their data is persisted.<\/p>\n<p>If you\u2019d like more information about Azure SQL DW, please see <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/sql-data-warehouse\/\">Microsoft\u2019s website<\/a>.<\/p>\n<h3>Scale Your Company with 3Cloud<\/h3>\n<p>Whether you want to scale an existing data warehouse platform or migrate to a new one work, 3Cloud would love to help. <a href=\"\/get-started\/\">Contact us<\/a> today \u2013 we\u2019re experts at helping organizations of all sizes navigate the many Modern Data Platform solutions and better understand the benefits of Big Data.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Data Warehouse is critical to business decision making. BlueGranite frequently helps clients plan for migration to new versions of SQL Server effectivley.<\/p>\n","protected":false},"author":21,"featured_media":14747,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[297],"tags":[304],"class_list":["post-15970","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-platform","tag-modern-data-platform","topics-blog"],"acf":[],"_links":{"self":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15970","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/users\/21"}],"replies":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/comments?post=15970"}],"version-history":[{"count":0,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15970\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media\/14747"}],"wp:attachment":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media?parent=15970"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/categories?post=15970"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/tags?post=15970"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}