{"id":15945,"date":"2017-03-28T14:37:00","date_gmt":"2017-03-28T21:37:00","guid":{"rendered":"https:\/\/devwww.3cloudsolutions.com\/post\/webinar-recap-distributed-computing-r-server-2\/"},"modified":"2024-04-17T13:58:08","modified_gmt":"2024-04-17T20:58:08","slug":"webinar-recap-distributed-computing-r-server","status":"publish","type":"post","link":"https:\/\/3cloudsolutions.com\/resources\/webinar-recap-distributed-computing-r-server\/","title":{"rendered":"Webinar Recap: Distributed Computing &#038; R Server"},"content":{"rendered":"<p>Last week, our monthly webinar series covered Distributed Computing and What&#8217;s New in R Server 9.0.1. If you missed the session, you can find the <a href=\"https:\/\/www.blue-granite.com\/distributed-computing-microsoft-r-server-webinar-mar-2017\" target=\"_blank\" rel=\"noopener\">recording here<\/a>. We received a number of\u00a0questions during the presentation\u00a0and wanted to take the opportunity to provide some insightful answers for\u00a0the audience.<\/p>\n<p><!--more--><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" style=\"margin-right: auto; margin-left: auto; display: block;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/iStock-496851962.png\" alt=\"iStock-496851962.png\" width=\"805\" height=\"509\" \/><\/p>\n<h3 style=\"padding-left: 30px;\">If I am just getting started with R,\u00a0how should I go about choosing between open-source and Microsoft R?<\/h3>\n<p style=\"padding-left: 30px;\">It really depends on your needs, but luckily both <a href=\"https:\/\/www.r-project.org\/\" target=\"_blank\" rel=\"noopener\">open-source R<\/a> and\u00a0<a href=\"https:\/\/msdn.microsoft.com\/en-us\/microsoft-r\/\" target=\"_blank\" rel=\"noopener\">Microsoft R Open<\/a> are free to get started. Either solution\u00a0will allow you to practice working in the environment, and once you get comfortable in R and you starting running into the limitations with R on large datasets (the ones that won\u2019t fit in your computer\u2019s memory), you can look into purchasing\u00a0R Server, or spin up an R Server on Azure in HDInsight or on the Data Science Virtual Machine.<\/p>\n<h3 style=\"padding-left: 30px;\">When thinking of SAS vs. R: as a SAS user how difficult or easy would it be for me to learn and start using R?<\/h3>\n<p style=\"padding-left: 30px;\">Both SAS and R do very similar things. However, SAS code is written in procedures where R is more of a script. Since you\u2019re already familiar with the algorithms\/functions you use in SAS, learning the syntax of R really isn\u2019t that bad. Plus, the cost savings of R over SAS are tremendous! Check out the example below for a quick comparison.<\/p>\n<table style=\"height: 167px; margin-right: auto; margin-left: auto;\" width=\"698\">\n<tbody>\n<tr>\n<td style=\"width: 167px; text-align: center; background-color: #b3b3b3;\"><\/td>\n<td style=\"width: 167px; text-align: center; background-color: #b3b3b3;\">\n<h4>SAS<\/h4>\n<\/td>\n<td style=\"width: 167px; text-align: center; background-color: #b3b3b3;\">\n<h4>R<\/h4>\n<\/td>\n<td style=\"width: 167px; text-align: center; background-color: #b3b3b3;\">\n<h4>Microsoft R<\/h4>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 167px; text-align: left; background-color: #cdcdcd;\">\n<p style=\"text-align: center;\">Linear Regression<\/p>\n<\/td>\n<td style=\"width: 167px; text-align: left; background-color: #cdcdcd;\"><strong>proc reg <\/strong>data=mydata;<br \/>\nmodel y=x1 x2 x3;<\/td>\n<td style=\"width: 167px; text-align: left; background-color: #cdcdcd;\"><strong>lm(<br \/>\n<\/strong>y~x1+x2+x3,<br \/>\nmydata<br \/>\n<strong>)<\/strong><\/td>\n<td style=\"width: 167px; text-align: left; background-color: #cdcdcd;\"><strong>rxLinMod(<br \/>\n<\/strong>formula: y ~ x1 + x2 + x3,<br \/>\nmyxdfdata<br \/>\n<strong>)<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3 style=\"padding-left: 30px;\">\nWhat use cases do you see being used with the R server technology?<\/h3>\n<p style=\"padding-left: 30px;\">I think the use cases depend on\u00a0when you want to take your R practice to the next level:<\/p>\n<ul>\n<li>Larger, collaborative data science teams<\/li>\n<li>Larger amounts of data (tweets, click stream data, genomics, etc.) a.k.a. \u201cBig Data\u201d<\/li>\n<li>When operationalizing the models you make is crucial to providing business insight. Also, when you want to easily maintain, update, and rerun them.<\/li>\n<\/ul>\n<h3 style=\"padding-left: 30px;\">Is there a benefit to using R if my dataset is in the range of two to four million rows of data?<\/h3>\n<p style=\"padding-left: 30px;\">In short, it depends.\u00a0Two to four\u00a0million rows might not be outside of the memory limit of normal R, but it is approaching it for many machines. The added value of R Server won&#8217;t necessarily\u00a0be in\u00a0its\u00a0power to handle more data, but\u00a0in its\u00a0ability\u00a0to run computations on that data in a more reasonable timeframe. Millions of rows may take a long time to train the model or predict upon, but using R Server will definitely cut down on that time.<\/p>\n<h3 style=\"padding-left: 30px;\">Do you have any virtual instructor-led R training?<\/h3>\n<p style=\"padding-left: 30px;\">While BlueGranite does not currently\u00a0offer virtual training opportunities for R,\u00a0our 3-day <a href=\"https:\/\/www.blue-granite.com\/microsoft-r-training\">Microsoft R Training<\/a> session takes place at\u00a0your location\u00a0and features hands-on, instructor-led lessons for up to 10 of your firm&#8217;s\u00a0attendees.\u00a0A BlueGranite senior consultant will facilitate hands-on labs and provide material for your team on the fundamentals of R programming for data ingestion, exploratory data analysis, model building, evaluation, and operationalization. Attendees from your company will learn how to write effective R code that can be operationalized in production.<\/p>\n<p style=\"padding-left: 30px;\">Additionally, there are many online resources that could help you get started. I would recommend looking into <a href=\"https:\/\/www.edx.org\/school\/microsoft\">free courses online<\/a> with edX as a beginning point and perhaps exploring an in-person training session in the future.<\/p>\n<h3 style=\"padding-left: 30px;\">Azure seems to be the way everything is moving \u2013 would you agree?<\/h3>\n<p style=\"padding-left: 30px;\">I think so because of the flexibility and expandability of the Azure environment. Why would your organization want to pay thousands (or millions) on a big server system that will be out of date in a few years when you can just pay monthly only for what you use on Azure? Plus, the ease of setup is an added bonus. Instead of having to hire a crew to come in and setup Hadoop on your on-premise server, just spin it up on Azure in less than 30 minutes.<\/p>\n<p>Thank you to everyone who joined us for the webinar! If you have any more questions or want to know more about R training opportunities, feel free to <a href=\"https:\/\/www.blue-granite.com\/contact-us\" target=\"_blank\" rel=\"noopener\">reach out to us<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Our recent webinar covered Distributed Computing and What\\&#8217;s New in R Server 9.0.1. We received some questions during the presentation &#8211; here are some answers!<\/p>\n","protected":false},"author":21,"featured_media":14699,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[260,378],"tags":[319],"class_list":["post-15945","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-ai","category-past-webinars","tag-machine-learning-ai","topics-blog"],"acf":[],"_links":{"self":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15945","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/users\/21"}],"replies":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/comments?post=15945"}],"version-history":[{"count":0,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15945\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media\/14699"}],"wp:attachment":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media?parent=15945"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/categories?post=15945"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/tags?post=15945"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}