{"id":16004,"date":"2016-04-08T13:30:00","date_gmt":"2016-04-08T20:30:00","guid":{"rendered":"https:\/\/devwww.3cloudsolutions.com\/post\/apache-solr-3-analytic-use-cases-2\/"},"modified":"2023-10-09T14:10:23","modified_gmt":"2023-10-09T21:10:23","slug":"apache-solr-3-analytic-use-cases","status":"publish","type":"post","link":"https:\/\/3cloudsolutions.com\/resources\/apache-solr-3-analytic-use-cases\/","title":{"rendered":"Apache Solr: 3 Analytic Use Cases"},"content":{"rendered":"<h3>Introduction to Solr<\/h3>\n<p><img decoding=\"async\" style=\"margin: 10px; width: 200px; float: right;\" title=\"Apache Solr\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/Solr.png\" alt=\"Apache Solr\" width=\"200\" align=\"right\" \/><\/p>\n<p><a href=\"http:\/\/lucene.apache.org\/solr\/\" target=\"_blank\" rel=\"noopener\">Apache Solr<\/a> is a free and open source search engine.\u00a0 It provides ultra-fast search against structured, semi-structured and unstructured data.\u00a0 Its cloud-enabled mode allows for massive search indexes scaled and replicated on a Hadoop cluster.\u00a0 It forms the search backbone used by companies such as Best Buy, Sears, eHarmony and <a href=\"https:\/\/wiki.apache.org\/solr\/PublicServers\" target=\"_blank\" rel=\"noopener\">more<\/a>. In fact, it\u2019s used by 90% of Fortune 500 companies.<\/p>\n<p>Now that you know what Solr is, you might question how a search engine would be used in a BI infrastructure, but the very things that make it an excellent search engine also make it a potent data store for analytic use.\u00a0 After all, search engines are nothing more than specialized databases.\u00a0 In this article, we\u2019ll outline a few BI applications of Solr.<!--more--><\/p>\n<h3><img loading=\"lazy\" decoding=\"async\" title=\"solr-sun-beach.jpg\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/solr-sun-beach.jpg\" alt=\"solr-sun-beach.jpg\" width=\"798\" height=\"504\" \/><\/h3>\n<div>\n<h3>Scenario 1: Text Analytics with HR<\/h3>\n<p>Hiring managers often have to go through piles of resumes just to find a dozen or so to interview.\u00a0 If you\u2019re in HR, you\u2019re going through resumes ad nauseam.\u00a0 Whereas traditional databases have limited text processing capabilities, Solr is much better suited to analyze and filter resumes submitted for job openings.<\/p>\n<p>It\u2019s a natural use case for Solr. It can be fed native documents, including PDF, Word, XML or plain text and integrate those into its index.\u00a0 With its development as a search engine, it can easily process the unstructured text.\u00a0 It can extract key words and phrases, perform language detection and transparently deal with differing word forms.<\/p>\n<p>After a hire, periodic reviews can be combined with keywords, key phrases and other metadata extracted from the source resume to form a predictive model, which can then be used in later hiring processes.<\/p>\n<h3>Scenario 2: Spatial Analytics with Strategic Planning<\/h3>\n<p>When a store chain grows from a local to a regional endeavor, new locations better serve existing customers and attract new ones.\u00a0 A strategic planner has to make the age old decision of a business storefront: location.\u00a0 Here, too, Solr has specialized features that can help.\u00a0 Its geospatial features allow the strategic planner to plot existing and potential customers on a map, and easily incorporate distance into the ranking of each potential location.<\/p>\n<p>Likewise, when visualizing customer purchases, its grouping features can quickly break down customers by distance traveled, amount purchased, or number of visits.<\/p>\n<h3>Scenario 3: Log file Analytics with Manufacturing<\/h3>\n<p>Manufacturing operations track parts assembly as they enter the inventory until they leave the line fully assembled.\u00a0 All the machines on the assembly line record log entries.\u00a0 They might post entries with different structures.\u00a0 That line might be one of dozens or hundreds.\u00a0 With that volume, you need very efficient and scalable ingestion and search.\u00a0 Solr can operate in SolrCloud mode, scaling to nearly infinite volume.\u00a0 Combined with an ingestion tool like Apache Nifi, Solr can index extremely high volumes of data.<\/p>\n<p>Because it is first a text processing engine, it can deal with vagaries of structure, searching the general text or extracting them into appropriate structures as the entries are indexed.\u00a0 It can power responsive dashboards showing production rate, defect rate, etc.\u00a0 They can be filtered by date range, batch, product line, location or even by keyword. Solr can usually handle these filters in near real-time.<\/p>\n<h3>Will Solr 6\u00a0Provide\u00a0Analytics for Anything?<\/h3>\n<p>Solr 6, due in the first half of 2016, introduces a new\u00a0SQL query engine.\u00a0 That allows it to be a data source for, well, just about anything.\u00a0 SQL opens up a new world of complex queries AND makes it available to a much broader audience already familiar with SQL.\u00a0 It might just be a new day for Solr.<\/p>\n<p>If you have any additional questions, please reach out to us on our <a href=\"\/get-started\/\">Contact Us<\/a> page.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Apache Solr is a free and open source search engine.  It provides ultra-fast search against structured, semi-structured and unstructured data.<\/p>\n","protected":false},"author":21,"featured_media":14820,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[260],"tags":[345],"class_list":["post-16004","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-ai","tag-solr","topics-blog"],"acf":[],"_links":{"self":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/16004","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/users\/21"}],"replies":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/comments?post=16004"}],"version-history":[{"count":0,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/16004\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media\/14820"}],"wp:attachment":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media?parent=16004"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/categories?post=16004"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/tags?post=16004"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}