{"id":15867,"date":"2018-06-05T12:23:00","date_gmt":"2018-06-05T19:23:00","guid":{"rendered":"https:\/\/devwww.3cloudsolutions.com\/post\/cognitive-services-showcase-api-vision-tools-2\/"},"modified":"2024-01-03T14:35:18","modified_gmt":"2024-01-03T22:35:18","slug":"cognitive-services-showcase-api-vision-tools","status":"publish","type":"post","link":"https:\/\/3cloudsolutions.com\/resources\/cognitive-services-showcase-api-vision-tools\/","title":{"rendered":"Cognitive Services Showcase: API Vision Tools"},"content":{"rendered":"<p><span style=\"background-color: transparent;\">Continuing 3Cloud&#8217;s series on the Microsoft <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/cognitive-services\/\">Cognitive Services<\/a> APIs, let&#8217;s look into Vision. Using the pre-built artificial intelligence from the Vision APIs will give your apps and other solutions the rich capabilities of<\/span><span style=\"background-color: transparent;\">\u00a0different types of image and video analysis.<\/p>\n<p><img decoding=\"async\" style=\"width: 805px; display: block; margin: 0px auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/Banner_Vision.png\" alt=\"Banner_Vision\" width=\"805\" \/><\/span><\/p>\n<p><!--more--><\/p>\n<h2>All About Vision<\/h2>\n<p>The Vision capabilities use AI to automatically provide detailed explanations of what your images and videos contain.\u00a0<span style=\"background-color: transparent;\">The following Vision services cover many of the common tasks that you might use to enrich your applications.\u00a0<\/span><\/p>\n<table style=\"margin-left: auto; margin-right: auto; width: 847px;\">\n<tbody>\n<tr style=\"height: 21px;\">\n<td style=\"height: 21px; width: 82px;\"><img decoding=\"async\" style=\"width: 512px;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/vision-computervision.png\" alt=\"vision-computervision\" width=\"512\" \/><\/td>\n<td style=\"height: 21px; width: 259.844px;\"><span style=\"font-size: 20px;\"><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cognitive-services\/computer-vision\/Home\" target=\"_blank\" rel=\"noopener\">Computer Vision API<\/a><\/span><\/td>\n<td style=\"height: 21px; width: 492.156px;\">Analyze image content, obtain tags and categories, identify text using optical character recognition, flag racy or adult content, crop photos as thumbnails, and more<\/td>\n<\/tr>\n<tr style=\"height: 21px;\">\n<td style=\"height: 21px; width: 82px;\"><img decoding=\"async\" style=\"width: 512px;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/vision-face.png\" alt=\"vision-face\" width=\"512\" \/><\/td>\n<td style=\"height: 21px; width: 259.844px;\"><span style=\"font-size: 20px;\"><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cognitive-services\/face\/Overview\" target=\"_blank\" rel=\"noopener\">Face API<\/a><\/span><\/td>\n<td style=\"height: 21px; width: 492.156px;\">Locate and group faces in images as well as suggest gender, age, and emotions (previously a separate Emotion API)<\/td>\n<\/tr>\n<tr style=\"height: 21px;\">\n<td style=\"height: 21px; width: 82px;\"><img decoding=\"async\" style=\"width: 512px;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/vision-customvision.png\" alt=\"vision-customvision\" width=\"512\" \/><\/td>\n<td style=\"height: 21px; width: 259.844px;\"><span style=\"font-size: 20px;\"><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cognitive-services\/Custom-Vision-Service\/home\" target=\"_blank\" rel=\"noopener\">Custom Vision Service<\/a><\/span><\/td>\n<td style=\"height: 21px; width: 492.156px;\">Train and use your own custom image classifiers<\/td>\n<\/tr>\n<tr style=\"height: 21px;\">\n<td style=\"height: 21px; width: 82px;\"><img decoding=\"async\" style=\"width: 512px;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/vision-videoindexer.png\" alt=\"vision-videoindexer\" width=\"512\" \/><\/td>\n<td style=\"height: 21px; width: 259.844px;\"><span style=\"font-size: 20px;\"><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cognitive-services\/video-indexer\/video-indexer-overview\" target=\"_blank\" rel=\"noopener\">Video Indexer<\/a><\/span><\/td>\n<td style=\"height: 21px; width: 492.156px;\">Analyze video content to track faces, gauge sentiment, transcribe audio, recognize different speakers, and more<\/td>\n<\/tr>\n<tr style=\"height: 21px;\">\n<td style=\"height: 21px; width: 82px;\"><img decoding=\"async\" style=\"width: 512px;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/vision-contentmoderator.png\" alt=\"vision-contentmoderator\" width=\"512\" \/><\/td>\n<td style=\"height: 21px; width: 259.844px;\"><span style=\"font-size: 20px;\"><a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cognitive-services\/content-moderator\/overview\" target=\"_blank\" rel=\"noopener\">Content Moderator<\/a><\/span><\/td>\n<td style=\"height: 21px; width: 492.156px;\">Detect and potentially filter offensive content from images and video, and flag content for human review<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p style=\"text-align: center;\"><em>To try any of these Vision Cognitive Services, get free trial API keys\u00a0<a href=\"https:\/\/azure.microsoft.com\/en-us\/try\/cognitive-services\/#vision\" target=\"_blank\" rel=\"noopener\">here<\/a>.<span style=\"background-color: transparent;\">\u00a0<\/span><\/em><\/p>\n<h2>Custom Vision Service<\/h2>\n<p><span style=\"background-color: transparent;\">Let&#8217;s peer into some of the functionality for one of Microsoft&#8217;s Vision offerings: the <\/span><a style=\"background-color: transparent;\" href=\"https:\/\/customvision.ai\/\" target=\"_blank\" rel=\"noopener\">Custom Vision Service<\/a><span style=\"background-color: transparent;\">.\u00a0<\/span><span style=\"background-color: transparent;\">Among the Vision APIs, the <\/span><span style=\"background-color: transparent;\">Custom Vision Service<\/span><span style=\"background-color: transparent;\"> is unique because it enables developers to easily train their own image classification model. In addition to the abundant data provided by pre-trained AI from the Computer Vision and Face APIs, the Custom Vision Service lets you go a step further and tag your own images. Once you have trained a model and it&#8217;s ready for production, create a personalized API endpoint. Ultimately, send additional images to the API to classify them and predict how closely they align with your custom labels.<\/span><\/p>\n<p>Why would you consider using the Custom Vision Service and not the regular Computer Vision API? Custom Vision is beneficial in cases where you need precise or proprietary labels for your data. For example, a manufacturer may want to label product lines using their own enterprise terminology, which the Computer Vision API is not equipped to do. In fact, consider using <strong>both APIs together<\/strong>. Here is a look at some of the output from the Computer Vision API and how you can enhance that output using the Custom Vision Service.<\/p>\n<h3 style=\"text-align: center;\"><strong>Use the Computer Vision API for General Results<\/strong><\/h3>\n<p style=\"text-align: center;\"><strong><img decoding=\"async\" style=\"width: 600px; margin: 0px auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/ComputerVision-GenericResults.png\" alt=\"ComputerVision-GenericResults\" width=\"600\" \/><\/strong><\/p>\n<h4 style=\"text-align: center;\"><strong>Use the Custom Vision Service for Personalized Tagging<\/strong><\/h4>\n<p><img decoding=\"async\" style=\"width: 600px; display: block; margin: 0px auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/CustomVision-TagObjectsForest.png\" alt=\"CustomVision-TagObjectsForest\" width=\"600\" \/><\/p>\n<p>Through its website, the Custom Vision Service lets you build an image classifier using a process similar to tagging images on social media. Even if you are not a data scientist, you could take advantage of Microsoft\u2019s AI without the time and resources needed to code and productionalize a custom image model from scratch.<\/p>\n<p><img decoding=\"async\" style=\"width: 600px; display: block; margin: 0px auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/CustomVision-TagObjectSquirrel.png\" alt=\"CustomVision-TagObjectSquirrel\" width=\"600\" \/><\/p>\n<p>With a language such as Python, you can use code to perform the same tasks as the <a href=\"https:\/\/customvision.ai\" target=\"_blank\" rel=\"noopener\">customvision.ai<\/a>\u00a0 website. Code gives you the advantage of greater scalability, faster changes, and the ability to reference image URLs instead of manually uploading images.<\/p>\n<p><img decoding=\"async\" style=\"width: 600px; display: block; margin: 0px auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/CustomVision-SamplePredictionWithPython.png\" alt=\"CustomVision-SamplePredictionWithPython\" width=\"600\" \/><\/p>\n<h4><\/h4>\n<h3>Getting Started with the Custom Vision Service<\/h3>\n<p>To get started, go to <strong><a href=\"https:\/\/customvision.ai\/\">customvision.ai<\/a><\/strong> and sign in with a Microsoft account associated with an Azure subscription. The first time you sign in, you need to agree to Microsoft\u2019s Terms of Service.<\/p>\n<p>When adding a new project, choose between <em>Classification <\/em>and <em>Object Detection<strong>.<\/strong><\/em> Classification predicts labels for an entire image while Object Detection details where tagged content appears in an image. With a generic classification project, you may optionally select targeted domains for scenarios like Food or Retail.<\/p>\n<p><img decoding=\"async\" style=\"width: 406px; display: block; margin: 0px auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/CustomVision-ProjectOptions.png\" alt=\"CustomVision-ProjectOptions\" width=\"406\" \/><\/p>\n<p>To train a model, you currently need at least 15 images for every tag, but precision should increase with additional images. Microsoft recommends at least 50 images per tag. With a standard project, you can include up to 250 tags and up to 50,000 training images. It also helps to include images that contain your objects of interest in a variety of settings and backgrounds.<\/p>\n<p>The Custom Vision Service also provides precision and recall by tag as basic measures of model performance. <span style=\"background-color: transparent;\">To use your model in production, you need a subscription key. As with the other Cognitive Services, incorporate your Custom Vision model into a solution with a basic API call.<\/span><\/p>\n<p><span style=\"background-color: transparent;\"><img decoding=\"async\" style=\"width: 455px; display: block; margin: 0px auto;\" src=\"https:\/\/3cloudsolutions.com\/wp-content\/uploads\/2022\/11\/CustomVision-PredictionAPI.png\" alt=\"CustomVision-PredictionAPI\" width=\"455\" \/><\/span><\/p>\n<p>As you can see, the Vision APIs are convenient for a variety of tasks. Whether you are a developer or data scientist, these APIs bring you advanced AI capabilities without the cost and time of training your own image models. You can always decide if you need more at a future point, but the Vision APIs provide a jump start into an area where it is time- and resource-intensive to build these types of models on your own. Also, c<span style=\"text-align: right;\">onsider looking further into <em>all<\/em> of the <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/cognitive-services\/\" target=\"_blank\" rel=\"noopener\">Cognitive Services <\/a><\/span><a style=\"text-align: right;\" href=\"https:\/\/azure.microsoft.com\/en-us\/services\/cognitive-services\/\" target=\"_blank\" rel=\"noopener\">here<\/a>, and combine Vision with other sets of APIs like Language or Speech to make your apps even more intelligent<span style=\"text-align: right;\">.<\/span><\/p>\n<h2>More to Come<\/h2>\n<p>So far, 3Cloud has surveyed the Search and Vision APIs. In addition, we will explore the remaining Cognitive Services categories: Speech, Knowledge, and Language. Subscribe to our blog so that you don&#8217;t miss out, and <a href=\"\/get-started\/\">contact us<\/a> if you would like to learn more about incorporating Cognitive Services into your own solutions.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Continuing 3Cloud\\&#8217;s series on the Microsoft Cognitive Services APIs, let\\&#8217;s look into Vision. Using the pre-built artificial intelligence from the Vision APIs will give your apps and other solutions the rich capabilities of different types of image and video analysis.<\/p>\n","protected":false},"author":21,"featured_media":14346,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[260],"tags":[331,319],"class_list":["post-15867","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-ai","tag-cognitive-services","tag-machine-learning-ai","topics-blog"],"acf":[],"_links":{"self":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15867","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/users\/21"}],"replies":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/comments?post=15867"}],"version-history":[{"count":0,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/posts\/15867\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media\/14346"}],"wp:attachment":[{"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/media?parent=15867"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/categories?post=15867"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/3cloudsolutions.com\/wp-json\/wp\/v2\/tags?post=15867"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}