In order to search on images and scanned files, you can enable automatic OCR using the out of the box support for the Azure Computer Vision APIs on your Curiosity application.

You will need:

  • An active Azure subscription (you can test this feature with the free subscription)
  • Your Computer Vision endpoint
  • Your Computer Vision subscription key

Note: The Azure Computer Vision API is an offer from Microsoft - you can check the prices for using the API on their website.

Creating a Azure Computer Vision endpoint

If you already have a Computer Vision endpoint, you can skip to the next section.

Otherwise, you can easily create a new one from within your Azure portal page.

Click on Create a Resource:

Select the option AI + Machine Learning, and then click on Computer Vision

Configure your new Computer Vision resource as required:

And click on Review + Create to check all configurations, and finally on Create to deploy the new endpoint.

Once your endpoint is created, navigate to the section Keys and Endpoint and copy the value of the fields Endpoint and Key 1 - you'll need these values for the next step below.

Configuring Azure Computer Vision on Curiosity

On your Curiosity application, navigate to Settings > Data > OCR Settings, and enter the values for Endpoint and Key 1 in the fields Azure Ocr Endpoint and Azure Subscription Key respectively. Click on Save to store all changes.

If you have already added files in your system, you might want to reprocess these files so that they will be OCR-ed as required. For that, you can run this simple query on the Shell interface :

Q().StartAt("_FileEntry").Tx().Set("Indexed", false).Commit();

Did this answer your question?