Azure AI Video Indexer: A Powerful Tool for Video Analysis

Article

Introduction

With the rise of digital content, managing and analyzing videos has become more important than ever. Azure AI Video Indexer (VI) is a cloud-based service that helps businesses and developers extract valuable insights from videos using artificial intelligence (AI). This tool can automatically analyze video content, recognize faces, detect emotions, extract speech-to-text transcriptions, and even generate summaries.

Development

Azure AI Video Indexer is part of Microsoft's AI services and uses machine learning to process and analyze video files. It provides a set of powerful features that help users understand video content quickly and efficiently.

Key Features of Video Indexer

Speech-to-Text and Translations
- Converts spoken words in videos into text
- Supports multiple languages and can provide translations
Face and Object Recognition
- Identifies known celebrities and people from a custom database
- Detects objects, such as cars, buildings, and other relevant elements
Sentiment and Emotion Analysis
- Analyzes the tone and emotions of people in the video
- Helps in understanding audience reactions
Keyword Extraction and Topic Detection
- Identifies important keywords from speech and visual content
- Group videos by topics for better organization
Scene and Shot Detection
- Breaks down videos into scenes and shots
- Helps in fast navigation and editing

Integration of OpenAI in Video Indexer

Microsoft has integrated OpenAI technology into Azure AI Video Indexer to enhance its capabilities. OpenAI models improve the accuracy of transcriptions and summaries, making it easier to understand and categorize video content. AI-powered text generation can also help create descriptions and highlights for videos, improving accessibility and content management.

Demo

Go to the Azure portal and create an Azure OpenAI resource.

Create Azure OpenAI

Then, we can create the VI resource.

Azure AI Video Indexer

You can use a free, trial account Azure AI Video Indexer and try it out with your content. No Azure subscription is required. The account provides up to 2,400 minutes of free indexing. But, trial accounts that haven't been used for 12 months might be deleted. To prevent its deletion, simply sign in the trial account.

Existing VI accounts created before June 21, 2022 were able to access face identification, customization and celebrities recognition till June 2023. In order to enable these features, you must fill a form that is in the documentation.

For further details about VI account types, visit this page.

From the wizard, you can create a Storage Account or select an existing one in the resource group.

Create Storage account

Select the Storage Account and OpenAI resources. Click on Review + Create.

Create Video Indexer resource

Once the VI resource is created, you will see two error alerts that indicate to assign a respective role, click on those buttons.

ccindexer

VI Resource

Before starting to upload a video, we must deploy an Open AI model.

From the Azure portal, go to the OpenAI resource you created earlier and click on Go to Azure AI Foundry portal.

Go to Azure AI Foundry Portal

Go to the Deployments tab and click on Deploy model/base model.

Deploy model

The supported models to use with VI are the following:

Llama 2
Phi 2
GPT3-5 Turbo
GPT4

For further details about the use of Generative AI with VI, visit this page.

For this demo, we used a GPT-4 model.

GPT-4 Model

Click on Deploy.

Deploy model GPT-4

After that, you can upload a video. Go to the VI portal, sign in, and select the VI paid account you created earlier.

Select accounts

In the Media Files / Libary tab, we can upload a video.

Azure AI Video Indexer

Write the file name and video source language. Click on Review + Upload.

Upload and Index

Click on Upload + index.

Upload and Index

Wait until the indexing reaches 100%.

Library

Once the video is indexed you can see the insights.

Video is indexed

Also, the video timeline is available.

Video Timeline

If you didn’t link an Azure OpenAI resource when VI was created, you will see a message like this.

Azure OpenAI resource

Otherwise, you can generate the textual summary by clicking on Configure and Generate.

Configure and generate

Select the model deployed and click on Generate Summary.

Generate summary

Wait a few seconds for the summary generation.

Summary

And these are the rest of the video insights.

Video Insights

Labels

Conclusion

Azure AI Video Indexer is a powerful tool for analyzing videos with AI. It provides automatic transcription, facial recognition, sentiment analysis, and more, making it easier to manage and search through video content. With the integration of OpenAI models, the service continues to improve in accuracy and usability. Businesses and developers can use this tool to enhance their video workflows, making content more accessible and valuable.

Thanks for reading

Thank you very much for reading. I hope you found this article interesting and may be useful in the future. If you have any questions or ideas you need to discuss, it will be a pleasure to collaborate and exchange knowledge.