Introduction
With the rise of digital content, managing and analyzing videos has become more important than ever. Azure AI Video Indexer (VI) is a cloud-based service that helps businesses and developers extract valuable insights from videos using artificial intelligence (AI). This tool can automatically analyze video content, recognize faces, detect emotions, extract speech-to-text transcriptions, and even generate summaries.
Development
Azure AI Video Indexer is part of Microsoft's AI services and uses machine learning to process and analyze video files. It provides a set of powerful features that help users understand video content quickly and efficiently.
Key Features of Video Indexer
- Speech-to-Text and Translations
- Converts spoken words in videos into text
- Supports multiple languages and can provide translations
- Face and Object Recognition
- Identifies known celebrities and people from a custom database
- Detects objects, such as cars, buildings, and other relevant elements
- Sentiment and Emotion Analysis
- Analyzes the tone and emotions of people in the video
- Helps in understanding audience reactions
- Keyword Extraction and Topic Detection
- Identifies important keywords from speech and visual content
- Group videos by topics for better organization
- Scene and Shot Detection
- Breaks down videos into scenes and shots
- Helps in fast navigation and editing
Integration of OpenAI in Video Indexer
Microsoft has integrated OpenAI technology into Azure AI Video Indexer to enhance its capabilities. OpenAI models improve the accuracy of transcriptions and summaries, making it easier to understand and categorize video content. AI-powered text generation can also help create descriptions and highlights for videos, improving accessibility and content management.
Demo
Go to the Azure portal and create an Azure OpenAI resource.
![Create Azure OpenAI]()
Then, we can create the VI resource.
![Azure AI Video Indexer]()
You can use a free, trial account Azure AI Video Indexer and try it out with your content. No Azure subscription is required. The account provides up to 2,400 minutes of free indexing. But, trial accounts that haven't been used for 12 months might be deleted. To prevent its deletion, simply sign in the trial account.
Existing VI accounts created before June 21, 2022 were able to access face identification, customization and celebrities recognition till June 2023. In order to enable these features, you must fill a form that is in the documentation.
For further details about VI account types, visit this page.
From the wizard, you can create a Storage Account or select an existing one in the resource group.
![Create Storage account]()
Select the Storage Account and OpenAI resources. Click on Review + Create.
![Create Video Indexer resource]()
Once the VI resource is created, you will see two error alerts that indicate to assign a respective role, click on those buttons.
![ccindexer]()
![VI Resource]()
Before starting to upload a video, we must deploy an Open AI model.
From the Azure portal, go to the OpenAI resource you created earlier and click on Go to Azure AI Foundry portal.
![Go to Azure AI Foundry Portal]()
Go to the Deployments tab and click on Deploy model/base model.
![Deploy model]()
The supported models to use with VI are the following:
- Llama 2
- Phi 2
- GPT3-5 Turbo
- GPT4
For further details about the use of Generative AI with VI, visit this page.
For this demo, we used a GPT-4 model.
![GPT-4 Model]()
Click on Deploy.
![Deploy model GPT-4]()
After that, you can upload a video. Go to the VI portal, sign in, and select the VI paid account you created earlier.
![Select accounts]()
In the Media Files / Libary tab, we can upload a video.
![Azure AI Video Indexer]()
Write the file name and video source language. Click on Review + Upload.
![Upload and Index]()
Click on Upload + index.
![Upload and Index]()
Wait until the indexing reaches 100%.
![Library]()
Once the video is indexed you can see the insights.
![Video is indexed]()
Also, the video timeline is available.
![Video Timeline]()
If you didn’t link an Azure OpenAI resource when VI was created, you will see a message like this.
![Azure OpenAI resource]()
Otherwise, you can generate the textual summary by clicking on Configure and Generate.
![Configure and generate]()
Select the model deployed and click on Generate Summary.
![Generate summary]()
Wait a few seconds for the summary generation.
![Summary]()
And these are the rest of the video insights.
![Video Insights]()
![Labels]()
Conclusion
Azure AI Video Indexer is a powerful tool for analyzing videos with AI. It provides automatic transcription, facial recognition, sentiment analysis, and more, making it easier to manage and search through video content. With the integration of OpenAI models, the service continues to improve in accuracy and usability. Businesses and developers can use this tool to enhance their video workflows, making content more accessible and valuable.
Thanks for reading
Thank you very much for reading. I hope you found this article interesting and may be useful in the future. If you have any questions or ideas you need to discuss, it will be a pleasure to collaborate and exchange knowledge.