Share
Explore

icon picker
VideoDB Vs Azure Media Indexer

Feature Comparison
Feature
VideoDB
Azure Media Indexer
1
Video Streaming
Yes
Yes
2
Programmable Video Editing ( clip, compile, logo overlay, captions, AI generated audio, translation etc.)
Yes
No ( Only face blur )
3
Storage
Yes
Yes
4
Spoken Analysis
Transcription with word level timestamps
Transcription, topics with sentence level timestamps.
5
Visual Analysis
Vision Model Based
Label based
6
Search Infrastructure for Spoken and Vision
Yes
No
7
Custom Annotations
Yes
Not possible
8
Multimodal Index
Yes ( Late fusion )
Not possible
There are no rows in this table
Pricing Comparision
Feature
Metric
VideoDB Price
Azure Price
1
Data Storage
Cost to store your uploaded content Size (GB)
$0.03/GB/month
$0.15 - Premium $0.018 - Hot
2
Index Storage
Cost to store your indexes Minutes
$0.0005/min/month
NA
3
Spoken Index
One time cost to index conversations Minutes (Indexed)
$0.02/min
$0.012 ( Basic ) $0.024 ( Standard ) $0.04 ( Advanced )
4
Scene Index
One time cost to index scenes Minutes (Indexed)
$0.09/min
$0.045 ( Basic ) $0.09 ( Standard ) $0.15 ( Advanced )
5
Search Queries
Search across videos based on semantic or keyword based query.
$0.025/query
NA
6
Programmable Stream Generation (720p)
Programmable streams include modifications like edits, compilations, overlays etc. Minutes (Generated)
$0.06/min
NA
7
Simple Streaming (720p)
Depends on the number of views your streams receive Minutes (Streamed)
$0.000998/min
NA
There are no rows in this table

Main Advantage of VideoDB:

It’s a complete video infrastructure where you don’t have to maintain anything else, or work with video files directly.
You can organize videos into collection of videos to segregate and manage.
You can also upload audio and images in VideoDB.
Provides search across videos on collections of videos. semantic, scene and keyword search with tweaking parameters
Better control over keyframe detection algorithms.
Much more sophisticated vision understanding using cutting edge vision models.
Ability to bring your own LLM for vision understanding.
Ability to create multimodal search queries.
Programmable video streams to create clips, compilation of videos etc.

Can Azure media indexer search across all indexed video files?
No, You’ll have to create and maintain your own search infrastructure. It only provides information in json format with a standard label. VideoDB has semantic search built in with parameters to tweak the accuracy and recall.

Does Azure offer any kind of multimodal search?
No, It only extract insights in json. VideoDB can provide the multimodal late fusion search API.

Does it provide Video Answers ?
Azure media indexer doesn’t offer video answers or video stream answers. VideoDB’s search results would have parts of video stream with exact moments. It can easily embed into any application.

How does VideoDB compare against standard and advanced spoken and vision index of Azure media indexer?

Spoken :

Standard : timestamps are not word level. Provides keywords and topics which are generic and prone to false positives as observed in our analysis.
Advanced : Different model, but same analysis as standard.
VideoDB : Word level transcript, semantic and keyword based search. Easy to build pipelines for NLP ( keyword and topics ) analysis. For example beeping curse words etc.

Vision

Azure’s vision indexer runs on object and label detection type of model, VideoDB uses vision models to describe the frames.
Vision model based understanding of frames is more advanced compared to label based understanding.
VideoDB has freedom to choose keyframe extraction algorithms. Users also have choice to choose any vision model to describe the frames, setup prompts used for describing the frames.

Multimodal

Using late fusion, multimodal search queries like - “show me where the therapist asked to raise hands and kid raise the hands” are possible.
Azure Media Indexer can’t solve such queries out of the box.

Example chat app built on spoken Index

Want to print your doc?
This is not the way.
Try clicking the ⋯ next to your doc name or using a keyboard shortcut (
CtrlP
) instead.