Skip to content
videodb
VideoDB Documentation
  • Pages
    • Welcome to VideoDB Docs
    • Quick Start Guide
      • Video Indexing Guide
      • Semantic Search
      • Collections
      • Public Collections
      • Callback Details
      • Ref: Subtitle Styles
      • Language Support
      • Guide: Subtitles
      • How Accurate is Your Search?
    • Visual Search and Indexing
      • Scene Extraction Algorithms
      • Custom Annotations
      • Scene-Level Metadata: Smarter Video Search & Retrieval
      • Advanced Visual Search Pipelines
      • Playground for Scene Extractions
      • Deep Dive into Prompt Engineering : Mastering Visual Indexing
      • How VideoDB Solves Complex Visual Analysis Tasks
      • Multimodal Search: Quickstart
      • Conference Slide Scraper with VideoDB
    • Examples and Tutorials
      • Dubbing - Replace Soundtrack with New Audio
      • VideoDB: Adding AI Generated voiceovers to silent footage
      • Beep curse words in real-time
      • Remove Unwanted Content from videos
      • Instant Clips of Your Favorite Characters
      • Insert Dynamic Ads in real-time
      • Adding Brand Elements with VideoDB
      • Elevating Trailers with Automated Narration
      • Add Intro/Outro to Videos
      • Audio overlay + Video + Timeline
      • Building Dynamic Video Streams with VideoDB: Integrating Custom Data and APIs
      • AI Generated Ad Films for Product Videography
      • Fun with Keyword Search
      • Overlay a Word-Counter on Video Stream
      • Generate Automated Video Outputs with Text Prompts | VideoDB
      • VideoDB x TwelveLabs: Real-Time Video Understanding
      • Multimodal Search
      • How I Built a CRM-integrated Sales Assistant Agent in 1 Hour
      • Make Your Video Sound Studio Quality with Voice Cloning
      • Automated Traffic Violation Reporter
    • Live Video→ Instant Action
    • Generative Media Quickstart
      • Generative Media Pricing
    • Video Editing Automation
      • Fit & Position: Aspect Ratio Control
      • Trimming vs Timing: Two Independent Timelines
      • Advanced Clip Control: The Composition Layer
      • Caption & Subtitles: Auto-Generated Speech Synchronization
      • Notebooks
    • Transcoding Quickstart
    • director-light
      Director - Video Agent Framework
      • Agent Creation Playbook
      • Setup Director Locally
    • Workflows and Integrations
      • zapier
        Zapier Integration
        • Auto-Dub Videos & Save to Google Drive
        • Create & Add Intelligent Video Highlights to Notion
        • Create GenAI Video Engine - Notion Ideas to Youtube
        • Automatically Detect Profanity in Videos with AI - Update on Slack
        • Generate and Store YouTube Video Summaries in Notion
        • Automate Subtitle Generation for Video Libraries
        • Solve customers queries with Video Answers
      • n8n
        N8N Workflows
        • AI-Powered Meeting Intelligence: Recording to Insights Automation
        • AI Powered Dubbing Workflow for Video Content
        • Automate Subtitle Generation for Video Libraries
        • Automate Interview Evaluations with AI
        • Turn Meeting Recordings into Actionable Summaries
        • Auto-Sync Sales Calls to HubSpot CRM with AI
        • Instant Notion Summaries for Your Youtube Playlist
    • Meeting Recording SDK
    • github
      Open Source
      • llama
        LlamaIndex VideoDB Retriever
      • PromptClip: Use Power of LLM to Create Clips
      • StreamRAG: Connect ChatGPT to VideoDB
    • mcp
      VideoDB MCP Server
    • videodb
      Give your AI, Eyes and Ears
      • Building Infrastructure that “Sees” and “Edits”
      • Agents with Video Experience
      • From MP3/MP4 to the Future with VideoDB
      • Dynamic Video Streams
      • Why do we need a Video Database Now?
      • What's a Video Database ?
      • Enhancing AI-Driven Multimedia Applications
      • Beyond Traditional Video Infrastructure
    • Customer Love
    • Join us
      • videodb
        Internship: Build the Future of AI-Powered Video Infrastructure
      • Ashutosh Trivedi
        • Playlists
        • Talks - Solving Logical Puzzles with Natural Language Processing - PyCon India 2015
      • Ashish
      • Shivani Desai
      • Gaurav Tyagi
      • Rohit Garg
      • Edge of Knowledge
        • Language Models to World Models: The Next Frontier in AI
        • Society of Machines
          • Society of Machines
          • Autonomy - Do we have the choice?
          • Emergence - An Intelligence of the collective
        • Building Intelligent Machines
          • icon picker
            Part 1 - Define Intelligence
          • Part 2 - Observe and Respond
          • Part 3 - Training a Model
      • Updates
        • VideoDB Acquires Devzery: Expanding Our AI Infra Stack with Developer-First Testing Automation

Part 1 - Define Intelligence

Imagine a future where you are not stuck in traffic because machines are driving vehicles, not humans. A future where everyone has a personal assistant to do mundane tasks. A future where industry is more productive and per capita income is rising.
image.jpeg
That future is already here, just not evenly distributed across the world. Some parts of the world are experiencing it earlier than the others. A big driving force behind this development is advancement of AI technologies.
The impact of AI on economy and industry productivity is growing exponentially. Look at the chart below – it shows that — AI is doubling the baseline growth in most sectors. This happens rarely. It means new markets, improved GDP, improved per capita income and improved quality of life.
image.jpeg
Industrial revolution took people out of field jobs and moved them to offices to manage processes and troubleshoot problems. Humans switched to managing the job while the real job was done by robotic assembly lines or machines.
Similarly, AI technologies automate repeated tasks and free humans from drudgery of mundane jobs. Some jobs will be taken away but new jobs will be created for humans to do what they do best — Thinking and Imagining.
“The last 10 years have been about building a world that is mobile-first. In the next 10 years, we will shift to a world that is AI-first.” - Sundar Pichai, CEO of Google
AI technologies are all about building intelligent machines and to do that you need to make machines that learn and understand our world. Now, how can we make machines learn and what does learning mean for a machine anyway? In this series of posts, I want to take a non-technical view on AI and Machine Learning.
Before we start thinking about artificial intelligence. Let’s first understand natural intelligence — the biological kind – the one that you and I are born with. Later, we’ll look at the process of creating the artificial one. There are few thinking exercises in this post so, I would suggest grabbing a notepad.

WHAT IS INTELLIGENCE?

Our brain is a fascinating organ with unimaginable capabilities. We are intelligent beings. But what is this intelligence? Take a moment and think about it… Try writing down your answers.
Some questions to think aboutHow do you know that you are intelligent? What differentiates you from something which you consider less intelligent?
image.jpeg

If you are done with writing or thinking your answers about intelligence, you will notice that most of what we call intelligence, is a process called Decision Making.
We, or our brains take decisions all the time. Sometimes it is a conscious choice but many a times it is an unconscious process. Even for putting our foot forward to take a step, brain calculates and decide where and how to do it. We calculate and decide all the time. But there are other aspects of intelligence that are different which we don’t fully understand yet- for example, creating art, poetry or music and even something like consciousness, the voice which speaks to you in your head, what kind of intelligence is that?
If we have to sum up everything about intelligence to cover all aspects of it, can we look at it from the point of view of Interaction? Humans, animals, insects, non living things all interact with this world – but in dramatically different ways. These limitations and dimensions of interactions with their environment tells us about the scale of their intelligence. So for me –
Intelligence is a measure of the magnitude of our interaction with this world.
Intelligence therefore, is not binary but a spectrum – a pup interacts with the world but rarely thinks about consciousness or happiness. It’s dimensions of interaction are limited as compared to humans. But is still intelligent.
image.jpeg
An intelligent pup

DECISION MAKING + INTERACTION WITH THE WORLD

The reason why most computers and machines today focus on decision making is because it is a kind of intelligence which we have understood well scientifically. We have good frameworks for it which can be applied to create machines with more decision making power. We will deep dive into human decision making in part 2 of this series but let’s look at the decision making and interaction with the world together. We can gain some interesting insights. It’s a good framework to study about current machines and how they will transform in the AI era.
This is a machine — a logical machine or as we call it, software.
image.jpeg
If you observe the behavior of this software and its interaction with the world – it’s fixed and rigid. As a user, you can only provide a fix type of inputs to which it will validate and act logically to respond. I think we can safely say that the interaction is very limited.
It’s a good software because it works great. But this software is not capable of taking a lot of decisions. Every possibility is thought of beforehand by its developer, starting from validating the inputs.
Now, have a look at Alexa. This software is expected to interact with the world in a very different way. There are infinite possibilities to interact with this kind of software . The
image.jpeg
developer of this machine can never think of all kind of inputs it will receive and how to respond to it. Anyone can ask anything. The decision making power of such machines has to be great.
If we understand the core of the decision making process of natural intelligence, we will be able to add amazing capabilities to our machines. We can greatly extend their interaction with the environment they operate in.
So, the quest of all AI technologies will be to add more dimensions of interaction and understanding the process of decision making will be the key. In , I discuss the process of human decision making and later apply it to creating softwares or machines capable of taking more decisions.
 
Want to print your doc?
This is not the way.
Try clicking the ··· in the right corner or using a keyboard shortcut (
CtrlP
) instead.