Skip to content
[New] Concise and Practical AI/ML
  • Pages
    • Preface
    • Artificial Intelligence
      • Concepts
      • High-level Intelligence
    • Maths for ML
      • Calculus
      • Algebra
    • Machine Learning
      • History of ML
      • ML Models
        • ML Model is Better
        • How a Model Learns
        • Boosted vs Combinatory
      • Neuralnet
        • Neuron
          • Types of Neurons
        • Layers
        • Neuralnet Alphabet
        • Heuristic Hyperparams
      • Feedforward
        • Input Separation
      • Backprop
        • Activation Functions
        • Loss Functions
        • Gradient Descent
        • Optimizers
      • Design Techniques
        • Normalization
        • Regularization
          • Drop-out Technique
        • Concatenation
        • Overfitting & Underfitting
        • Explosion & Vanishing
      • Engineering Techniques
    • Methods of ML
      • Supervised Learning
        • Regression
        • Classification
      • Reinforcement Learning
        • Concepts
        • Bellman Equation
        • icon picker
          Q-table
        • Q-network
        • Learning Tactics
          • Policy Network
      • Unsupervised Learning
        • Some Applications
      • Other Methods
    • Practical Cases
    • Ref & Glossary

Q-table

Code Files

What Q-table Is

Q-table is a type of q-function, just as q-network is also a type of q-function.
Q-table is the knowledge storage for classic q-learning. Q-table is still used in education but no longer effective in industry because it needs a huge a mount of computer memory to store data. Q-table stores the value of q-function for pairs of state and actions.

Q-learning on Q-table

Use this table update formula based on Bellman Equation ​
image.png

Q-table Init

Need to randomize to all near zero
No meaningful q values known yet. Q-table is similar to known good actions for rewards if initialized to non-zeros, thus non-zero means wrong.
But need randomness for variations.

 
Want to print your doc?
This is not the way.
Try clicking the ··· in the right corner or using a keyboard shortcut (
CtrlP
) instead.