The other day I saw the navigation panel of Tesla’s Full Self-Driving module, and I was amazed by how accurately it depicted the real world. Buildings, vehicles, pedestrians, and even trees were re-created in this virtual world and fed to the Autopilot model.

Tesla Autopilot navigation panel. Source: https://www.tesla.com/jp/autopilot

I decided to recreate what this module does: analyze a video taken in the real world and identify the moving objects in it.