The program reads the RGB frames from the Intel Realsense depth cameras and detects objects (chairs) using the OpenCV implementation of YOLOv5. It then localizes the object in 3D geometry using depth information and uses this information to generate images where the object is overlayed with a generic 3D model using OpenGL