This project is my own implementation of Unsupervised Monocular Depth Estimation with Left-Right Consistency paper by C. Godard, O. M. Aodha and G. J. Brostow.
The above picture shows the model used in this project. EfficientNet B4 model is used as a backbone for transfer learning which helps in reducing the training time.
For training the model above loss function is used which is taken from the Left Right Consistency paper mentioned above. It also uses image resampling (bilinear sampler) introduced in the Spatial Transformer Network paper.
The model in this project was trained and tested on GPU (NVIDIA GTX 1650 4GB) powered laptop.
Some dependencies used in the project are:
- Numpy v1.19.2
- Tensorflow v2.3
- Python v3.8.5
- Jupyter Notbook v6.2.0
- Matplotlib
For training model download the kitti stereo dataset from their website and place the left and right image in the same folder in which main file is. Then open the main file in Jupyter Notebook and run all the cells.
If you only want to try out the model then download the weight from here. Put the weight in the same folder with main file then run all cells except the one which has NUM_EPOCHS=100
at the starting.