# PyTorch mip-NeRF

A reimplementation of mip-NeRF in PyTorch.

Not an exact 1-to-1 port of the official repo: we organized the code to our own liking (mostly how the datasets are structured, plus hyperparameter changes so the code runs on a consumer-level graphics card), made it more modular, and removed some repetitive code. It achieves the same results.

## Features

* Can use spherical or spiral poses to generate videos for all 3 datasets (a minimal pose-generation sketch follows this list)
  * Spherical:

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/video.mp4" type="video/mp4">)

[//]: # (</video>)

  * Spiral:

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/video_spiral.mp4" type="video/mp4">)

[//]: # (</video>)

* Depth and normals video renderings (see the second sketch after this list)
  * Depth:

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/depth.mp4" type="video/mp4">)

[//]: # (</video>)

  * Normals:

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/normals.mp4" type="video/mp4">)

[//]: # (</video>)

* Can extract meshes (see the extraction sketch under Installation/Running)
  * Default mesh:

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/mesh.mkv" type="video/x-matroska">)

[//]: # (</video>)

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/mic/mesh.mkv" type="video/x-matroska">)

[//]: # (</video>)
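
Here is a rough sketch of how spherical render poses can be generated. This is illustrative only, not the actual code in `pose_utils.py`; `look_at` and `spherical_poses` are hypothetical names.

```python
import numpy as np

def normalize(v):
    # Scale a vector to unit length.
    return v / np.linalg.norm(v)

def look_at(cam_pos, target=np.zeros(3), up=np.array([0.0, 0.0, 1.0])):
    # Build a 4x4 camera-to-world matrix whose -z axis points at `target`
    # (the OpenGL-style convention used by NeRF-family codebases).
    forward = normalize(cam_pos - target)     # camera +z axis
    right = normalize(np.cross(up, forward))  # camera +x axis
    true_up = np.cross(forward, right)        # camera +y axis
    c2w = np.eye(4)
    c2w[:3, 0], c2w[:3, 1] = right, true_up
    c2w[:3, 2], c2w[:3, 3] = forward, cam_pos
    return c2w

def spherical_poses(n_frames=120, radius=4.0, elevation_deg=30.0):
    # One full orbit around the scene origin at a fixed elevation.
    phi = np.deg2rad(elevation_deg)
    thetas = np.linspace(0.0, 2.0 * np.pi, n_frames, endpoint=False)
    return np.stack([
        look_at(radius * np.array([np.cos(t) * np.cos(phi),
                                   np.sin(t) * np.cos(phi),
                                   np.sin(phi)]))
        for t in thetas
    ])
```

A spiral path is the same idea with the radius and height varied smoothly across frames instead of held fixed.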
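
One common way depth and normal maps fall out of a NeRF-style model (a minimal sketch under assumed tensor shapes; `density_fn` is a hypothetical callable, and the repo's actual rendering code may differ):

```python
import torch
import torch.nn.functional as F

def render_depth(sigmas, t_vals):
    # sigmas: density per sample, shape [..., n_samples + 1]
    # t_vals: sample distances along each ray, shape [..., n_samples + 1]
    deltas = t_vals[..., 1:] - t_vals[..., :-1]
    alpha = 1.0 - torch.exp(-sigmas[..., :-1] * deltas)
    # Transmittance T_i, then volume-rendering weights w_i = T_i * alpha_i.
    trans = torch.cumprod(
        torch.cat([torch.ones_like(alpha[..., :1]), 1.0 - alpha + 1e-10], -1), -1
    )[..., :-1]
    weights = alpha * trans
    mids = 0.5 * (t_vals[..., 1:] + t_vals[..., :-1])
    # Depth is the expected termination distance along the ray.
    return (weights * mids).sum(-1)

def density_normals(density_fn, pts):
    # Normals as the negative, normalized gradient of density w.r.t. position.
    pts = pts.clone().requires_grad_(True)
    grad = torch.autograd.grad(density_fn(pts).sum(), pts)[0]
    return -F.normalize(grad, dim=-1)
```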
## Future Plans

In the future we plan to implement or change:

* Factor out more repetitive/redundant code, and optimize GPU memory usage and rays per second (rps)
* Clean up and expand the mesh extraction code
* Zoomed poses for the Multicam dataset
* [Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields](https://jonbarron.info/mipnerf360/) support
* [NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis](https://pratulsrinivasan.github.io/nerv/) support

## Installation/Running

1. Create a conda environment from `mipNeRF.yml`: `conda env create -f mipNeRF.yml`
2. Get the training data, either all at once or per dataset:
   1. Run `bash scripts/download_data.sh` to download all 3 datasets: LLFF, Blender, and Multicam.
   2. Or run the bash script corresponding to an individual dataset:
      * `bash scripts/download_llff.sh` to download LLFF
      * `bash scripts/download_blender.sh` to download Blender
      * `bash scripts/download_multicam.sh` to download Multicam (note that this also downloads the Blender dataset, since Multicam is derived from it)
3. Optionally change config parameters: edit the defaults in `config.py` or override them with command-line arguments
   * The default config is set up to run on a high-end consumer-level graphics card (~8-12 GB of VRAM)
4. Run `python train.py` to train
   * Run `python -m tensorboard.main --logdir=log` to start TensorBoard
5. Run `python visualize.py` to render a video from the trained model
6. Run `python extract_mesh.py` to extract a mesh from the trained model (a sketch of the usual approach follows)
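
Mesh extraction in NeRF-style models typically samples the learned density on a regular grid and runs marching cubes on it. A minimal sketch of that approach (not the exact contents of `extract_mesh.py`; `query_density`, `bound`, and `threshold` are illustrative):

```python
import numpy as np
import torch
from skimage import measure  # marching cubes

@torch.no_grad()
def density_to_mesh(query_density, bound=1.2, resolution=256, threshold=50.0):
    # Sample density on a regular 3D grid covering [-bound, bound]^3.
    t = np.linspace(-bound, bound, resolution, dtype=np.float32)
    xyz = np.stack(np.meshgrid(t, t, t, indexing="ij"), -1).reshape(-1, 3)
    sigmas = []
    for chunk in np.array_split(xyz, 512):  # chunk to fit in GPU memory
        pts = torch.from_numpy(chunk).cuda()
        sigmas.append(query_density(pts).cpu().numpy())  # hypothetical helper
    grid = np.concatenate(sigmas).reshape(resolution, resolution, resolution)
    # Marching cubes at a fixed density threshold yields a triangle mesh.
    verts, faces, normals, _ = measure.marching_cubes(grid, level=threshold)
    # Rescale vertices from grid indices back to world coordinates.
    verts = verts / (resolution - 1) * 2.0 * bound - bound
    return verts, faces, normals
```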

## Code Structure

I explain the specifics of the code in more detail [here](misc/Code.md), but here is a basic rundown (a condensed training-step sketch follows the list).

* `config.py`: Specifies hyperparameters.
* `datasets.py`: Base generic `Dataset` class + 3 default dataset implementations.
  * `NeRFDataset`: Base class that all datasets should inherit from.
  * `Multicam`: Used for multicam data as in the original mip-NeRF paper.
  * `Blender`: Used for the synthetic dataset as in the original NeRF.
  * `LLFF`: Used for the LLFF dataset as in the original NeRF.
* `loss.py`: The mip-NeRF loss; essentially just MSE, but it also computes PSNR.
* `model.py`: The mip-NeRF model. Not as modular as the original authors' version, but its structure is easier to follow when laid out explicitly like this.
* `pose_utils.py`: Various functions used to generate poses.
* `ray_utils.py`: Various functions for the rays the model takes as input; most are used within the model's forward function.
* `scheduler.py`: The mip-NeRF learning rate scheduler.
* `train.py`: Trains a mip-NeRF model.
* `visualize.py`: Creates the videos using a trained mip-NeRF.
* `extract_mesh.py`: Extracts a mesh from a trained mip-NeRF.
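
To show how these pieces fit together, here is a heavily condensed sketch of one training step. It is illustrative only: the real `train.py` differs, and the model's exact outputs and arguments here are assumptions. The coarse/fine weighting and the PSNR formula follow the mip-NeRF paper.

```python
import torch

def train_step(model, rays, pixels, optimizer, scheduler, coarse_mult=0.1):
    # mip-NeRF renders each ray at two levels (coarse and fine); the loss is
    # MSE at both levels, with the coarse term down-weighted (0.1 in the paper).
    rgb_coarse, rgb_fine = model(rays)  # assumed output signature
    mse_coarse = ((rgb_coarse - pixels) ** 2).mean()
    mse_fine = ((rgb_fine - pixels) ** 2).mean()
    loss = coarse_mult * mse_coarse + mse_fine
    # For pixel values in [0, 1], PSNR follows directly from the MSE.
    psnr = -10.0 * torch.log10(mse_fine)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    scheduler.step()  # the original repo decays the lr log-linearly, with warmup
    return loss.item(), psnr.item()
```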

## mip-NeRF Summary

Here's a summary of how NeRF and mip-NeRF work that I wrote while first writing this implementation.

* [Summary](misc/Summary.md)
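
The core change from NeRF, in one formula: instead of encoding points, mip-NeRF casts a cone per pixel, approximates each conical frustum by a Gaussian with mean $\mu$ and covariance $\Sigma$, and featurizes it with the integrated positional encoding from the paper (the expected positional encoding under that Gaussian):

```math
\gamma(\mu, \Sigma) = \left\{
  \sin\!\left(2^{\ell}\mu\right) \exp\!\left(-2^{2\ell-1}\operatorname{diag}(\Sigma)\right),\;
  \cos\!\left(2^{\ell}\mu\right) \exp\!\left(-2^{2\ell-1}\operatorname{diag}(\Sigma)\right)
\right\}_{\ell=0}^{L-1}
```

The exponential factor damps high-frequency components when the Gaussian is wide, which is what gives mip-NeRF its anti-aliasing behavior.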

## Results

### LLFF - Trex

<div>
  <img src="misc/results/trex/LR.png" alt="pic0" width="49%">
  <img src="misc/results/trex/Evaluation_PSNR.png" alt="pic1" width="49%">
</div>
<div>
  <img src="misc/results/trex/Train_Loss.png" alt="pic2" width="49%">
  <img src="misc/results/trex/Train_PSNR.png" alt="pic3" width="49%">
</div>

<br>
Video:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/trex/video.mp4" type="video/mp4">)

[//]: # (</video>)
<br>
Depth:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/trex/depth.mp4" type="video/mp4">)

[//]: # (</video>)
<br>
Normals:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/trex/normals.mp4" type="video/mp4">)

[//]: # (</video>)

### Blender - Lego

<div>
  <img src="misc/results/lego/LR.png" alt="pic0" width="49%">
  <img src="misc/results/lego/Evaluation_PSNR.png" alt="pic1" width="49%">
</div>
<div>
  <img src="misc/results/lego/Train_Loss.png" alt="pic2" width="49%">
  <img src="misc/results/lego/Train_PSNR.png" alt="pic3" width="49%">
</div>

<br>
Video:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/video.mp4" type="video/mp4">)

[//]: # (</video>)
<br>
Depth:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/depth.mp4" type="video/mp4">)

[//]: # (</video>)
<br>
Normals:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/lego/normals.mp4" type="video/mp4">)

[//]: # (</video>)

### Multicam - Mic

<div>
  <img src="misc/results/mic/LR.png" alt="pic0" width="49%">
  <img src="misc/results/mic/Evaluation_PSNR.png" alt="pic1" width="49%">
</div>
<div>
  <img src="misc/results/mic/Train_Loss.png" alt="pic2" width="49%">
  <img src="misc/results/mic/Train_PSNR.png" alt="pic3" width="49%">
</div>

<br>
Video:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/mic/video.mp4" type="video/mp4">)

[//]: # (</video>)
<br>
Depth:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/mic/depth.mp4" type="video/mp4">)

[//]: # (</video>)
<br>
Normals:
<br>

[//]: # (<video controls>)

[//]: # (  <source src="misc/results/mic/normals.mp4" type="video/mp4">)

[//]: # (</video>)

## References/Contributions

* Thanks to [Nina](https://github.com/ninaahmed) for helping with the code
* [Original NeRF Code in Tensorflow](https://github.com/bmild/nerf)
* [NeRF Project Page](https://www.matthewtancik.com/nerf)
* [NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis](https://arxiv.org/abs/2003.08934)
* [Original mip-NeRF Code in JAX](https://github.com/google/mipnerf)
* [mip-NeRF Project Page](https://jonbarron.info/mipnerf/)
* [Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields](https://arxiv.org/abs/2103.13415)
* [nerf_pl](https://github.com/kwea123/nerf_pl)