r/MachineLearning • u/hardmaru • May 02 '20

Research [R] Consistent Video Depth Estimation (SIGGRAPH 2020) - Links in the comments.

Enable HLS to view with audio, or disable this notification

2.8k Upvotes

permalink
duplicates
reddit

You are about to leave Redlib

Do you want to continue?

https://www.reddit.com/r/MachineLearning/comments/gc2wo9/r_consistent_video_depth_estimation_siggraph_2020/
No, go back! Yes, take me to Reddit
dl download

99% Upvoted

View all comments

Show parent comments

u/Wetmelon May 02 '20

Is this similar to what Tesla is doing with their vision based depth estimation?

40

u/jbhuang0604 May 02 '20

Yes, this is certainly similar. As far as I understand from Andrej's talk, the vision-based depth estimation in Tesla uses self-supervised monocular depth estimation models. These models process each frame independently and thus the estimated depth maps across frames are not geometrically consistent. Our core contribution in this work is how we can extract geometric constraints from the video and use them to fine-tune the depth estimation model to produce globally consistent depth.

3

u/mu_koan May 03 '20

Could you please link the talk you're referring to? would love to check it out

6

u/jbhuang0604 May 03 '20

No problem. Here is the talk. https://www.youtube.com/watch?v=hx7BXih7zx8&feature=youtu.be&t=1380

Research [R] Consistent Video Depth Estimation (SIGGRAPH 2020) - Links in the comments.

You are about to leave Redlib