Bachelor-/ Master Thesis/ Internship: Combining real camera characteristics with foundation models
28.05.2025, Diplomarbeiten, Bachelor- und Masterarbeiten
External thesis or internship at Arnold & Richter Cine Technik GmbH & Co. Betriebs KG
Recent advances in generation depth from 2D images shows impressing results, e.g. https://arxiv.org/pdf/2410.02073. Multiscale Vision Transformers are used to generate sharp depth maps together with estimating a focal length in pixels.
To get a reliable depth value, i.e. a distance in meters from the camera, additional camera data could be taken into account. This work investigates, how the available models can be used as a foundation model to take advantage of the extensive training on large datasets. In this work you set up a training procedure that allows the additional inputs from a known camera system to be fed in. These camera characteristics are extracted from available measurements and optical simulations.
Aiming a model creating depth maps that do not only look great but are also accurate in numbers and could later be used for real movie-making applications.
Based on background and interests the topic can be a thesis or an internship or a combination.
Contact: Dr. Tamara Seybold
Kontakt: tseybold@arri.de