OPUS 4 | Search

2 search hits

1 to 2

Sort by

Machine Learning Based Reconstruction of Non-Regularly Sampled Raw Images (2023)

Today’s digital cameras use a mosaic of red, green, and blue color filters to capture images in three color channels on a single sensor plane. This thesis investigates the use of convolutional neural networks (CNNs) for demosaicing – the process of reconstructing full-color images from raw mosaic sensor data. While there are existing CNNs for demosaicing raw images from the well-established regular Bayer color filter array (CFA), this thesis focuses on how they perform on alternative non-regular sampling patterns that produce less aliasing artifacts, namely the stochastic Gaussian- and the RandomQuarter sampling pattern (Backes and Fröhlich, 2020). A basic UNet (Ronneberger et al., 2015) and the spatially adaptive SANet (T. Zhang et al., 2022) are implemented in a supervised training pipeline based on the PixelShift200 image dataset (Qian et al., 2021) to investigate their suitability for the irregular demosaicing task. The experiments indicate that the basic UNet encounters difficulties in restoring the missing color values, whereas the spatially adaptive convolutional layers help in processing the irregularly sampled raw images. In addition, this thesis enhances SANet effectiveness by employing an alternative residual branch based on a CFA-normalized Gaussian filter, as well as a tileable modification to the Gaussian CFA pattern. The modified SANet is shown to outperform the conventional dFSR algorithm (Backes & Fröhlich, 2020) in terms of peak signal to noise ratio (PSNR) and structural similarity index measure (SSIM).

Implementing Deep Learning Object Recognition on NAO (2016)

Philippczyk, Yann

Deep learning methods have proven highly effective for object recognition tasks, especially in the form of artificial neural networks. In this bachelor’s thesis, a way is shown to imple- ment a ready-to-use object recognition implementation on the NAO robotic platform using Convolutional Neural Networks based on pretrained models. Recognition of multiple objects at once is realized with the help of the Multibox algorithm. The implementation’s object recognition rates are evaluated and analyzed in several tests. Furthermore, the implementation offers a graphical user interface with several options to adjust the recognition process and for controlling movements of the robot’s head in order to easier acquire objects in the field of view. Additionally, a dialogue system for querying further results is presented.

1 to 2

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Institute

2 search hits