Remember me
Denoise ability: To enhance its robustness and generality in real situations and to examine its denoise ability, the input images are preprocessed by adding Gaussian noises, which is called degradation and is widely used in conventional semi-supervised or self-supervised denoising. Under this circumstance, the modified MAE aims to reconstruct the whole image instead of merely the missing patches.
(iii) Evaluations: For datasets in simulation, the reconstructed images were evaluated by the peak-signal-to-noise (PSNR) and structure similarity (SSIM).27,2827. Z. Wang, A. Bovik, H. Sheikh, and E. Simoncelli, “Image quality assessment: From error visibility to structural similarity,” IEEE Trans. Image Process. 13, 600–612 (2004). https://doi.org/10.1109/tip.2003.81986128. A. Horé and D. Ziou, “Image quality metrics: PSNR vs SSIM,” in 20th International Conference on Pattern Recognition, ICPR 2010, Istanbul, Turkey, . These two metrics are compelling assessment criteria for image denoising and reconstruction. To measure its performance, Davis2017,2929. J. Pont-Tuset, F. Perazzi, S. Caelles, P. Arbeláez, A. Sorkine-Hornung, and L. Van Gool, “The 2017 DAVIS challenge on video object segmentation,” arXiv:1704.00675 (2017). a relatively small dataset for unsupervised and semi-supervised learning in object segmentation, is chosen for testing and checking its average PSNR and SSIM. The 480p test-challenge part contains 2180 854 × 480 pixel sequential pictures, including several objects, details, and actions. Owing to the fixed input size of the network, a sliding window is adapted with details as shown in Fig. 3. Each time, a slice of the image is generated for the model and the final output is gathered to consist the reconstructed image. During the process, contiguous slices share 23 pixels to smooth the sharp boundary.(iv) Adjustment for real condition: Unlike MAE, which conveys the mask to the decoder directly, the optical MAE fails to align each patch of mask when confronted with real imaging. To be specific, it is hard to adjust the pictures captured and make the input mask and predefined mask equal. To tackle the problem, an additional module and several tricks on imaging are added to detect the mask accurately. First, mask patch sizes bigger than the input of the model are imposed by DMD. A larger area is captured by the optical system, and the photo is then compressed into befitting sizes. In this way, the distortion that skewed images bring is eliminated to a large extent. In addition, when figures and objects are presented, complex illumination and interference fringes disturb mask detection modules. Blank plates with masked patches are captured in advance. In this way, each masked patch of every masked blank plate is easy to localize and map to the corresponding measurement by conventional adaptive local filters and threshold methods. The process is illustrated in Fig. 4.
Comments (0)