A 2D image 3D reconstruction function adaptive denoising algorithm

Feng Wang; Weichuan Ni; Shaojiang Liu; Zhiming Xu; Zemin Qiu; Zhiping Wan

doi:10.7717/peerj-cs.1604

A 2D image 3D reconstruction function adaptive denoising algorithm

Feng Wang , Weichuan Ni, Shaojiang Liu, Zhiming Xu, Zemin Qiu, Zhiping Wan

Guangzhou Xinhua University, Dongguan, Guangdong, China

DOI: 10.7717/peerj-cs.1604

Published: 2023-10-03
Accepted: 2023-08-29
Received: 2023-05-18

Academic Editor: Muhammad Asif

Subject Areas: Algorithms and Analysis of Algorithms, Artificial Intelligence, Computer Vision, Data Science
Keywords: Denoising algorithm, Threshold, Adversarial generative network, 3D reconstruction

Copyright: © 2023 Wang et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Cite this article: Wang F, Ni W, Liu S, Xu Z, Qiu Z, Wan Z. 2023. A 2D image 3D reconstruction function adaptive denoising algorithm. PeerJ Computer Science 9:e1604 https://doi.org/10.7717/peerj-cs.1604

The authors have chosen to make the review history of this article public.

Abstract

To address the issue of image denoising algorithms blurring image details during the denoising process, we propose an adaptive denoising algorithm for the 3D reconstruction of 2D images. This algorithm takes into account the inherent visual characteristics of human eyes and divides the image into regions based on the entropy value of each region. The background region is subject to threshold denoising, while the target region undergoes processing using an adversarial generative network. This network effectively handles 2D target images with noise and generates a 3D model of the target. The proposed algorithm aims to enhance the noise immunity of 2D images during the 3D reconstruction process and ensure that the constructed 3D target model better preserves the original image’s detailed information. Through experimental testing on 2D images and real pedestrian videos contaminated with noise, our algorithm demonstrates stable preservation of image details. The reconstruction effect is evaluated in terms of noise reduction and the fidelity of the 3D model to the original target. The results show an average noise reduction exceeding 95% while effectively retaining most of the target’s feature information in the original image. In summary, our proposed adaptive denoising algorithm improves the 3D reconstruction process by preserving image details that are often compromised by conventional denoising techniques. This has significant implications for enhancing image quality and maintaining target information fidelity in 3D models, providing a promising approach for addressing the challenges associated with noise reduction in 2D images during 3D reconstruction.

Introduction

Deep learning techniques have made significant advancements in various fields such as image processing, natural language processing, and network security detection. These techniques have shown promising results in experiments, exhibiting low error rates during training and strong generalization capabilities for test data. However, noisy images in image processing and noise in other types of data can negatively impact the accuracy of deep learning algorithms (Tibi et al., 2021; Giannatou et al., 2019; Zhang et al., 2021; Ye, Li & Chen, 2021; Singh, Mittal & Aggarwal, 2020; Hales, Pfeuffer & Clark, 2020; Zhu et al., 2022). For instance, noise in speech recognition can lead to reduced accuracy in semantic prediction. The ubiquity of noise presents challenges in training deep learning algorithms, as it is difficult to collect pure data for training purposes. Even if the activity being studied is not affected by noise, real-world applications may introduce noisy data due to the environment, which can significantly impact accuracy in detection and processing tasks.There are three main categories of denoising algorithms: spatial domain-based, transform domain-based, and learning-based algorithms. Each category has its own advantages and disadvantages (Li et al., 2022a; Pimpalkhute et al., 2021; Yan et al., 2021; Kazuaki et al., 2022; Zhang et al., 2021). Spatial domain-based algorithms are easy to understand and implement, but they may not perform well in removing strong noise. Transform domain-based algorithms are more effective in handling various types of noise, but they require experience and professional knowledge during processing. Learning-based algorithms can learn data relationships better, but they typically require a large amount of labeled data for training and may suffer from overfitting issues. In addition, the 3D reconstruction of 2D images is a highly researched topic in computer vision and image processing. Modeling techniques can help explain changes in natural images, enabling neural networks to better understand image details. These techniques have practical applications in various computer vision-related fields. For instance, in autonomous driving systems, converting 2D camera images into 3D can help estimate scene depth. In the medical imaging field, it can assist with on-site diagnosis and simulation training (Gao & Yuille, 2017; Sisniega et al., 2021; Li et al., 2022a; Zhang, Cui & Ding, 2021; Sun, 2021; Yu et al., 2021; Svahn et al., 2021; Wu et al., 2021). However, noise in 2D images can pose challenges in the 3D reconstruction process. It may result in incomplete reconstruction of target details or even mistaken noise points for features, adversely affecting target recognition. This highlights the need for effective denoising algorithms to improve the accuracy and quality of 3D reconstructions.

In this study, an adaptive denoising algorithm for the 3D reconstruction function of 2D images is proposed, utilizing Generative Adversarial Networks (GANs) as a neural network model trained on adversarial learning data. The main objective is to generate noise-free images realistically, even in the presence of noise. The project involves preparing a noise-free image generator using GANs, which reproduces the image accurately despite the presence of noise. In this approach, noise generators are introduced and trained using the noise-free image generator as a reference. Distribution and transformation constraints are incorporated into the noise generator function, guiding it to capture specific noise components effectively. This ensures that the method can adaptively learn the noise-free image generator, even when training images contain significant amounts of noise. To preserve target information effectively, this study constructs a 3D model of the target region. A two-dimensional monocular image is utilized as input, combined with target information, and a confidence factor is introduced to improve the preservation of target details from the original image. Through experimental evaluations, the algorithm demonstrates stable preservation of image details. The reconstruction effect is tested on noisy 2D images, and the reconstruction of the 3D model is tested on noisy pedestrian datasets. The results show an average noise reduction rate exceeding 95%, with the 3D model effectively retaining most of the feature information from the original image. In summary, this study proposes an adaptive denoising algorithm for 3D reconstruction using GANs. By effectively reducing noise and preserving target details, the algorithm offers promising results in improving the quality and accuracy of image reconstruction. It introduces innovative techniques to handle noise and enhance the fidelity of reconstructed 3D models from 2D images.

Generating Adversarial Networks

Goodfellow proposed the Generative Adversarial Network (GAN), which employs two convolutional neural networks in a game-based training approach to generate images resembling the original picture (Ozkanoglu & Ozer, 2022; He, Wandt & Rhodin, 2022; Iqbal & Ali, 2018; Kumar et al., 2022; Zhao, Wei & Wong, 2022). The GAN model consists of two essential components: the generator and the discriminator.

In Fig. 1, the structure of the generative adversarial network is depicted. The generator is responsible for generating new data that is similar to the given data. Its objective is to create samples with high resemblance to real data. On the other hand, the discriminator’s role is to determine the authenticity of the data generated by the generator. It aims to differentiate between real and generated samples accurately. During the training process, the generator and discriminator interact with each other, creating a competitive feedback loop. This allows the generator to progressively improve its ability to generate data that appears genuine, while the discriminator becomes more skilled at distinguishing between real and generated data. The GAN model facilitates the generator in learning the skills necessary to produce high-quality and realistic data, while also enhancing its generation capability. This interplay between the generator and discriminator enables the GAN to generate novel, authentic-like data that closely resembles the original picture.

Figure 1: GAN structure.
GAN is a deep learning model consisting of two parts: the generator, which is used to generate new data similar to the given data, and the discriminator, which is used to judge the authenticity of the data generated by the generator. The two parts interact with each other through training, allowing the generator to continuously learn the skills of generating real data, and improving the generator’s generation ability.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-1

GAN offers several key advantages, including:

High-quality generation: GAN’s generator produces samples of superior quality that closely resemble actual data. This enables GAN to generate highly realistic and even imaginative data, making it applicable in various practical scenarios.

Scalability: GAN can be trained using different types of data, such as images, text, audio, video, and 3D models. This makes it versatile and applicable across multiple fields, providing flexibility for various applications.

Unsupervised learning: GAN operates through unsupervised learning, meaning it doesn’t require labeled data for training. This makes it more universal since it can work with unlabeled datasets without the need for extensive data labeling.

Ability to learn data distribution: GAN can simulate and learn the underlying distribution of the data it processes. This capability is valuable for data reconstruction and image generation tasks. The generator in GAN not only generates new samples but can also predict new data distributions, which is crucial in the field of data science.

High effectiveness and training efficiency: GAN training is highly efficient as the generator and discriminator are trained simultaneously in a competitive manner. This allows GAN to converge rapidly, leading to efficient and effective training processes.

Article Algorithm

Region noise reduction

In this study, we incorporate the intrinsic visual characteristics of the human eye, taking into account the entropy value of the image, which accurately reflects the image signal (Pulgar et al., 2021; Zheng et al., 2021). However, in practical computer processing, calculating the entropy value can be computationally intensive. Therefore, we propose a simplified calculation process as follows.

Assume that the information level of the original image is L. The number of targets with the information i is ni. The total number of pixels of the image is N. The probability of occurrence of each target can be obtained as Pi. then we have Pi = ni/N. In the image segmentation algorithm, a threshold λ is used to classify the image information level into two classes. The target class Co and the background class Cb. The target part f(x, y) ≥λ and the background part f(x, y) < λ. Thus, the image is effectively segmented into subsets that do not overlap. Thus, the ratio of its target to background occurrences is: (1) $P_{b} = \sum_{i = 0}^{λ} P_{i} and P_{f} = \sum_{i = λ + 1}^{L - 1} P_{i}$ (2) $Target mean: μ_{b} (t) = \frac{\sum_{i = 0}^{λ} i P_{i}}{P_{b} (t)}$ (3) $Background mean : μ_{f} (t) = \frac{\sum_{i = λ + 1}^{L - 1} i P_{i}}{P_{f} (t)}$

The average value of the information in the whole image is: $μ = \sum_{i = 0}^{L - 1} i P_{i}$

Therefore, the inter-class variance of the image is obtained according to the inter-class variance formula: (4) $σ_{B}^{2} (t) = P_{b} (t) {[μ_{b} (t) - μ]}^{2} + P_{f} (t) {[μ_{f} (t) - μ]}^{2}$ (5) $Simplifying, we get: σ_{B}^{2} (t) = P_{b} (t) [1 - P_{b} (t)] {[μ_{b} (t) - μ_{f} (t)]}^{2}$

To make the article algorithm can better cope with different background images. In this article, the background of its image is analyzed for complexity. Suppose H_ij is the local neighbourhood entropy centred on (i, j). The expression of its function is shown as follows: (6) $Local neighbourhood entropy: H_{i j} = - \sum_{i = 1}^{m} \sum_{j = 1}^{n} P_{i j} lg P_{i j}$

Where: m ×n is the size of a local neighbourhood. P_ij is the probability of the target distribution at point (i, j).

Figures 2 and 3 are taken as samples. In order to better reflect the complexity of the image background. In this article, the neighbourhood entropy is converted into a background factor between (0, 1) by establishing an affiliation function, which is shown as follows: (7) $The affiliation function of the background factor : K_{i j} = \frac{H_{i j} - H_{m i n}}{H m i n_{m a x}}$

Figure 2: Segmentation area map.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-2

Figure 3: Segmentation target map.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-3

The background factor of the image background is finally obtained, and the target area is finally targeted.

Noise reduction processing

Background area processing

Let the image signal processed for noise reduction be I = f + n, where f is the image signal. n is the noise signal. In this article, the noise reduction function of the image is obtained by improving the soft threshold denoising algorithm. The expression of the function is defined as follows: (8) ${\hat{C}}_{ij} = \{\begin{matrix} μ \cdot sgn (C_{i, j}) (| C_{i, j} | - λ) \\ 0 \end{matrix} \begin{matrix} , | C_{ij} | \geq λ \\ | C_{ij} | < λ \end{matrix}$

Where $sgn (n) = \{\begin{matrix} 1 & , n > 0 \\ - 1 & , n \leq 0 \end{matrix} . μ$ is the weight constant. λ Is the threshold value. Although the selection of the threshold value directly affects the noise reduction process of the image. Considering the region’s more practical information, it is easy to cause the “overkill” phenomenon. In this article, the weight μ is defined. The value of μ is 0.6 and is substituted into the image noise reduction formula. Then the formula is as follows: (9) ${\hat{C}}_{ij} = \{\begin{matrix} 0.6 sgn (C_{i, j}) (| C_{i, j} | - λ) \\ 0 \end{matrix} \begin{matrix} , | C_{ij} | \geq λ \\ | C_{ij} | < λ \end{matrix}$

Target area processing

Confidence processing.

This study proposes a method to introduce a confidence factor in GAN. The confidence data is obtained by combining the data generated by the generator with the data in the discriminator.

As shown in Figs. 4 and 5, the ground truth of S is calculated from the 2D points x_j,k and x_i,k annotated in the image. where x_j₁,k and x_j₂,k denote the two key points j₁ and j₂ corresponding to the real pixel points of a person k in the figure.

Figure 4: Target key point extraction.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-4

Figure 5: 2D target pose extraction.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-5

If a pixel point C_real is located on this target node. $L_{c, k}^{*} (C_{real})$ the valued table is a unit vector from a key point j₁ to key point j₂. The corresponding vector is a zero vector for pixel points not on the torso. Then the values of $L_{c, k}^{*} (C_{real})$ Are as follows: (10) $L_{c, k}^{*} (C_{r e a l}) = \{\begin{matrix} v i f C_{r e a l} o n c, k \\ 0 o t h e r w i s e \end{matrix}$

where $v = \frac{(x_{j_{2}, k} - x_{j_{1}, k})}{| x_{j_{2}, k} - x_{j_{1}, k} |_{2}}$ Denotes the unit direction vector corresponding to this torso.

The target pixel point satisfies the following function: (11) $0 \leq v (C_{r e a l} - x_{j_{1}, k}) \leq l_{c, k} a n d | v_{⊥} (C_{r e a l} - x_{j_{1}, k}) | \leq σ_{l}$

where the inner table σ_l shows the distance between pixel points. The torso length is l_c,k = |x_j₂,k − x_j₁,k|₂ and $v_{⊥}$ Denotes the vector perpendicular to v.

Generator and discriminator

We optimize the loss L_MSE of the generator and the adversarial loss L_adv of the discriminator. (12) $L_{M S E} = \sum_{i = 1}^{N} \sum_{j = 1}^{M} {(C_{i j} - {\hat{C}}_{i j})}^{2}$ (13) $L_{a d v} = \sum_{i = 1}^{N} {({\hat{C}}_{j} - D ({\hat{C}}_{j}, X))}^{2}$ (14) $L_{G} = L_{M S E} + λ L_{a d v}$

The primary objective of the discriminator is to discern whether a given heat map is real or fake, generated by the generator (Vo et al., 2021; Dong et al., 2021; Li et al., 2022b; Lu & Su, 2021; Luo et al., 2021). To accomplish this, we optimize the loss function of the discriminator to improve its ability to differentiate between real and fake heat maps. The optimization process aims to enhance the discriminator’s discriminative capabilities and make it more effective in accurately identifying the authenticity of the input heat maps. By fine-tuning the loss function, we aim to facilitate the discriminator in becoming increasingly proficient at recognizing real and generated heat maps. (15) $L_{r e a l} = \sum_{j = 1}^{N} {(C_{j} - D (C_{j}, X))}^{2}$ (16) $L_{noiseless} = \sum_{j = 1}^{N} {({\hat{C}}_{j} - D ({\hat{C}}_{j}, X))}^{2}$ (17) $L_{G} = L_{r e a l} + k_{t} L_{noiseless}$ (18) $k_{t + 1} = k_{t} + λ_{k} (L_{r e a l} - L_{noiseless}) .$

The kt in the above equation is used to constrain the capability of the resolver.

As shown in Fig. 6, it can be seen that all components in this network, including the confidence map, are learned from the image only. In processing this deep neural network, the feature factors are derived from the original image. These feature factors are mapped to the image with depth information in a recombination manner to construct a new image.

Figure 6: 3D target pose extraction.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-6

If the expected target is irrelevant for noisy samples, the pose in this image is not credible for the body construction. Therefore, if the information is closer to the target information, the corresponding vector is 1, and vice versa, the corresponding vector is 0. The formula is as follows: (19) $C_{r e a l} = \{\begin{matrix} 1 i f ||K_{i j}|| < τ \\ 0 i f ||K_{i j}|| \geq τ \end{matrix}$

K_ij represents the value of the affiliation function for the background factor.

The system framework is shown in Fig. 7.

Figure 7: Framework diagram of the algorithm.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-7

Simulation

In this study, the experimental platform utilized Ubuntu 18.04 as the operating system. Anaconda was utilized as the software platform, while PyTorch served as the deep learning framework. In order to assess the practicality of the algorithm, real pedestrian videos captured by the researchers were used, with noise intentionally added to the images. The model was trained and tested on a high-performance server equipped with a 2080 × 4 GPU. A GAN-based image generator, capable of producing noise-free images, was trained and evaluated using the noisy dataset. The noise removal rate was measured, and comparable experiments were conducted with other algorithms under the same experimental conditions. Data comparison was performed to evaluate the performance of the proposed approach against the comparison algorithms.

In order to validate the denoising effectiveness of the algorithms proposed in this article, several traditional and literature algorithms are selected for comparison (Chen & Han, 2005; Hu et al., 2021; Deng et al., 2020; Frazier-Logue & José Hanson, 2020). The simulation process involves collecting data for each algorithm and comparing them. The equations used for comparison are based on observing the signal-to-noise ratio (PSNR) values of each algorithm. (20) $Signal-to-noise ratio: P S N R = 10 log (\frac{\sum_{i = 1}^{N} x^{2} i}{\sum_{i = 1}^{N} {(x [i] - \hat{x} [i])}^{2}}) .$

As depicted in Fig. 8, it is noticeable that the denoising algorithm proposed in this study effectively eliminates noise while preserving the edge and texture detail features present in the original image. The resulting image exhibits high visual quality, indicating the algorithm’s ability to retain crucial visual information despite noise removal.

Figure 8: 3D model and denoising effect of the algorithm.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-8

The objective of this article is to validate the effectiveness of the proposed method, particularly in terms of successfully denoising images while preserving target information. The superiority of the algorithms is assessed by comparing their PSNR values and time consumption. Through the experiments, the table shows the comparative PSNR values of the results obtained by different denoising algorithms.

Table 1 demonstrates that the proposed algorithm outperforms other denoising algorithms in terms of PSNR values across various noise factors. It consistently exhibits better performance compared to alternative denoising techniques. The algorithm showcases a significant difference of up to 3.6 dB compared to the literature algorithm 3, certifying its reliability. The evaluation index performance is particularly commendable at a noise variance of σ = 50. Furthermore, assessing the image metrics at different variances reveals a decrease in PSNR values as the noise variance increases. Despite this decrease in performance, the proposed algorithm still maintains a superior denoising effect compared to other algorithms. This comprehensive investigation effectively substantiates the image fidelity achieved by the proposed algorithm.

Table 1:

Data table of PSNR values for each denoising algorithm.

Table 1 shows this algorithm outperforms other denoising algorithms regarding PSNR values under different noise factors. It offers better performance than other denoising techniques

σ	PSNR dB
	Noisy images	Traditional algorithms	Literature algorithm 1	Literature algorithm 2	Literature algorithm 3	Article algorithm
10	26.0	29.6	31.8	35.8	38.2	40.1
20	24.1	27.1	27.9	34.8	38.2	38.6
30	22.1	25.9	27.5	34.1	34.7	38.1
40	18.3	25.8	26.5	32.1	33.3	34.0
50	15.8	25.2	23.8	31.9	30.8	32.8

DOI: 10.7717/peerjcs.1604/table-1

In the process of denoising, it is inevitable that some image information may be lost and residual noise information may still remain. Consequently, the grayscale values of certain pixels in the image may change accordingly. This presents an opportunity to evaluate and compare the denoising effects of various methods based on the grayscale histogram of the image.

Figure 9A displays the original image along with its grayscale histogram, while Fig. 9B depicts the grayscale histogram of the image after adding noise. By comparing the histograms of different denoising algorithms, it can be observed that the histograms of the literature algorithm 3 and the proposed algorithm in this article closely resemble the histogram of the original image. Furthermore, among these algorithms, the proposed algorithm in this article preserves the image details to the highest extent. This indicates that the proposed algorithm achieves the most effective denoising results, as it successfully retains the original image’s characteristics and minimizes the impact of noise on the image. Thus, based on the comparison of histograms, it can be concluded that the algorithm proposed in this article offers the most favorable denoising effect.

Figure 9: Histogram of each algorithm.
(A) Original image; (B) noisy image; (C) traditional algorithm; (D) literature algorithm 1; (E) literature algorithm 2; (F) literature algorithm 3; (G) algorithm of this article.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-9

In order to verify the effectiveness of the proposed algorithm in denoising and preserving target information in images, a comparative experiment is conducted. The experiment focuses on advanced denoising of targeted image information. The results of this comparison experiment are presented in Fig. 10, showcasing the denoising effects achieved by the different algorithms.

Figure 10: Comparison of denoising details of each algorithm.
(A) Original image; (B) noisy image; (C) traditional algorithm; (D) literature algorithm 1; (E) literature algorithm 2; (F) literature algorithm 3; (G) algorithm of this article.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-10

Through a comparison of the denoising effects achieved by different algorithms, it is evident that the algorithm proposed in this article is capable of preserving the target signal of the image while effectively denoising the noisy image. This results in a visually superior outcome with a more precise overall impact compared to the original image. Upon closer inspection of the key details in the denoising effect of each algorithm, it is apparent that the algorithm proposed in this article holds a significant advantage in preserving target details. The denoising results align with the intended experimental objectives, validating the effectiveness of the proposed algorithm. While the literature algorithm 3 can achieve a similar effect to the original image, it often leads to an excessively subdued background. In contrast, the algorithm proposed in this article presents better denoising results for both the target and the background, with the clearest details. This effectively showcases the strengths of the algorithm proposed in this article in the field of denoising.

This article also highlights the efficiency of the proposed method in eliminating image noise. The favorable denoising performance can be primarily attributed to the utilization of GAN within the optimization algorithm. The time efficiency of the proposed method is demonstrated in Fig. 11. It is evident that the algorithm proposed in this article maintains a good balance between denoising performance and time efficiency. Although there is a slight increase in denoising efficiency compared to the literature algorithm, the algorithm in this article operates with a time efficiency difference of 0.153s. This signifies that the proposed algorithm effectively achieves better denoising results while still maintaining a reasonable level of computational efficiency.

Figure 11: Denoising efficiency graph of each algorithm.

Download full-size image

DOI: 10.7717/peerjcs.1604/fig-11

Conclusion

We proposed the design of a noise-free image generator based on GAN, which is a neural network trained using adversarial learning to model data distribution. The primary objective of this project was to develop a noise-free image generator that can produce high-fidelity images despite the presence of noise. To achieve this, we combined the inherent visual properties of the human eye with the entropy value of the input image to divide it into different regions. The background region was subjected to threshold denoising, while the target region undergoes specific processing. Additionally, we constructed a 3D model for the target using an adversarial generation network, starting from the 2D target image with noise. By combining the target information with a confidence factor, we aimed to better preserve the target’s detailed information from the original image. The experimental results demonstrate the algorithm’s ability to efficiently denoise noisy images while retaining the detail signal of the target image, which aligns with the expected outcomes. However, it is important to note that the adversarial generative network often involves complex calculations and requires tedious parameter adjustments, potentially impacting the training timeliness. In future works, we aim to address these limitations by exploring high-speed and effective algorithms that can offer improved efficiency without compromising on the denoising accuracy achieved by the proposed method.

Supplemental Information

Code

DOI: 10.7717/peerj-cs.1604/supp-1

Download

Data

DOI: 10.7717/peerj-cs.1604/supp-2

Download

[1] Chen Y, Han C. 2005. Adaptive wavelet threshold for image denoising. Electronics Letters 41(10):586-587

[2] Dong M, Li H, Yin S, Wu Y, See KY. 2021. A postprocessing-technique-based switching loss estimation method for GaN devices. IEEE Transactions on Power Electronics 36(7):8253-8266

[3] Frazier-Logue N, José Hanson S. 2020. The stochastic delta rule: faster and more accurate deep learning through adaptive weight noise. Neural Computation 32(5):1-15

[4] Gao Y, Yuille AL. 2017. Exploiting symmetry and/or Manhattan properties for 3D object structure estimation from single and multiple images. IEEE Computer Society 2017:6718-6727

[5] Giannatou E, Papavieros G, Constantoudis V, Papageorgiou H, Gogolides E. 2019. Deep learning denoising of SEM images towards noise-reduced LER measurements. Microelectronic Engineering 216:111051

[6] Hales PW, Pfeuffer J, Clark CA. 2020. Combined denoising and suppression of transient artifacts in arterial spin labeling MRI using deep learning. Journal of Magnetic Resonance Imaging 5(52):1413-1426

[7] He X, Wandt B, Rhodin H. 2022. LatentKeypointGAN: controlling images via latent keypoints extended abstract. Computer Vision and Pattern Recognition 2022:1-5

[8] Hu M, Zhang S, Dong W, Xu F, Liu H. 2021. Adaptive denoising algorithm using peak statistics-based thresholding and novel adaptive complementary ensemble empirical mode decomposition. Information Sciences 563:269-289

[9] Iqbal T, Ali H. 2018. Generative adversarial network for medical images (MI-GAN) Journal of Medical Systems 42(11)

[10] Kazuaki K, Ryo I, Shun S, Shibata N, Ikuhara Y. 2022. Atomic-resolution STEM image denoising by total variation regularization. Microscopy 71(5):302-310

[11] Kumar A, Tamboli D, Pande S, Banerjee B. 2022. RSINet: inpainting remotely sensed images using triple GAN framework. IEEE International Geoscience and Remote Sensing Symposium 2022:143-146

[12] Lai YK, Lai YF, Chen YC. 2012. An effective hybrid depth-generation algorithm for 2D-to-3D conversion in 3D displays. Journal of Display Technology 9:154-161

[13] Li Y, Gan Z, Zhou X, Chen Z. 2022b. Accurate classification of Listeria species by MALDI-TOF mass spectrometry incorporating denoising autoencoder and machine learning. Journal of Microbiological Methods 192:106378

[14] Li H, Zhang H, Wan X, Yang Z, Li C, Li J, Han R, Zhu P, Zhang F. 2022a. Noise-Transfer2Clean: denoising cryo-EM images based on noise modeling and transfer. Bioinformatics 38(7):2022-2029

[15] Lu HP, Su CT. 2021. CNNs combined with a conditional GAN for mura defect classification in TFT-LCDs. IEEE Transactions on Semiconductor Manufacturing 34(1):25-33

[16] Luo SH, Wang X, Chen GY, Xie Y, Zhang W-H, Zhou Z-F, Zhang Z-M, Ren B, Liu G-K, Tian Z-Q. 2021. Developing a peak extraction and retention (PEER) algorithm for improving the temporal resolution of Raman spectroscopy. Analytical Chemistry 93(24):8408-8413

[17] Ozkanoglu MA, Ozer S. 2022. InfraGAN: a GAN architecture to transfer visible images to infrared domain. Pattern Recognition Letters 155:69-76

[18] Pimpalkhute VA, Pagea R, Kotharib A, Bhurchandi KM, Kamble VM. 2021. Digital image noise estimation using DWT coefficients. IEEE Transactions on Image Processing PP(99):1

[19] Pulgar FJ, Charte F, Rivera AJ, Jesus MJD. 2021. ClEnDAE: a classifier based on ensembles with built-in dimensionality reduction through denoising autoencoders. Information Sciences 565(3)

[20] Singh G, Mittal A, Aggarwal N. 2020. ResDNN: deep residual learning for natural image denoising. IET Image Processing 14(11):2425-2434

[21] Sisniega A, Stayman JW, Capostagno S, Weiss CR, Ehtiati T, Siewerdsen JH. 2021. Accelerated 3D image reconstruction with a morphological pyramid and noise-power convergence criterion. Physics in Medicine and Biology 66(5)

[22] Sun J. 2021. A 3D image encryption algorithm based on chaos and random cross diffusion. Modern Physics Letters B

[23] Svahn TM, Gordon R, Ast JC, Riffel J, Hartbauer M. 2021. Comparison of photon-counting and flat-panel digital mammmography for the purpose of 3D imaging using a novel image processing method. Radiation Protection Dosimetry 195(3-4):454-461

[24] Tibi R, Hammond P, Brogan R, Young CJ, Koper K. 2021. Deep learning denoising applied to regional distance seismic data in Utah. Bulletin of the Seismological Society of America 111(2):775-790

[25] Vo DM, Nguyen DM, Le TP, Lee S-W. 2021. HI-GAN: a hierarchical generative adversarial network for blind denoising of real photographs. Information Sciences 570:225-240

[26] Wu J, Chen Q, Gui Z, Bai M. 2021. Fast dictionary learning for 3D simultaneous seismic data reconstruction and denoising. Journal of Applied Geophysics 194

[27] Yan Z, Xu X, Wang Y, Li T, Ma B, Yang L, Lu Y, Li Q. 2021. Application of ultrasonic Doppler technology based on wavelet threshold denoising algorithm in fetal heart rate and central nervous system malformation detection. World Neurosurgery 149:380-387

[28] Ye H, Li H, Chen C. 2021. Adaptive deep cascade broad learning system and its application in image denoising. IEEE Transactions on Cybernetics 51(9):4450-4463

[29] Yu T, Meng J, Yang M, Yuan J. 2021. 3D object representation learning: a set-to-set matching perspective. IEEE Transactions on Image Processing 30:2168-2179

[30] Zhang S, Cui S, Ding Z. 2021. Hypergraph spectral analysis and processing in 3D point cloud. IEEE Transactions on Image Processing 30:1193-1206