This study aims to develop a deep learning approach to detect C. neoformans in patient samples as an alternative diagnostic method for a timely and reliable diagnosis.
3. Result and DiscussionThis study uses a deep learning framework to detect C. neoformans using microscopic images of India-ink-stained smears of CSF. Images that contain C. neoformans are designated positive, while others are designated negative throughout this section. Before model training, the dataset was split into a 70% training set and a 30% test set. Of the 70% training set, a further 20% was used as a validation dataset. Model training, validation, testing, and evaluation were conducted using frameworks and libraries, including Jupyter notebook, TensorFlow, Keras, Pandas, NumPy, Matplotlib, and Seaborn. The implementation tools and framework used in this study were developed on a personal computer (PC) with Windows 10 Pro, 11th Gen Intel (R) Core (TM) i7-11700KF @ 3.60 GHz (Gigahertz) 3.60 GHz processor, 64.0 GB (Gigabyte) installed RAM (Random Access Memory), 64-bit operating system, and NVIDIA GeForce RTX (Ray Tracing Texel eXtreme) 3070 GPU (graphic processing unit) card to meet the training of deep neural network workload requirements. Python was the programming language of choice throughout the study.
The study proves that the deep learning framework can help detect C. neoformans in the microscopic image of India-ink-stained smears of CSF. The state-of-the-art VGG16 model produced an accuracy and loss of 86.88% and 0.36203, as shown in Table 3. The accuracy and loss values show how the VGG16 model performed in detecting and classifying the images into positive and negative. The accuracy indicates how effective the model is at generalizing unseen data, while the loss indicates fewer errors were made. Loss is the penalty for a wrong decision. It depicts the distance between the true value and the predicted value. The greater the loss, the more enormous the error made by the model on the data. The values of loss range from 0–1, where 0 indicates that the model’s prediction is perfect, while a loss of 1 means the model is making a terrible prediction.When training a machine learning model, one of the main things to avoid is overfitting. This is when the model fits the training data so well that it cannot generalize and make accurate predictions for data it has not seen before (test data). Metrics on the training data indicate how well the model is progressing in terms of its training, but it is the metrics on the validation data that provide the measure of the quality of a model—how well it can make new predictions based on data it has not seen before. Figure 7 shows the accuracy obtained during training and validation. The training and validation accuracy in a typical learning curve is expected to increase with each epoch. The VGG16 model accuracy increased with a corresponding validation accuracy. This generates a good fit void of overfitting and underfitting. The training and validation loss are the terms used to measure how a deep learning model fits the training and validation data. This indicates the performance of the model on the data. As indicated in Figure 8, the training and validation loss indicates how the VGG16 model fits the training and validation data and identifies which aspect needs tuning. The VGG16 generated a relatively stable good fit as the training and validation loss decreased and gradually stabilized. A high loss value usually means the model produces erroneous output, while a low loss value indicates fewer errors in the model. In addition, the loss is generally calculated using a cost function, which measures the error in different ways. Because of the nature of the study (binary classification), we adopted binary cross-entropy.Aside from accuracy and loss, the performance of a deep learning model can be measured using other metrics. These metrics provide a much more robust evaluation of the model. Accuracy alone is not enough to measure a model’s performance [10]. Precision is a machine learning metric that indicates the quality of a positive prediction made by the model. It provides insight into the number of true positives predicted by the model. Table 4 shows the mean performance metrics of the VGG16 model. With a precision of 90.00% and 85.00% for positive and negative images, the VGG16 shows its capability to effectively identify and classify smear images containing C. neoformans and those without it. The sensitivity of 83.00% and 91.00% for both positive and negative images indicate the model’s ability to correctly predict the proportion of true positives that are correctly predicted.Furthermore, the F1 score of 86.00% and 88.00% for positive and negative images indicate the harmonic mean of precision and recall. It combines precision and recalls into a single number using the following formula. A confusion matrix prints the correct and incorrect values in the number count. It provides an understanding of data visualization and gives insight not only into the errors made by a classifier but, more importantly, the types of errors being made. Figure 9 shows the confusion matrix of the VGG16 model. The VGG16 model correctly detects and classifies 245 images, 114 of which were positives, while 131 were negatives. This indicates good precision and the applicability of deep learning frameworks for the detection of C. neoformans in smear images. However, 37 images were misclassified, of which 24 were falsely classified as positives and 13 as false negatives. The performance evaluation metrics generated substantive outcomes that can aid the rapid detection of C. neoformans and aid in managing immunocompromised patients.We compared the performance of our model with two state-of-the-art pre-trained models—ResNet50 and InceptionV3. ResNet-50 is a convolutional neural network that is 50 layers deep. It is a smaller version of ResNet 152 and uses a deeper network to avoid poor accuracy. Each convolution block has three convolution layers, and each identity block has three convolution layers. The ResNet50 has over 23 million trainable parameters [34]. InceptionV3, also called GoogleNet, is CNN architecture from the Inception family that makes several improvements, including label smoothing, factorized 7 × 7 convolutions, and an auxiliary classifier to propagate label information to lower the network. The InceptionV3 is a superior version of the InceptionV1. It has 42 layers and a lower error rate than its predecessors [35].Compared with the two state-of-the-art pre-trained models, the VGG16 model significantly outperformed them with accuracy and loss of 86.88% and 0.36203, as shown in Table 5. Additionally, the model correctly classifies 245 images as positive and negative. In contrast, the ResNet50 and InceptionV3 correctly classify 197 and 235 images. Furthermore, ResNet50 and InceptionV3 misclassify 85 and 47, respectively. In retrospect, the VGG16 only misclassifies 37 images, as shown in Figure 10.Conventional diagnostic methods for the identification of Cryptococcus species have been used widely in the past. However, some disadvantages of these techniques caused delayed treatment and increased mortality. Several studies reported the disadvantages of cryptococcal cultures; living cryptococcal cells in CSF samples are required [8], samples taken from patients under systemic antifungal treatment require a longer incubation period [3], and positive culture results should be confirmed by patients’ clinical findings [3]. Although serologic tests for the detection of cryptococcal antigens are meaningful in laboratories with inadequate medical equipment, cryptococcal antigen detection of blood serum sensitivity and specificity are reported as 83–100% and 72–100%, respectively. In CSF samples, the sensitivity of serologic tests was within 80–100%, while specificity was found to be within 82–100% [8]. The detection of C. neoformans from respiratory system samples of diagnosed pulmonary cryptococcosis cases by multiplex RT PCR showed 90.7% sensitivity and 100% specificity [3]. Huston and Mody reported that the low fungal load and prozone effect could lead to false negative results in latex agglutination tests, where rheumatoid diseases; the presence of some other microorganisms, such as Trichosporon beigelii; the effect of some chemicals, such as disinfectants; and an extended waiting period of serum samples can lead to false positive results [36].In recent years, opportunistic fungal infection incidence has increased dramatically due to the frequent use of broad-spectrum antibiotics and immunosuppressive medications [37]. Cryptococcus spp. is an opportunistic fungus, susceptible to polyenes, flucytosine, and azoles [38]. Polyenes decrease the ergosterol content of the plasma membrane, while azoles inhibit ergosterol biosynthesis, and flucytosine blocks DNA synthesis [39]. In some conditions, lifelong therapy is required [35].Usually, combined Amphotericin B, flucytosine, and fluconazole therapy are applied against CM, which is effective in immunocompetent populations [37]. However, the excessive usage of antifungals in agriculture and medicine induced the emergence of antifungal-resistant strains of Cryptococcus spp., which is one of the major difficulties in CM treatment [38].Besides the resistance challenges, these drugs are expensive, and various side effects, such as toxicity, are reported [37]. Therefore, cryptococcosis is still a public health concern, and new antifungal drug developments or other therapeutic strategies are required [39].The International Treatment Preparedness Coalition (ITPC) published a global strategic plan, ‘Ending Cryptococcal Meningitis Deaths by 2030’, which aims to reduce CM-related deaths by 90% by 2030 from the 2020 baseline [40]. This goal can only be achieved if diagnosis, treatment, and preventive screening programs are implemented immediately [7]. Machine learning applications in microbiology are promising for accurate and timely diagnosis, and the state-of-the-art VGG16 model, which is applied for the first time in this study, showed similar accuracy (86.88%) to some of the diagnostic methods that have been in practice for many years.Species–specific fungal diagnosis needs further diagnostic tests after the detection of yeast-like fungi under the microscope. Usually, cultivation, biochemical tests, molecular tests, and sequencing are employed for species–specific identification. However, these techniques prolong the time period required for the results and are expensive. On the other hand, our CNN method, which can detect C. neoformans based on basic microbiological staining (India ink) of the patients’ samples, enables the diagnosis only in a few minutes [41]. 4. ConclusionsCryptococcosis was considered an uncommon disease before the frequent use of immunosuppressive therapy and the emergence of the HIV/AIDS pandemic. The dramatic increase in the incidence of Cryptococcus spp. also increased the interest of researchers to understand the morphology, pathogenesis, diagnosis, and treatment strategies of this fungi [10].Research in machine learning is evolving rapidly, and applications in microbiology evolved this field to a new era [10]. This study is a pioneer in the literature, since it is the only study that is designed to directly detect C. neoformans in India-ink-stained smears of CSF samples collected from patients. The preliminary results of this study demonstrate that deep learning frameworks can provide an effective and accurate choice for C. neoformans detection, thereby leading to early diagnosis and subsequent treatment. The study’s outcome also demonstrates that with minimal training and a small test dataset, an accuracy of 86.88% was achieved by the VGG16 model. At the same time, other metrics, including precision, sensitivity, and F1 score evaluated, show the reliability of the result obtained.Deep learning methods, especially CNN, have shown human-level performance in the case of large amounts of training data; however, since the microscopic examination is not the only diagnostic method for the diagnosis of C. neoformans, the microscopic image datasets are limited. The lack of fungal image libraries also makes the data collection process difficult. Therefore, the photographic documentation of the C. neoformans images can be useful to obtain high quality images and many of them, and we can create a microscopic image dataset.
Further studies should include more and higher quality images to eliminate the limitations of the adopted deep learning model.
Comments (0)