dragon tiger casino online

Neural network validation

This use case highlights how the Saimple tool offers healthcare professionals the ability to understand the origin of the clinical decision made by the artificial intelligence

Pneumonia prediction


Pneumonia is an infection affecting part of the lungs. This disease is the leading cause of death in children under 5 years of age according to the World Health Organization (WHO). In fact, approximately 1.4 million children die of pneumonia each year. To detect this disease, doctors perform a physical examination on patients with symptoms. They then perform a chest x-ray and make a diagnosis. However, in third-world countries, the tools needed to diagnose this disease are scarce and diagnoses are therefore often inaccurate. The objective of this case study is therefore to help doctors to have a faster and more accurate diagnosis. Today, data collection is standardized but diagnostic applications are rare in the medical field. The interest is to develop an automated system using artificial intelligence, where Saimple, a tool developed by Numalis, would allow health professionals to understand the origin of the model's clinical decision.

Description of the dataset

This case study uses a dataset from a Kaggle competition:

This dataset contains about 6000 images of chest radiographs of children under 5 years old. The images are labeled according to two classes "NORMAL" and "PNEUMONIA". It should be noted that in a radiograph, clarity corresponds to the black color and opacity to the white color. It is important to remember that pneumonia is detected on a chest x-ray when an abnormal accumulation of fluid in the lungs is visible. To identify it, look for areas of opacity, specifically in the lung parenchyma (functional tissue of the lung). For example, in the x-rays below, the one on the right shows that water has accumulated in one of the lungs, while the one on the left does not.

dragon tiger casino onlineLiên kết đăng nhập


Visualization of the dataset

dragon tiger casino onlineLiên kết đăng nhập

Proportion of classes

Before starting the analysis, it is important to visualize the proportion of images in each class and in each dataset.

dragon tiger casino onlineLiên kết đăng nhập

Data augmentation

In order to balance the dataset, we will use data augmentation, which is an effective technique to increase the number of images in the underrepresented class. To do this, several images from the underrepresented class are selected and will be transformed to create new images. The transformation can correspond to: - Rotate the image randomly, - Resize vertically or horizontally, - Crop the image, - Zooming, - Fill in pixels (i.e. fill in missing spaces in the resized image). However, in reality, the images must meet medical quality criteria: - Symmetry, - Penetrance, - Centering, - Deep inspiration and clearance of the scapulae. Thus, it is possible to determine that image rotation is not included in the data augmentation. The four X-rays below are an example of data augmentation. Indeed, from a single image, four new ones have been generated by cropping and filling the pixels of the missing areas.

dragon tiger casino onlineLiên kết đăng nhập


Explanation of the model

For this case study, the prediction model is a convolutional neural network. It is a particular type of neural network used to process image data. The output value of this model is the probability of belonging to a class ("NORMAL" or "PNEUMONIA").

dragon tiger casino onlineLiên kết đăng nhập

Using of Saimple

Saimple is a neural network analysis tool offering the possibility to automatically measure and extract robustness and explainability elements of models. In this use case, Saimple allows identifying the areas of the image that were important in the prediction of the model and thus make it more understandable for health professionals.


Before the Saimple tool can be used, it is necessary to resize the input images of the model. As can be seen below, in order to standardize the size of the radiographs, the radiograph has been resized and this transformation can be easily visualized.

dragon tiger casino onlineLiên kết đăng nhập

Relevance analysis

Saimple identifies the important pixels that allowed the model to classify the image. A pixel is considered important when a value called "relevance" is higher than the average. The more this value tends towards red or blue, the more the pixel is considered important. As a reminder, the relevance of an image classifier represents the influence of each pixel on the calculation of the model of the "NORMAL" class. In the example below, the red pixels of the relevance correspond to the pixels of the input image that have positive effects on the membership of the output class.