Deep Learning 101
top of page

Deep Learning 101

Aivia uses a wide range of AI techniques (e.g. random forest, support vector machines and deep learning) to solve image-related tasks. For more demanding applications, Aivia uses a specific type of deep learning, known as fully convolutional neural networks (CNN).

CNNs are particularly well-suited for capturing non-linear relationships between large volumes of paired image sets (e.g. raw image and manually annotated image in the case of image segmentation tasks) thus allowing for a level of accuracy that rivals human experts (1,2).


A neural network is composed of multiple artificial "neurons" organized in interconnected layers. Similarly to their biological counterparts, the neurons in artificial neural networks respond to specific stimuli, image patterns or features.



When stimulated, a neuron (both real and artificial) affects connected neurons in deeper layers until the output layer is reached where a prediction is made.




By comparing the prediction to the ground truth an error is calculated. The network is updated to minimize the error before a new round of learning (i.e. feeding a new input image to the CNN) takes place.


Initially, all neuron connections have equivalent weights and biases but during the process of learning the neuron paths that produce lower error values strengthen (i.e. connection weights are increased) while the ones with higher error values weaken.



The key processes responsible for learning in CNNs are gradient descent and back-propagation. Gradient descent helps determine how the weights and biases that govern the connections between different layers should be adjusted to minimize error.



Back-propagation implements the changes suggested by gradient descent, thus spreading the updates across the connected elements in the network.



The trained deep learning model includes the entire network with optimized weights and biases. It can be applied to "out of network" images (i.e. images that have not been used for training) similar to the ones used for training.



References

  1. Esteva et al., Nature, 2017

  2. Zeng, Wu and Ji,Bioinformatics, 2017



bottom of page