datahacker.rs@gmail.com

# Category: Deep Learning

### #032 CNN Triplet Loss

In the last post, we talked about Siamese Network, but we didn’t talk how to actually define an objective function to make our neural network learn. So, in order to do that, here we will define Triplet Loss. Triplet Loss One way to learn the parameters of the neural network, which gives us a good encoding for our pictures of faces, is to define and apply gradient descent on the Triplet loss function. Let’s see…

### # 031 CNN Siamese Network

Siamese Network The job of the function $$d$$, which we presented in the previous post, is to use two faces and to tell us how similar or how different they are. A good way to accomplish this is to use a Siamese network. We get used to see pictures of $$convnets$$, like these two networks in the picture below. We have an input image, denoted with $$x^{(1)}$$, and through a sequence of $$Convolutional$$,…

### #030 CNN One-Shot Learning

One – Shot Learning Solving the one-shot problem represents a challenge of face recognition task. This means that for most face recognition applications we need to recognize a person having only a single image or given just one example of that person’s face. Typically, deep learning algorithms don’t work well if there is only one training example. However, we will show how this problem can be tackled.  Let’s say that we have a database of…
YOLO ALGORITHM In this post, we will finish with the theory behind object detection. We will combine last few posts together to complete the $$Yolo$$ object detection algorithm. As the first step, let’s see how we can construct our training set. Let’s suppose that we are trying to train an algorithm to detect three objects: pedestrians, cars and motorcycles. In addition, we will need to explicitly define the background class, so we just have…
Anchor Boxes As we can see from our previous posts, object detection is quite challenging. This is the final challenge that we are going to explain. Then, we will develop a holistic YOLO algorithm. One scenario that we may encounter in practice is that several objects of interest are present in the same grid cell. This is shown in the figure below. In this case, we can use the idea of $$Anchor\enspace boxes$$ to… 