The task of identifying an object using MATLAB is a simple example of deep learning using MATLAB. We will be using a deep neural network named AlexNet to identify the object in any image. Let us first see what a deep neural network is and how it works. Then let's see how it is implemented in our code using MATLAB.
What is Deep Learning?
Deep learning is a type of machine learning in which a model learns to perform classification tasks directly from images, text, or sound. Deep learning is usually implemented using a neural network architecture. The term "deep" refers to the number of layers in the network—the more layers, the deeper the network. Traditional neural networks contain only 2 or 3 layers, while deep networks can have more than 3.
Inside a Deep Neural Network
A deep neural network combines multiple non-linear processing layers, using simple elements operating in parallel. It is very much similar to our human biological system. It consists of an input layer, several hidden layers, and an output layer. The layers are interconnected through nodes, or neurons, with each hidden layer using the output of the previous layer as its input.
How does a neural network work?
Let's say we have a set of images where, each image contains one of three different categories of objects like a cat, dog, parrot, etc. We want the deep learning network to automatically recognize which object is in each image. We separate it into training data and testing data. From the input layer, we pass onto feature extraction layers that identify the unique feature present in each category. For example, a beak is a unique feature for the parrot. Each layer in the network takes in data from the previous layer, transforms it, and passes it on. After extracting the features, we move onto the next few layers which are classification layers. After performing the classification of training data, we then move onto the classification of testing data. That's how the images are classified using the previously learned data from the testing set.
If you're new to deep learning, a quick and easy way to get started is to use an existing network, such as AlexNet. AlexNet is a neural network that has already been trained on a million images. Hence, it can directly be used for testing. AlexNet is most commonly used for image classification. It can classify images into 1000 different categories, including keyboards, computer mice, pencils, and other office equipment, as well as various breeds of dogs, cats, horses, and other animals.
You can use AlexNet to classify objects in any image. In this example, we'll use it to identify the object in an image from a webcam installed on a desktop. In addition to MATLAB, we'll be using the following:
Let us now see the coding part and the results obtained. First, we start with clear statements for clearing the windows and to start new every time we run the code.
Next, we connect to the camera by naming it as a webcam to access the webcam.
Then we load the AlexNet which is already pre-trained and store it in nnet.
Then we take a snapshot using the webcam and store it in a picture using the snapshot function.
Next, we have that picture in figure 1, and the image() function displays it.
Then, we resize the picture to [227,227] since the AlexNet has been trained with images of the size [227,227] and our image must be compared with them. It will be easier to compare if the images are of the same dimensions.
Next, we use the classify command to find the category to which the image belongs and we store that in variable 'label'.
Finally, we use the label as the title for the image and that is displayed in uppercase characters. We also clear camera so that webcam is released at the end of execution.
Now, we can run the program. When we run the program, the webcam will be on automatically and keep the object to be captured near the webcam. Then we would get the image displayed along with its identified label. Here there are 2 categories where the program has identified the sunglasses and the coffee mug correctly.
This task can be extended to various other wide-ranging applications in today's life. For example:
- When an image of a pedestrian crosswalk or a speed breaker is identified by a neural network, a self-driving vehicle slows down.
- When an ATM can identify the image of an illegal note, it will automatically reject it.
- A smartphone app instantly identifies a language and gives a translation of the foreign language.
This image recognition is especially well-suited to identification applications such as face recognition, text translation, and advanced driver assistance systems, including, lane classification and traffic sign recognition.
Did you find some helpful content from our video or article and now looking for its code, model, or application? You can purchase the specific Title, if available, and instantly get the download link.
Thank you for reading this blog. Do share this blog if you found it helpful. If you have any queries, post them in the comments or contact us by emailing your questions to [email protected]. Follow us on LinkedIn Facebook, and Subscribe to our YouTube Channel. If you find any bug or error on this or any other page on our website, please inform us & we will correct it.
If you are looking for free help, you can post your comment below & wait for any community member to respond, which is not guaranteed. You can book Expert Help, a paid service, and get assistance in your requirement. If your timeline allows, we recommend you book the Research Assistance plan. If you want to get trained in MATLAB or Simulink, you may join one of our training modules.
If you are ready for the paid service, share your requirement with necessary attachments & inform us about any Service preference along with the timeline. Once evaluated, we will revert to you with more details and the next suggested step.
Education is our future. MATLAB is our feature. Happy MATLABing!