MITB Banner

How To Create Your first Artificial Neural Network In Python

Share

All machine Learning beginners and enthusiasts need some hands-on experience with Python, especially with creating neural networks. This tutorial aims to equip anyone with zero experience in coding to understand and create an Artificial Neural network in Python, provided you have the basic understanding of how an ANN works.

Prerequisites

  • Basic understanding of Artificial Neural Network
  • Basic understanding of python language

Before dipping your hands in the code jar be aware that we will not be using any specific dataset with the aim to generalize the concept. The codes can be used as templates for creating simple neural networks that can get you started with Machine Learning.

Neural Network in Python

We will use the Keras API with Tensorflow or Theano backends for creating our neural network.

Installing libraries

Theano

pip install --upgrade --no-deps git+git://github.com/Theano/Theano.git

Tensorflow

pip3 install --user --upgrade “tensorflow_url for specific to the environment”

Find the tensorflow_url here

Notes:

Requires a 64 bit Architecture

Keras

pip install --upgrade Keras

By default Keras uses Tensorflow backend. If you feel the need to use Theano backend instead, locate the file /home/user/.keras/keras.json (or %USERPROFILE%/.keras/keras.json in windows) and replace the line "backend": "tensorflow" with "backend": "theano"

If the file is not present, create the file in the same location and add the following lines :

{
"image_data_format": "channels_last",
"epsilon": 1e-07,
"floatx": "float32",
"backend": "theano"
}

Let’s get coding :

Import the  libraries

import keras
from keras.models import Sequential
from keras.layers import Dense


Initialising the Artificial Neural Network

model = Sequential()

The Sequential model is a linear stack of layers.

Creating the Input-layer and the first hidden layer

model.add(Dense(input_dim = 2, units = 10, activation='relu', kernel_initializer='uniform'))

This line adds the input layer and one hidden layer to our neural network. Lets break down the arguments one by one:

Dense(): lets us create a densely connected neural network

input_dim : shape or number of nodes in the input layer

units : the number of neurons or nodes in the current layer (hidden layer)

activation : the activation function applied to each node.”relu” stands for Rectified Linear Unit

kernel_initializer: initial random weights of the layer

The number of input layer depends on the problem you wish to solve using the network. For example, if the problem is to predict the salaries by considering a person’s experience and age, ‘Salary’ being the dependent and ‘Experience’ and ‘Age’ being the independent variables, the model will have 2 input nodes and a single output node. Giving an optimal number of nodes in the hidden layers requires a deeper understanding. However, you can experiment with different numbers and see its effects on the result.

Creating a second hidden layer

model.add(Dense(units = 20, activation='relu', kernel_initializer='uniform'))

The line creates and adds another hidden layer to the model with 20 nodes and ‘rectifier’ activation function and uniform distribution of weights. More layers can be added depending on the problem and its complexity.


Creating the output  layer

model.add(Dense(units = 1, activation='sigmoid', kernel_initializer='uniform'))

Units here is the number of nodes in the output layer. Here we have a single output layer.
Sigmoid or softmax are the commonly used activation functions for an output layer.

Compiling the ANN classifier

model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

The ANN needs to be compiled with an optimizer function and a loss function before being trained.

  • Optimizer: an optimizer function for the network, There are several types of optimizers and the choice depends on the nature of the problem
  • Loss: used for calculating the losses and errors.There are several types and the choice depends on the nature of the problem.
  • Metrics: the metric used to measure the accuracy of the model.Types of metrics.

Fitting the model with the training set

model.fit(X_train,Y_train,batch_size=500, epochs=10)

X_train: The training data consisting of only the independent factors

Y_train: The training data consisting of only the dependent factors

Batch_size: the weight is updated after training each batch of samples

epochs: one epoch stands for one complete training of the neural network with all samples.

You will see a similar output as follows:

The values of loss and accuracy displayed along the right side denote the loss and accuracy attained during the training of the model at each epoch

Evaluating the performance on the test set

model.evaluate(X_test,Y_test)

Output:

Loss = 0.48 0r 48%

Accuracy = 0.7975 or 79%

It means that 79% of the predicted results match with the actual values in the test set.

The parameters can be tweaked to see its effects on the results and optimal values can be chosen.The Python code can be used to solve any problems such as regression or classification and just requires you to change some arguments.

Share
Picture of Amal Nair

Amal Nair

A Computer Science Engineer turned Data Scientist who is passionate about AI and all related technologies. Contact: amal.nair@analyticsindiamag.com
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.