I. Introduction
A machine learning algorithm (such as classification, clustering, or regression) uses a training dataset to determine weights (model parameters) that can then be applied to unseen data for predictive purposes. Behind every machine learning model is an optimization algorithm that relies heavily on calculus. In this article, we discuss one such optimization algorithm, namely, the Gradient Descent Approximation (GDA), and show how it can be used to build a simple regression estimator.
II. Optimization Using the Gradient Descent Algorithm
II.1 Derivatives and Gradients
In one dimension, we can find the maximum and minimum of a function using derivatives. Let us consider a simple quadratic function f(x) with a single minimum at X_min; for concreteness, take f(x) = (x - 2)^2, whose minimum is at X_min = 2.
Suppose we want to find the minimum of the function f(x). Using the gradient descent method, we start from some initial guess X_0 and update X according to this equation:

X_{n+1} = X_n - eta * f'(X_n)

where eta is a small positive constant called the learning rate. Note the following:
- when X_n > X_min, f'(X_n) > 0: this ensures that X_{n+1} < X_n, so we take steps to the left, toward the minimum.
- when X_n < X_min, f'(X_n) < 0: this ensures that X_{n+1} > X_n, so we take steps to the right, toward X_min.
The above observations show that, for a convex function like this one, the gradient descent algorithm reaches the minimum regardless of the initial guess; how many optimization steps it takes to get to X_min depends on how good that guess is. However, if the learning rate is too large, the algorithm can repeatedly step past the minimum without settling into it. This is often referred to as "overshooting". In practice, one stops the iteration by adding a convergence criterion such as:

|X_{n+1} - X_n| < epsilon

where epsilon is a small positive number.
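To make this concrete, here is a minimal sketch of the one-dimensional update rule; the function names and the quadratic f(x) = (x - 2)^2 are illustrative choices, not part of the original article:

# Minimal 1-D gradient descent sketch; f(x) = (x - 2)**2, so f'(x) = 2*(x - 2)
def gradient_descent_1d(f_prime, x0, eta=0.1, epsilon=1e-8, max_iter=1000):
    x = x0
    for _ in range(max_iter):
        x_new = x - eta * f_prime(x)   # X_{n+1} = X_n - eta * f'(X_n)
        if abs(x_new - x) < epsilon:   # convergence criterion
            break
        x = x_new
    return x

x_min = gradient_descent_1d(lambda x: 2*(x - 2), x0=10.0)
print(x_min)   # approximately 2.0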
In higher dimensions, a function of several variables can be minimized using the gradient descent algorithm as well. In this case, we use the gradient of f to update the vector X:

X_{n+1} = X_n - eta * ∇f(X_n)

As in one dimension, one could ensure convergence by adding a convergence criterion such as:

||X_{n+1} - X_n|| < epsilon
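As an illustration, here is a minimal NumPy sketch of the vector update, assuming a simple bowl-shaped function f(X) = ||X - c||^2 whose gradient is 2*(X - c); the function and the point c are assumptions made for this example only:

import numpy as np

# Minimal multi-dimensional gradient descent sketch
c = np.array([1.0, -3.0])              # the minimizer of f(X) = ||X - c||^2
grad_f = lambda X: 2.0 * (X - c)       # gradient of f

X = np.zeros(2)                        # initial guess
eta, epsilon = 0.1, 1e-8
for _ in range(1000):
    X_new = X - eta * grad_f(X)        # X_{n+1} = X_n - eta * grad f(X_n)
    if np.linalg.norm(X_new - X) < epsilon:
        break
    X = X_new
print(X)   # approximately [1.0, -3.0]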
II.2 Case Study: Building a Simple Regression Estimator
In this subsection, we describe how a simple Python estimator can be built to perform linear regression using the gradient descent method. Let's assume we have a one-dimensional dataset containing a single feature (X) and an outcome (y), with N observations:

(X_1, y_1), (X_2, y_2), ..., (X_N, y_N)
A linear model to fit the data is given as:

y_hat = w0 + w1*X

where w0 and w1 are the weights that the algorithm learns during training.
II.3 Gradient Descent Algorithm
If we assume that the errors in the model are independent and normally distributed with standard deviation σ, then the likelihood function is given as:

L(w0, w1) = Π_{i=1}^{N} (1/√(2πσ²)) exp(-(y_i - w0 - w1*X_i)² / (2σ²))
Maximizing this likelihood is equivalent to minimizing the sum of squared errors (SSE) with respect to w0 and w1, since taking the logarithm turns the product into a sum and the constant factors do not depend on the weights:

SSE = (1/2) Σ_{i=1}^{N} (y_i - w0 - w1*X_i)²
The SSE objective function is often minimized using the gradient descent approximation (GDA) algorithm. In the GDA method, the weights are updated according to the following procedure:

w := w - eta * ∇SSE

i.e., in the direction opposite to the gradient. Here, eta is a small positive constant referred to as the learning rate. This equation can be written in component form as:

w0 := w0 + eta * Σ_{i=1}^{N} (y_i - w0 - w1*X_i)
w1 := w1 + eta * Σ_{i=1}^{N} (y_i - w0 - w1*X_i) * X_i
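The component form follows from differentiating the SSE; the factor of 1/2 cancels the 2 produced by the chain rule. In LaTeX notation:

\frac{\partial\,\mathrm{SSE}}{\partial w_0} = -\sum_{i=1}^{N} (y_i - w_0 - w_1 X_i), \qquad \frac{\partial\,\mathrm{SSE}}{\partial w_1} = -\sum_{i=1}^{N} (y_i - w_0 - w_1 X_i)\, X_i

Substituting these derivatives into w := w - eta * ∇SSE yields the two component equations above. Note that the implementation below applies the update one observation at a time (a stochastic variant of GDA) rather than summing over all N points before each weight update.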
II.4 Python Implementation
import numpy as np
import matplotlib.pyplot as plt

class GradientDescent(object):
    """Gradient descent optimizer.

    Parameters
    ------------
    eta : float
        Learning rate (between 0.0 and 1.0)
    n_iter : int
        Passes over the training dataset.

    Attributes
    -----------
    w_ : 1d-array
        Weights after fitting.
    errors_ : list
        Sum of squared errors in every epoch.
    """

    def __init__(self, eta=0.01, n_iter=10):
        self.eta = eta
        self.n_iter = n_iter

    def fit(self, X, y):
        """Fit the data.

        Parameters
        ----------
        X : {array-like}, shape = [n_points]
            Independent variable or predictor.
        y : array-like, shape = [n_points]
            Outcome of prediction.

        Returns
        -------
        self : object
        """
        self.w_ = np.zeros(2)            # weights [w0, w1], initialized to zero
        self.errors_ = []
        for i in range(self.n_iter):     # one epoch = one pass over the data
            errors = 0
            for j in range(X.shape[0]):
                # update the weights one observation at a time,
                # following w := w - eta * grad(SSE) for a single point
                self.w_[1] += self.eta*X[j]*(y[j] - self.w_[0] - self.w_[1]*X[j])
                self.w_[0] += self.eta*(y[j] - self.w_[0] - self.w_[1]*X[j])
                errors += 0.5*(y[j] - self.w_[0] - self.w_[1]*X[j])**2
            self.errors_.append(errors)  # track the SSE for this epoch
        return self

    def predict(self, X):
        """Return predicted y values."""
        return self.w_[0] + self.w_[1]*X
II.5 Application of the Basic Regression Model
a) Create dataset
np.random.seed(1)                              # for reproducibility
X = np.linspace(0, 1, 10)                      # 10 evenly spaced points in [0, 1]
y = 2*X + 1                                    # true line: slope 2, intercept 1
y = y + np.random.normal(0, 0.05, X.shape[0])  # add Gaussian noise
b) Fit and Predict
gda = GradientDescent(eta=0.1, n_iter=100)
gda.fit(X, y)
y_hat = gda.predict(X)
c) Plot Output
plt.figure()
plt.scatter(X, y, marker='x', c='r', alpha=0.5, label='data')
plt.plot(X, y_hat, marker='s', c='b', alpha=0.5, label='fit')
plt.xlabel('x')
plt.ylabel('y')
plt.legend()
plt.show()
d) Calculate the R-squared value
R_sq = 1 - ((y_hat - y)**2).sum() / ((y - np.mean(y))**2).sum()
R_sq
0.991281901588877
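As a quick sanity check (not part of the original estimator), one could compare the learned weights against NumPy's closed-form least-squares fit; both should be close to the true intercept 1 and slope 2 used to generate the data:

w1_ls, w0_ls = np.polyfit(X, y, deg=1)  # closed-form least squares: slope first, then intercept
print(w0_ls, w1_ls)                     # closed-form intercept and slope
print(gda.w_)                           # gda.w_[0] ~ intercept, gda.w_[1] ~ slope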
III. Summary and Conclusion
In summary, we have shown how a simple linear regression estimator can be built with the GDA algorithm and implemented in Python. As noted at the outset, behind every machine learning model is an optimization algorithm that relies heavily on calculus. If you would like to see how the GDA algorithm is used in a real machine learning classification algorithm, or to obtain the full Python code for this article, see the accompanying GitHub repository.