Sign in

Data Science Enthusiastic | Electronics R&D | Data Visualization | BI | NLP |

Statistics

Important probability distributions in statistics

Normal/Gaussian Distribution

The ‘Normal distribution’, also known as ‘Gaussian distribution’ or ‘bell curve’ is one of the most important probability distributions in statistics as it fits many natural phenomena like blood pressure, IQ scores, measurement errors, etc. Hence, stating normal distributions are continuous.

Normal distribution using python program

In a normal distribution, we use random.normal() function to obtain a normal distribution.

The function mainly has three parameters:

  1. loc: This parameter depicts the mean value that also means till where the peak of the bell exists.
  2. scale: This parameter depicts the standard deviation, which also means to assign the flatness of…


Deep Learning

Generating artificial images in the deep convolutional neural network model

In this article, we will discuss Generative Adversarial Networks (GAN) that are deep neural net architecture comprised of two neural networks, competing for one against the other. Here, the GAN is composed of two words, and their meaning is shown below:

Generative means to generate probability distribution that become close to the original data which we want to approximate.

Adversarial means in general, is opposition because there are two models i.e. discriminator and generator, they try to oppose each other to learn the probability distribution function. …


Machine Learning

Techniques of choosing the best set of features from the data

Feature Selection and its types

We all work on bucket loads of data, not every column is important to make our model.

For example, consider a dataset of students having features like name, age, sex, hours of study, and school name. If you must make a model that predicts the score of students, it is obvious that the feature ‘hours of study is the one that helps compared to the others.

When you work on a large dataset it is not easy to know the importance of a feature as easy as I stated above. So, we need a technique…


Future

Software developer jobs

According to Global Developer Population and Demographic Study, Software Developers have a total population of 26.4 million worldwide and is going to exponentially rise in the coming years because of the relevance and need of software developments, data science field, and cybersecurity with emerging technologies like automation (AI, ML, and DL), cryptocurrency and virtual reality.

These fields would be the catalyst for jobs like engineering, data science, smart products, sales, and marketing. …


Machine Learning

A pre-processing step in machine learning modeling

Introduction

A Data Analyst spends most of the time preparing and cleaning the data because the raw data is unstructured and contains noise that can not be used by machine learning models directly. Therefore this data is to be cleaned/filter which enhances the quality of the model and also helps in feature engineering.

The main motive of Data cleaning is to deal with handling encoding categorical data, handling missing values, dropping the redundant features, and reducing the dimensionality with the help of standard dimensionality reduction techniques. This step makes/prepares our data as a whole to be applied to any machine…


Programming

Open source programming and object-oriented programming language

Introduction

Java is one of the most popular and powerful programming languages used all over the world to design mobile applications, desktop applications, games, database connections, and much more.

  • Java is a platform-independent language, i.e it can work on any platform like Windows, Linux, Mac, etc.
  • It is an open-source programming language.
  • It is an object-oriented programming language.

Java Identifiers

Java identifiers are those which is used to identify classes, methods, variables, etc.

public class Student1
{
public static void main(String[] args)
{
// TODO Auto-generated method stub
int roll_no=45;
String Student_name=”Arpita Sinha”
}
}

In this code, the identifiers…


Machine Learning

Scaling methods in machine learning and deep learning algorithms

Standardization and normalization are among the data preprocessing techniques used heavily in machine learning and deep learning projects.

The main role of these techniques

  • To scale all of the data in a similar format to make the learning process easy for the model.
  • The odd values in the data become scaled or normalized and behave like a part of the data.

We will discuss both the concept deeply with python examples.

Standardization

The basic scaling of the data is to do make it standard so that all the values will be in common range. In standardization, the mean and the…


Natural Language Processing

Text representation model in natural language processing

In this article, we will discuss bag of words (BOW) model building in natural language processing. Sometimes, we try to find the occurrence of the words in the text document and we try with a simple count method to search the count of the one word. But if we want to know the occurrence of each word in the text document and with its count then we use the bag of words method also known as word embeddings.

The bag of words is used to extract the information from the text and trying to make them a dictionary or histogram…


The man-made object to fly on a different planet

For the people who saw the name INGENUITY for the first time, let me tell you what actually it is and why you should know about it, I can bet one thing after you all read this article you all will be astonished and your mind will be blown away that humans have achieved something like this which is groundbreaking in the field of space exploration. Now let us talk about it.

INGENUITY is a helicopter designed and developed by JET PROPULSION LABORATORY (JPL) NASA,

It is the first man-made object to fly on a different planet on its own…


Programming

Basic understanding of control loops with examples

An IF statement makes simple decisions based on true or false conditions. This statement maintains the control flow of the statements. The concept of this statement is simple; if the condition is satisfied, it executes the true statements, or else the program stops. The below diagram represents this concept.

Syntax:

if condition:
statements

Program: (simple)

Age = 25if Age >= 18:
print(“Eligible to vote”)
Output:
Eligible to vote

Explanation: In the above program, we have defined the value of ‘Age’ as 25. The condition is if the ‘Age’ value is greater than or equal to 18, it will print…

Amit Chauhan

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store