Skip to main content

Statistics Introduction

 

  •  Statistics is the study of the collection ,analysis, interpretation, presentation, and organisation of data.
  •  It is a way to understand the data and find the patterns in that.
 

 Terminologies in Statistics:


  •  Population is the whole contains every events in an experiments.
  •  Parameter is the characteristics of population such as population such as population mean, median etc.
  •  Sample is a subset of the population.
  •  Statistics is a characteristics of sample such as sample mean, median etc.
 

Types of Analysis or data types

 

 
 

 Numerical or Quantitative

  • Quantitative is nothing but variables are expressed in numerical terms.
  • Example : Price , income, etc.
  • Their are two types of data in numerical data type.

        Continuous Data Type:

  • A continuous data set is a quantitative data set representing a scale of measurement that can consist of numbers other than whole numbers, like decimals and fractions. 
  • Example: Height, weight, length, temperature.

        Discrete Data Type:

  • Discrete data is based on counts. Only a finite number of values is possible.
  • There is constant interval for an instance.
  • Example: No of children’s, and interval is 1 because we can’t say 1.5 like that.
 

Categorical or Qualitative

  • Qualitative is nothing but variables  represents characteristics but can’t expressed in numerical terms.
  • Example : marital status etc 
  • Their are three data types in Categorical Data Type.

        Nominal Data Type:

  • The values which is not having specific order. 
  • Example: Names , TV, fan etc.

        Ordinal Data Type:

  • The ordinal data in which the categories are ordered.
  • Example: Education Scoring Class (Fail, Pass, First Class, Second Class, Distinction) , Ageing(Young age , Middle Age , Old age) etc.

        Binary Data Type

  • Binary data is an important special case of categorical data that takes only one of two values.
  • Example: 0/1, yes/no, accept/reject.




Here a small example. 

A data frame containing columns as  name, degree, gender, performance, Experience, Promotion and three records.

Name column is an example for nominal data type because their is no specific order.
Degree column is an example for ordinal data type because each degree has some qualification has to be done.
Gender column is an example for binary data type, because here we have two values either male or female.
Performance column is an example for ordinal data type.
Experience column is an example for discrete data type where experience column is integer, no floating values.
Promotion column is an example for binary data type.




Here "data" is variable we stored data frame. 
data.info()   

It will give the information about data Frame. On Dtype column will tell data type of each column. int64,float64 tells that column is numerical data type.
object Dtype tells us that column is categorical data type.

Why data types is important?

Datatypes are an important concept because in statistical analysis we analyze continuous data differently than categorical data otherwise it would result in a wrong analysis. Therefore knowing the types of data you are dealing with, enables you to choose the correct method for analysis.


Two types of statistics:

Descriptive Statistics 

  • In Descriptive Statistics your are describing, presenting, summarizing and organizing your data.
  • It gives basic information about data helps to further proceed the data analysis.

Inferential Statistics

  •  It is about using data from sample and then making inferences about the larger population from which the sample is drawn. 
  •  The goal of the inferential statistics is to draw conclusions from a sample and generalize them to the population.
    
Descriptive Statistics in Part - 2


Learn Data Science Material which helps to learn concepts in Python, Statistics , Data Visualization, Machine Learning , Deep Learning. And it contains Projects helps to understand the flow of building model , and what are the necessary steps should be taken depending on the data set. Interview Questions helps to crack the interview. 





Learn Python from basics to advanced. 



Join ML in python channel in telegram , Where you can learn every concepts in Python, Statistics, Data Visualization, Machine Learning, Deep Learning.

  

Join Aptitude Preparation channel in telegram , this channel helps to crack any interview.


Comments

Popular posts from this blog

Practice Problems in Python [ Part - 1 ]

                                            Python 1. Write a program which will find all such numbers which are divisible by 3 but are not a multiple of 7,between 2000 and 3200 (both included). soln :            def filter_numbers():           """           function to filter out numbers by extracting numbers           which is divisible by 3 but not multiple of 7.           """           filtered_list=[]           for i in range(2000, 3201):               if (i%3==0) and (i%7!=0):                   filtered_list.append(str(i))    ...

Python Introduction

 Introduction  Python is developed by Guido Van Rossum and released in 1991. Python is high level, interpreted, general purpose programming language. It is one of the top five most used languages in the world. Currently there are 8.2 million developers who code in Python. Python is one of the most preferred languages in the field of Data Science and Artificial Intelligence. Key Features Python is an interpreted language, unlike compiled languages like Java, C, C++, C#, Go etc., Python codes are executed directly even before compiling.  Python is Dynamically typed, no need to mention type of variable before assigning. Python handles it without raising any error. Python codes can be executed on different software or operating systems without changing it. Python supports both Functional and Object oriented programming as it supports creating classes and objects. Python has high number of modules and frameworks support. Python is free and Open Source, which means it is availa...

Types of Machine Learning

                                   Machine Learning  Machine Learning is an application of artificial intelligence where a computer/machine learns from the past experiences (input data) and makes future predictions. It finds the pattern in the data , based on the pattern it gives the future predictions from the unseen data.   It is a way to understand the data and find the patterns in that. Types of Machine Learning        Supervised Machine Learning An algorithm learns from example data and associated target responses that can consist of numeric values or string labels.  Generally the algorithm should find the pattern how input and output is mapped           Two types of Supervised Learning: Regression:  The problem is regression type when the output variable is real or continuous. Example :  Predicting salar...