Overview
Test Series
Correlation is a statistical measure that expresses the extent to which two variables are linearly related (meaning they change together at a constant rate). It’s a common tool for describing simple relationships between data sets. When two sets of data show high fidelity to change with respect to one another we say they have a high correlation. We use the correlation coefficient, r to quantify the magnitude of the relationship. The correlation coefficient r can have a value between -1 to 1. Here, 1 represents a perfect positive correlation between the two data sets, 0 represents no correlation and -1 represents a perfect negative correlation. In this math article, we will study correlation, its types, properties and different correlation coefficients. There are different types of correlation coefficients that indicate the nature of the relationship. Let's explore these types and their significance in statistical analysis.
Types of correlation is a vital topic to be known for the competitive exams such as the UGC-NET Commerce Examination.
Maths Notes Free PDFs
Topic | PDF Link |
---|---|
Class 12 Maths Important Topics Free Notes PDF | Download PDF |
Class 10, 11 Mathematics Study Notes | Download PDF |
Most Asked Maths Questions in Exams | Download PDF |
Increasing and Decreasing Function in Maths | Download PDF |
In this article, the readers will be able to know about the types of correlation in detail, along with certain other vital topics in detail.
Read about Correlation and regression.
Correlation is a process to establish a relationship between two variables. In statistics under relation and functions, methods of correlation summarize the relationship between two variables in a single unitless number called the correlation coefficient. The correlation coefficient is usually represented using the symbol r, and it ranges from -1 to +1. If the coefficient is close to 0 then the relation between the relationship between the two numbers is less and when the relationship is far away from 0 then the relationship is strong between the two variables.
The correlation coefficient close to plus 1 means a positive relationship between the two variables, with increases in one of the variables being associated with increases in the other variable. A correlation coefficient close to -1 indicates a negative relationship between two variables, with an increase in one of the variables being associated with a decrease in the other variable.
Find out about Mann Whitney test (Utest).
The correlation coefficient, usually written as r, is a number that tells us how strongly two things are related. These things, or variables, are often shown as X and Y.
The value of r is always between -1 and +1.
In simple terms, the closer r is to 1 or -1, the stronger the relationship. The closer it is to 0, the weaker the connection.
Correlation Coefficient Values and Their Meaning
Correlation Coefficient |
Type of Correlation |
Meaning |
+1 |
Perfect positive correlation |
When one variable goes up or down, the other moves in the same direction exactly. |
0 |
Zero correlation |
There is no connection between the two variables. Changes in one do not affect the other. |
-1 |
Perfect negative correlation |
When one variable goes up, the other moves exactly in the opposite direction. |
A scatter diagram is a graph used to show how two sets of numbers relate.
Each point on the graph represents a pair of X and Y values.
This diagram helps us see patterns or trends between the two variables. For example:
Correlation tells us how two variables are related to each other. The correlation coefficient measures this relationship and helps us understand how closely the two variables move together.
To compare two sets of data, we use formulas to calculate the correlation coefficient.
The most common formula for measuring correlation is called the Pearson correlation coefficient. It shows the strength and direction of a straight-line (linear) relationship between two data sets.
r = [n(Σxy) – (Σx)(Σy)] / √{[n(Σx²) – (Σx)²] × [n(Σy²) – (Σy)²]}
There are three classes of correlation in common.
A positive correlation is a relationship between two variables that are directly related to each other. A positive correlation exists when one variable decreases as the other variable decreases, or one variable increases while the other increases.
For example, the more money you save, the more financially secure you feel or when the temperature goes up, the rate at which ice melts also goes up. The graph for a strong positive correlation would look like this:
Fig: Positive Correlation
Negative correlation is a relationship between two variables in which one variable increases as the other decreases, and vice versa. A perfect negative correlation means the relationship that exists between two variables is exactly opposite all of the time. For example, as we climb up a mountain (increase in height) it gets colder (decrease in temperature). In statistics, a perfect negative correlation is represented by the value -1.0. The graph for a strong negative correlation would look like this:
Fig: Negative Correlation
There also exists a condition known as no Correlation, where there is no relation or dependence between two variables. A zero value of the correlation coefficient indicates no correlation. The graph for data sets with no correlation would look like this:
Fig: No correlation
Linearity of a correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. In simple words, correlation is said to be linear if the ratio of change of the two variables is constant, i.e if one of them doubles then the other one doubles or is halved i.e. changes by a factor of 2.
For example, the demand of vegetables and the prices of vegetables or the time spent on video games and the marks in exams.
Non-linear or curvilinear correlation is said to occur when the ratio of change between two variables is not constant. It can happen that as the value of one variable increases linearly with time, the value of another variable increases exponentially.
Simple correlation is a measure used to determine the strength and the direction of the relationship between two variables, X and Y. A simple correlation coefficient can range from –1 to 1. However, maximum (or minimum) values of some simple correlations cannot reach unity (i.e., 1 or –1).
Yield of paddy and the use of fertilizers is an example of simple correlation as yield of paddy depends on the use of fertilizers i.e. presence of one variable affects another variable.
In statistics, the coefficient of multiple correlation is a measure of how well a given variable can be predicted using a linear function of a set of other variables. It is the correlation between the variable’s values and the best predictions that can be computed linearly from the predictive variables.
For example, a researcher is interested in computing the correlation between crime rates in a region and multiple factors like unemployment, illiteracy, substance abuse, inflation etc.
Partial correlation measures the strength of a relationship between two variables, while controlling for the effect of one or more other variables.
For example, we might want to see if there is a correlation between the amount of food eaten and blood pressure, while controlling for weight or amount of exercise.
It’s possible to control for multiple variables (called control variables or covariates). However, more than one or two is usually not recommended because the more control variables, the less reliable our test.
Learn about Karl Pearson’s Correlation Coefficient
Correlation coefficients are used in the statistics for measuring how strong a relationship exists between two variables. There are many types of correlation coefficient like Pearson’s correlation that are used in linear regression analysis. It is very much popular and useful in statistics.
The main types of correlation coefficients are given below.
It is the most common formula used for linear dependency between the data set. Its value lies between -1 to +1. When the coefficient comes down to zero, then the data will be considered as not related.
The formula for Pearson correlation is,
r={ Σ(xi-x)−Σ(yi-y)}/√{Σ(xi-x)^2*Σ(yi-y)^2}
In statistics, Spearman’s rank correlation coefficient, named after Charles Spearman and often denoted by the Greek letter
ρ or rs, is a nonparametric measure of rank correlation. It assesses how well the relationship between two variables can be described using a monotonic function.
The formula for Spearman Correlation is given below.
ρ=1-{(6Σdi^2)/(n*(n^2-1))}
Where,
ρ=Spearman’s rank correlation coefficient
di= Difference between the two ranks of each observation
n = Number of observations.
The above formula is used to find correlation using Spearman Correlation.
The population correlation coefficient is a measure of linearity between A and B. The usual estimate is the sample correlation coefficient given by the below mentioned formula.
rab=σab/(σa×σb)
Where, rab= population correlation coefficient
σab= population covariance
σa=population standard deviation for variable A
σb=population standard deviation for variable B
Read about Variable.
The degree of correlation measures the strength and direction of the relationship between two variables. It is quantified by the correlation coefficient, which ranges from -1 to +1.
Perfect Correlation:
1. When two variables change in exact proportion to each other, the correlation is said to be perfect. This occurs in two forms:
2. Zero Correlation:
3. Limited Degree of Correlation:
Most real-world relationships fall between perfect correlation and no correlation. Here, the correlation coefficient lies between -1 and +1, but is not exactly these values.
The strength of this limited correlation is classified as:
The main properties of Correlation are listed below.
Symbolically,
−1≤r≤+1 or |r|<1
The difference between Correlation and Regression is listed below.
Correlation |
Regression |
|
|
|
|
|
|
|
|
|
|
|
|
The difference between Correlation and Covariance is listed below.
Correlation |
Covariance |
|
|
|
|
|
|
|
|
|
|
Problem 1:
Calculate the correlation coefficient for the following data, using Pearson's correlation coefficient.
A=4, 8, 12, 16 and B=5, 10, 15, 20.
Solution:
We need to first construct a table as follows to get the required values of the formula.
A |
B |
A2 |
B2 |
AB |
4 |
5 |
16 |
25 |
20 |
8 |
10 |
64 |
100 |
80 |
12 |
15 |
144 |
225 |
180 |
16 |
20 |
256 |
400 |
320 |
Now we calculate the sum for each column,
ΣA=4+8+12+16=40
ΣB=5+10+15+20=50
ΣA2=16+64+144+256=480
ΣB2=25+100+225+400=750
ΣAB=20+80+180+320=600, and
n=4
Now we just put the values directly in the formula.
Thus the correlation coefficient is 1.
Problem 2:
Calculate the correlation coefficient for the following data, using Spearman's rank correlation coefficient.
A = 15, 25, 35, 45
B = 20, 10, 30, 40
Solution:
We need to first construct a table as follows to get the required values for the formula.
Value 1 (A) |
Rank 1 |
Value 2 (B) |
Rank 2 |
di |
di2 |
15 |
1 |
20 |
2 |
-1 |
1 |
25 |
2 |
10 |
1 |
1 |
1 |
35 |
3 |
30 |
3 |
0 |
0 |
45 |
4 |
40 |
4 |
0 |
0 |
Use Spearman's rank correlation formula:
ρ = 1 - (6 × Σd²) / (n(n² - 1))
ρ = 1 - (6 × 2) / (4(16 - 1))
ρ = 1 - 12 / 60
ρ = 1 - 0.2
ρ = 0.8
Know about Measures of dispersion.
Understanding the types of correlation is crucial for researchers, statisticians, and anyone involved in data analysis. Whether exploring the relationship between economic indicators, studying the impact of variables in scientific research, or making predictions based on historical data, correlation plays a pivotal role. The nuances of positive, negative, or no correlation provide valuable insights into the dynamics of variables, helping to make informed decisions and predictions.
Types of correlation is a vital topic as per several competitive exams. It would help if you learned other similar topics with the Testbook App. For better practice, solve the below provided previous year papers and mock tests for each of the given entrance exam:
Download the Testbook APP & Get Pass Pro Max FREE for 7 Days
Download the testbook app and unlock advanced analytics.