19 Nov 2022


How to Understand Correlation: The Statistical Process of Establishing the Relationship Between Variables

Format: APA

Academic level: College

Paper type: Coursework

Words: 579

Pages: 2


Correlation is a statistical process for establishing the relationship between variables. There are three types of correlation: positive correlation, negative correlation, and no correlation. Positive correlation is a relationship in which the values of two variables vary directly. An increase in the value of one variable predicts an increase in the value of the other, and a decrease in one predicts a decrease in the other. A negative correlation, on the other hand, is a relationship in which the values of two variables vary inversely (Weaver et al., 2017). No correlation means that the values of the variables are not related, so they exhibit no linear dependence.

Correlation also comes in three strengths: strong, weak, and none. A strong correlation means that the values of one variable closely predict the values of a second variable. When the values of the two variables are plotted on a scatter plot, the points lie close to the line of best fit. A weak correlation means that the values of the two variables predict each other, but with low predictability. On a scatter plot, the points lie at varying distances from the line of best fit. Finally, two variables have no correlation when the values of one cannot be used to predict the values of the other, so no meaningful line of best fit can be drawn through the data points.

In statistics, correlation is expressed using a correlation coefficient, denoted by the letter r. Its value ranges from -1 to 1. A value of r = 1 indicates a perfect positive correlation, r = 0 indicates that the two variables are not linearly correlated, and r = -1 indicates a perfect negative correlation. For instance, a correlation coefficient of r = -0.25 carries two pieces of information. First, the negative sign shows that the variables are negatively correlated: an increase in the values of one variable predicts a decrease in the values of the second. Second, -0.25 lies closer to 0 than to -1, which indicates that the correlation is somewhat weak.

Correlation was put to the test in a survey that examined the relationship between cancer prevalence and age among US citizens. The two variables under analysis were the prevalence of cancer and age. The survey analyzed data from 522 US citizens, asking each respondent's age and whether they had been diagnosed with cancer. It tested the hypothesis that cancer prevalence increases with age. The ages of the respondents were clustered into five-year groups, as shown in Table 1.

Table 1

Age Group Midpoint    Cancer Diagnosis
2.5
7.5
12.5
17.5
22.5
27.5
32.5
37.5                  14
42.5                  22
47.5                  35
52.5                  38
57.5                  40
62.5                  43
67.5                  57
72.5                  52
77.5                  47
82.5                  64
87.5                  30
92.5                  20
97.5                  34
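As a rough illustration of how a correlation coefficient could be computed from Table 1, the sketch below implements the standard Pearson formula by hand. The essay does not say which software or formula produced its result, nor what the blank cells for the seven youngest age groups mean, so both are assumptions here. The names `pearson_r`, `midpoints`, and `diagnoses` are hypothetical, chosen only for this example.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sums of squared deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("r is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Age-group midpoints from Table 1: 2.5, 7.5, ..., 97.5.
midpoints = [2.5 + 5 * i for i in range(20)]
# Counts as listed in Table 1; None marks the seven blank cells.
diagnoses = [None] * 7 + [14, 22, 35, 38, 40, 43, 57, 52, 47, 64, 30, 20, 34]

# Option A: use only the rows that actually list a count.
rows = [(x, y) for x, y in zip(midpoints, diagnoses) if y is not None]
r_listed = pearson_r([x for x, _ in rows], [y for _, y in rows])

# Option B (ASSUMPTION, not stated in the source): blank cells mean zero.
zero_filled = [0 if y is None else y for y in diagnoses]
r_zero_filled = pearson_r(midpoints, zero_filled)

print(f"listed rows only: r = {r_listed:.2f}")
print(f"blanks as zeros:  r = {r_zero_filled:.2f}")
```

The two options give noticeably different values, which shows how much the handling of the blank cells matters. Under the zero-count assumption the coefficient comes out at about 0.78, while the 13 populated rows alone give a much weaker value of about 0.28.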


The survey gave r = 0.77, which supports the hypothesis that cancer prevalence increases with age. Thus, the survey indicated that age and cancer prevalence were positively correlated. The results were expected, given that they agreed with the hypothesis. Furthermore, they agree with DeSantis et al. (2019), who noted that cancer is less prevalent among Americans at young ages and becomes more prevalent as age advances.

In conclusion, correlation is a process that seeks to establish the relationship between two variables. Correlation is represented using a correlation coefficient (r), whose value ranges from -1 to 1. A value of -1 indicates a perfect negative correlation, zero indicates no correlation, and 1 indicates a perfect positive correlation. A correlation coefficient of r = -0.25 indicates a weak negative correlation. On the other hand, r = 0.77 indicates a somewhat strong positive correlation between age and the prevalence of cancer.
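The way the essay reads a coefficient, by its sign and by its distance from zero, can be sketched as a small helper. The function name `describe_correlation` and the 0.5 cutoff between "weak" and "strong" are assumptions made for this example. The essay gives no numeric threshold, and textbooks differ on where to draw the line.

```python
def describe_correlation(r, strong_cutoff=0.5):
    """Describe r in words; strong_cutoff is an assumed, adjustable threshold."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("r must lie between -1 and 1")
    if r == 0:
        return "no correlation"
    # The sign gives the direction of the relationship.
    direction = "positive" if r > 0 else "negative"
    # The magnitude gives the strength of the relationship.
    magnitude = abs(r)
    if magnitude == 1.0:
        strength = "perfect"
    elif magnitude >= strong_cutoff:
        strength = "strong"
    else:
        strength = "weak"
    return f"{strength} {direction} correlation"


for value in (-0.25, 0.77):
    print(value, "->", describe_correlation(value))
```

With the assumed cutoff, the essay's two examples are labeled the same way the text describes them: r = -0.25 as a weak negative correlation and r = 0.77 as a strong positive one.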

References  

DeSantis, C., Miller, K., Dale, W., Mohile, S., Cohen, H., Leach, C., et al. (2019). Cancer statistics for adults aged 85 years and older, 2019. CA: A Cancer Journal for Clinicians, 69(6), 452-467. https://doi.org/10.3322/caac.21577

Weaver, K., Morales, V., Dunn, S., Godde, K., & Weaver, P. (2017). An introduction to statistical analysis in research: With applications in the biological and life sciences (pp. 48-127). John Wiley & Sons.

StudyBounty. (2023, September 15). How to Understand Correlation: The Statistical Process of Establishing the Relationship Between Variables.
https://studybounty.com/how-to-understand-correlation-the-statistical-process-of-establishing-the-relationship-between-variables-coursework
