5 Dec 2022

73

Normality of Data and Z-Scores

Format: APA

Academic level: University

Paper type: Assignment

Words: 598

Pages: 2

Downloads: 0

In statistics, normality and the normality tests are used in the determination of whether a set of data is well-modeled by a normal distribution and subsequently used to compute or calculate to what degree an underlying random variable of the data is normally distributed. In other words, a normality test ascertains if the sample under investigation has been drawn from a normally distributed population. Nonetheless, Ghasemi & Zahediasl (2012) defines normality as the ability of an analyst or a researcher to ascertain if a sample is drawn from a non-normal distribution. Normality of data, therefore, is an essential and crucial concept to a statistician. Whereas there are several techniques or tests to ascertain normality, they all serve the same purpose of detecting non-normality or deviation from normality. Primarily, most statistical tests depend upon the assumption of normality –this is the assumption that the sample data follows a normal or Gaussian distribution, or it is drawn from a normal population.

Furthermore, according to Lumley et al. (2002), normality of data is fundamental because it is pivotal in making inferences. Therefore, if the population from which the sample was drawn was non-normal, then the assumption is violated, and the data results become skewed or false. Non-normality, in other words, renders statistical tests inaccurate. Therefore, this emphasizes the paramount nature of knowing whether the data is normal or non-normal to avoid the adverse implications of an inaccurate test. Tests that depend on the assumption of normality are referred to as parametric tests.

It’s time to jumpstart your paper!

Delegate your assignment to our experts and they will do the rest.

Get custom essay

On the other hand, if the sample data is non-normal, then non-parametric tests (tests that do not rely on the assumption of normality) should be used. Because non-parametric tests cannot ascertain the difference or variability in the sample data, they cannot offer significant and generalizable results compared to parametric tests that use normalized data. However, if a sample large enough is used, then the normality assumption can be violated because the central limit theorem maintains that approximately normal sample data means a normal sampling distribution, (Ghasemi & Zahediasl, 2012).

Z-Scores 

The z-score also referred to as the standard score is the number of standard deviations from the mean data points. DeVries (2007) further elaborates by asserting that a z-score is a measure of how many standard deviations below or above the population mean a raw or random score is. According to Calculating Z-scores (n.d), Z-scores range from -3 standard deviations at the far left of the normal distribution curve to +3 standard deviations at the far right of the curve. Z-scores are salient because they proffer a platform to compare results from a test to a normal population. In other words, Z-scores provide a platform for determining or ascertaining the normality of data. Z-scores further aid researchers and statistician in the calculation of the probability of a score or data occurring within the normal distribution. Also, z-scores enable analysts to compare two scores from different normal distributions by standardizing the scores.

Mathematically, the z-score informs how many standard deviations are from the population mean. Therefore, the z-score not only provides information regarding the score but also its relativity to the normal distribution. Also, the z-score allows for statistical inferencing, especially for quantitative variables. As such, the z-score formula is represented as follows; 

Therefore, if a specific attribute under investigation is three standard deviations above the mean, it means that the attribute is three times the average distance above the mean and probably represents one of the higher scores in the sample. In contrary, if the value of the attribute is -2 deviations from the mean, then the attribute is twice the average distance below the mean and represents one of the midrange values from the sample below the mean value.

In essence, while normality of data and z-scores are not mutually inclusive, they both serve a fundamental purpose of ensuring that the results of a test are accurate. As such, both of the concepts are important and should both be considered when conducting statistical analyses.

References

Calculating Z-scores. (n.d). Retrieved from https://www.wsfcs.k12.nc.us/cms/lib/NC01001395/Centricity/Domain/3165/calculating_z_scores.pdf 

DeVries, J. (2007). About z-scores. The University of Guelph. Retrieved from https://atrium.lib.uoguelph.ca/xmlui/bitstream/handle/10214/1842/A_About_Z-Scores.pdf?sequence=7 

Ghasemi, A., & Zahediasl, S. (2012). Normality tests for statistical analysis: a guide for non-statisticians.  International journal of endocrinology and metabolism 10 (2), 486-489. DOI: 10.5812/ijem.3505

Lumley, T., Diehr, P., Emerson, S., & Chen, L. (2002). The importance of the normality assumption in large public health data sets.  Annual review of public health 23 (1), 151-169. DOI: 10.1146/annurev.publheath.23.100901.140546

Illustration
Cite this page

Select style:

Reference

StudyBounty. (2023, September 14). Normality of Data and Z-Scores.
https://studybounty.com/normality-of-data-and-z-scores-assignment

illustration

Related essays

We post free essay examples for college on a regular basis. Stay in the know!

17 Sep 2023
Statistics

Scatter Diagram: How to Create a Scatter Plot in Excel

Trends in statistical data are interpreted using scatter diagrams. A scatter diagram presents each data point in two coordinates. The first point of data representation is done in correlation to the x-axis while the...

Words: 317

Pages: 2

Views: 187

17 Sep 2023
Statistics

Calculating and Reporting Healthcare Statistics

10\. The denominator is usually calculated using the formula: No. of available beds x No. of days 50 bed x 1 day =50 11\. Percentage Occupancy is calculated as: = =86.0% 12\. Percentage Occupancy is calculated...

Words: 133

Pages: 1

Views: 151

17 Sep 2023
Statistics

Survival Rate for COVID-19 Patients: A Comparative Analysis

Null: There is no difference in the survival rate of COVID-19 patients in tropical countries compared to temperate countries. Alternative: There is a difference in the survival rate of COVID-19 patients in tropical...

Words: 255

Pages: 1

Views: 251

17 Sep 2023
Statistics

5 Types of Regression Models You Should Know

Theobald et al. (2019) explore the appropriateness of various types of regression models. Despite the importance of regression in testing hypotheses, the authors were concerned that linear regression is used without...

Words: 543

Pages: 2

Views: 175

17 Sep 2023
Statistics

The Motion Picture Industry - A Comprehensive Overview

The motion picture industry is among some of the best performing industries in the country. Having over fifty major films produced each year with different performances, it is necessary to determine the success of a...

Words: 464

Pages: 2

Views: 86

17 Sep 2023
Statistics

Spearman's Rank Correlation Coefficient (Spearman's Rho)

The Spearman’s rank coefficient, sometimes called Spearman’s rho is widely used in statistics. It is a nonparametric concept used to measure statistical dependence between two variables. It employs the use of a...

Words: 590

Pages: 2

Views: 309

illustration

Running out of time?

Entrust your assignment to proficient writers and receive TOP-quality paper before the deadline is over.

Illustration