27 Nov 2022

How to Identify Potential Outliers

Q1: Unordered (original data) 

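| Student ID | Distance to Work (whole miles) |
|---|---|
| | 31 |
| | 11 |
| | 18 |
| | 20 |
| | 14 |
| | 10 |
| 10 | 12 |
| 11 | 17 |
| 12 | 13 |
| 13 | 33 |
| 14 | 15 |
| 15 | 20 |
| 16 | |
| 17 | 43 |
| 18 | |
| 19 | 24 |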

Q2: Ordered Data 

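| Student ID | Distance to Work (whole miles) |
|---|---|
| 16 | |
| 18 | |
| | 10 |
| | 11 |
| 10 | 12 |
| 12 | 13 |
| | 14 |
| 14 | 15 |
| 11 | 17 |
| | 18 |
| | 20 |
| 15 | 20 |
| 19 | 24 |
| | 31 |
| 13 | 33 |
| 17 | 43 |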

Q3: Calculated z-scores associated with each student 

The z-scores were calculated with the formula z = (x - µ)/σ, where x is a student's distance to work in miles, µ = 24, and σ = 11.23. The resulting z-scores are listed below.

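| Student ID | z-score |
|---|---|
| | -2.14 |
| | -2.14 |
| 16 | -1.69 |
| 18 | -1.69 |
| | -1.51 |
| | -1.25 |
| | -1.16 |
| 10 | -1.07 |
| 12 | -0.98 |
| | -0.89 |
| 14 | -0.80 |
| 11 | -0.62 |
| | -0.53 |
| | -0.36 |
| 15 | -0.36 |
| 19 | 0.00 |
| | 0.62 |
| 13 | 0.80 |
| 17 | 1.69 |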
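The z-score formula from Q3 can be sketched in a few lines of Python. This is only an illustration: it takes µ = 24 and σ = 11.23 exactly as stated in the text, and the `distances` list is an assumption that contains only the distances listed in the ordered data for Q2. The names `z_score`, `MU`, and `SIGMA` are made up for this example.

```python
MU = 24.0      # mean, as stated in the text
SIGMA = 11.23  # standard deviation, as stated in the text


def z_score(x, mu=MU, sigma=SIGMA):
    """Return how many standard deviations x lies from mu."""
    return (x - mu) / sigma


# Distances (whole miles) taken from the ordered data in Q2.
# Assumption: only the values listed there are used.
distances = [10, 11, 12, 13, 14, 15, 17, 18, 20, 20, 24, 31, 33, 43]

# Round to two decimals, the precision used for the reported z-scores.
z_scores = {x: round(z_score(x), 2) for x in distances}

for x, z in z_scores.items():
    print(f"{x:>2} miles -> z = {z:+.2f}")
```

With these inputs, a distance of 43 miles gives z = 1.69 and a distance of 24 miles gives z = 0.00, which agrees with the reported values.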

Q4: Identify potential outliers and explain your reasoning 

Based on the data, there are no significant potential outliers except the value 43. The box plot below shows this value.
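A box plot usually flags points that lie more than 1.5 times the interquartile range (IQR) beyond the quartiles. The sketch below applies that rule, assuming it is the one behind the box plot (the text does not say). The `distances` list is also an assumption: it holds only the distances listed in the ordered data for Q2. The helper name `iqr_outliers` is hypothetical, and the quartiles use the "inclusive" convention of Python's `statistics.quantiles`; other quartile conventions can shift the fences.

```python
from statistics import quantiles

# Distances (whole miles) taken from the ordered data in Q2.
# Assumption: only the values listed there are used.
distances = [10, 11, 12, 13, 14, 15, 17, 18, 20, 20, 24, 31, 33, 43]


def iqr_outliers(values, k=1.5):
    """Return the values lying outside the k * IQR fences."""
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - k * iqr
    upper_fence = q3 + k * iqr
    return [v for v in values if v < lower_fence or v > upper_fence]


outliers = iqr_outliers(distances)
print(outliers)  # [43]
```

Under these assumptions only 43 falls outside the fences, which is consistent with the conclusion above.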

Q5: 95% and 99% CI using a sample of 4 

To calculate the CIs, the critical values from the z-score table were used: 1.96 for 95% and 2.58 for 99%.

| Confidence level | Lower bound | Upper bound |
|---|---|---|
| 95% CI | 6.43 | 27.07 |
| 99% CI | 3.19 | 30.31 |

As shown above, the 99% CI is wider than the 95% CI: the 99% CI runs from a lower bound of 3.19 to an upper bound of 30.31, while the 95% CI runs from 6.43 to 27.07.
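The interval calculation can be sketched as mean ± z × SD/√n. The text does not state the sample mean or SD, so the example below backs them out from the reported 95% bounds; treat those two values as assumptions rather than the author's figures. The function name `z_interval` is hypothetical.

```python
from math import sqrt


def z_interval(mean, sd, n, z):
    """Return (lower, upper) for mean +/- z * sd / sqrt(n)."""
    margin = z * sd / sqrt(n)
    return mean - margin, mean + margin


n = 4
lower_95, upper_95 = 6.43, 27.07  # reported 95% bounds

# Assumption: mean and SD are recovered from the reported 95% interval,
# because the text does not give them directly.
mean = (lower_95 + upper_95) / 2
sd = (upper_95 - lower_95) / 2 * sqrt(n) / 1.96

ci_95 = z_interval(mean, sd, n, 1.96)
ci_99 = z_interval(mean, sd, n, 2.58)

print("95% CI:", tuple(round(b, 2) for b in ci_95))
print("99% CI:", tuple(round(b, 2) for b in ci_99))
```

The 99% interval computed this way lands within a few hundredths of the reported 3.19 and 30.31; the small gap is what one would expect if the original calculation used a less rounded critical value such as 2.576.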

Q6: 95% CI for a sample of 7, and 95% CI for the same mean and SD with a sample size of 20 

| 95% CI | Lower bound | Upper bound |
|---|---|---|
| n = 7 | 7.06 | 21.80 |
| n = 20 | 10.07 | 18.79 |

The CI for the sample of 7 is wider than the CI for the sample of 20, even though both are based on the same mean and standard deviation. Increasing the sample size therefore narrows the confidence interval. With n = 20, the upper bound falls to 18.79 (from 21.80) and the lower bound rises to 10.07 (from 7.06).
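Because the margin of error is z × SD/√n, holding the mean and SD fixed means the interval width should shrink in proportion to 1/√n. The short check below uses only the bounds reported for Q6; the variable names are illustrative.

```python
from math import isclose, sqrt

# Reported 95% bounds for each sample size.
lower_7, upper_7 = 7.06, 21.80
lower_20, upper_20 = 10.07, 18.79

width_7 = upper_7 - lower_7
width_20 = upper_20 - lower_20

# Both intervals should share the same centre (the common sample mean).
mid_7 = (lower_7 + upper_7) / 2
mid_20 = (lower_20 + upper_20) / 2

# With the same SD and critical value, width_20 / width_7 = sqrt(7 / 20).
observed_ratio = width_20 / width_7
expected_ratio = sqrt(7 / 20)

print(f"midpoints: {mid_7:.2f} and {mid_20:.2f}")
print(f"width ratio: observed {observed_ratio:.3f}, expected {expected_ratio:.3f}")
print("consistent:", isclose(observed_ratio, expected_ratio, abs_tol=0.005))
```

The two midpoints coincide and the observed width ratio matches √(7/20) to about three decimals, so the reported bounds are consistent with the same mean and SD at both sample sizes.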

Reference

StudyBounty. (2023, September 16). How to Identify Potential Outliers.
https://studybounty.com/how-to-identify-potential-outliers-coursework
