BIOSTATS 540 – Fall 2015 Introductory Biostatistics Page 1 of 10

Unit 9 – Regression and Correlation
Homework #14 (Unit 9 – Regression and Correlation)

Country     X = cigarette consumption     Y = lung cancer cases
            (per capita in 1930)          (per 100,000 in 1950)

            1300                          20
            1100                          46
            1100                          35
            510                           25
Canada      500                           15
Holland     490                           24
            480                           18
            380                           17
            300                           11
            250                           9
Iceland     230                           6

sol_regression.docx

From here, you can copy and paste your data into an appropriate application.

You will be brought to an empty plot with an empty data set just below.

Insert a comma between each X and Y as shown below. Be sure you scroll down and edit every row!

Scatterplot Using Stata

. * Initialize data set
. generate xcigs=.
. generate ycancer=.

. tabstat xcigs, statistics(min max)

    variable |       min       max
-------------+--------------------
       xcigs |       230      1300
----------------------------------

. tabstat ycancer, statistics(min max)
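The same range check can be sketched outside Stata. The Python below is an illustration only, not part of the original solution: it re-enters the eleven (X, Y) pairs from the data table and reports the minimum and maximum of each variable, which is what `tabstat ..., statistics(min max)` is being used for here (for example, to choose axis limits for the scatterplot). The helper name `min_max` is hypothetical; the variable names follow the Stata log.

```python
# Data from the homework table: X = cigarette consumption per capita in 1930,
# Y = lung cancer cases per 100,000 in 1950 (11 countries).
xcigs = [1300, 1100, 1100, 510, 500, 490, 480, 380, 300, 250, 230]
ycancer = [20, 46, 35, 25, 15, 24, 18, 17, 11, 9, 6]


def min_max(values):
    """Return (minimum, maximum) of a list of numbers."""
    return min(values), max(values)


x_range = min_max(xcigs)
y_range = min_max(ycancer)

print("variable |  min   max")
print(f"xcigs    | {x_range[0]:>4} {x_range[1]:>5}")
print(f"ycancer  | {y_range[0]:>4} {y_range[1]:>5}")
```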

2. Interpret the graph you produced in exercise #1 with respect to form, direction, and strength.

This scatterplot suggests a positive linear relationship between cigarette consumption (X) and lung cancer cases (Y): higher cigarette consumption is associated with higher numbers of cancer cases. There are no outliers. However, most of the data fall in the lower left quadrant of the plot, so the full nature and strength of the association may be difficult to assess.
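One way to put a number on the "strength" described above is the Pearson correlation coefficient. The sketch below is an addition for illustration, not something the original solution computes at this step; it assumes the eleven data pairs from the homework table, and the function name `pearson_r` is hypothetical.

```python
import math

# Data from the homework table (X = cigarettes per capita in 1930,
# Y = lung cancer cases per 100,000 in 1950).
xcigs = [1300, 1100, 1100, 510, 500, 490, 480, 380, 300, 250, 230]
ycancer = [20, 46, 35, 25, 15, 24, 18, 17, 11, 9, 6]


def pearson_r(x, y):
    """Pearson correlation: S_XY / sqrt(S_XX * S_YY)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_yy = sum((yi - y_bar) ** 2 for yi in y)
    return s_xy / math.sqrt(s_xx * s_yy)


r = pearson_r(xcigs, ycancer)
print(f"r = {r:.4f}")
```

For these data r comes out at roughly 0.74, which is consistent with a positive but not especially tight linear association.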

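$S_{XY} = \sum_{i=1}^{11} (X_i - \bar{X})(Y_i - \bar{Y}) = 32718.18182$

$S_{XX} = \sum_{i=1}^{11} (X_i - \bar{X})^2 = 1432254.545$

$S_{YY} = \sum_{i=1}^{11} (Y_i - \bar{Y})^2 = 1374.727273$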
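These three sums can be checked directly from the raw data. The following is a minimal sketch, assuming the eleven (X, Y) pairs from the homework table; the function name `centered_sums` is hypothetical and stands in for whatever Excel or Stata steps were actually used.

```python
# Data from the homework table (X = cigarettes per capita in 1930,
# Y = lung cancer cases per 100,000 in 1950).
xcigs = [1300, 1100, 1100, 510, 500, 490, 480, 380, 300, 250, 230]
ycancer = [20, 46, 35, 25, 15, 24, 18, 17, 11, 9, 6]


def centered_sums(x, y):
    """Return (S_XY, S_XX, S_YY): sums of cross-products and squares
    of deviations from the sample means."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_yy = sum((yi - y_bar) ** 2 for yi in y)
    return s_xy, s_xx, s_yy


s_xy, s_xx, s_yy = centered_sums(xcigs, ycancer)
print(f"S_XY = {s_xy:.5f}")
print(f"S_XX = {s_xx:.3f}")
print(f"S_YY = {s_yy:.6f}")
```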

4. Now you have what you need to solve for the least squares estimate of the slope and intercept.

By hand, or using Excel, or using any software you like, calculate the values of the following:

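a) Estimated slope, $\hat{\beta}_1 = \dfrac{\sum_{i=1}^{11} (X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{11} (X_i - \bar{X})^2} = \dfrac{S_{XY}}{S_{XX}} = 32718.18182 / 1432254.545 = 0.0228$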

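b) Estimated intercept, $\hat{\beta}_0 = \bar{Y} - \hat{\beta}_1 \bar{X} = 20.5454545 - (\hat{\beta}_1 \times 603.6363636) = 6.756086989$, where the slope $\hat{\beta}_1 \approx 0.0228438$ is carried at full precision. Substituting the rounded value 0.0228 would give about 6.7825 instead.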
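The slope and intercept arithmetic can be reproduced in a few lines. This sketch starts from the summary values quoted in the solution ($S_{XY}$, $S_{XX}$, $\bar{X}$, $\bar{Y}$) rather than from the raw data, and it also shows how sensitive the intercept is to rounding the slope first. The variable names are illustrative, not from the original.

```python
# Summary quantities quoted in the solution.
s_xy = 32718.18182      # sum of cross-products of deviations
s_xx = 1432254.545      # sum of squared deviations of X
x_bar = 603.6363636     # mean of X
y_bar = 20.5454545      # mean of Y

# Least squares estimates.
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar

# Same formula, but with the slope rounded to four decimals first.
intercept_rounded = y_bar - round(slope, 4) * x_bar

print(f"slope                       = {slope:.7f}")
print(f"intercept (full precision)  = {intercept:.6f}")
print(f"intercept (slope = 0.0228)  = {intercept_rounded:.6f}")
```

Carrying the slope at full precision reproduces the intercept of about 6.756, while rounding the slope to 0.0228 first shifts the result to about 6.78, so intermediate values are best left unrounded.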


A one-unit increase in X = per capita consumption of cigarettes (in 1930) is estimated to be associated with an increase of about 0.02 in Y = the number of lung cancer cases per 100,000 in 1950.

6. By hand, or using Excel, or using any software you like, calculate the values of the following sums of squares that are in the analysis of variance:

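$SST = \sum_{i=1}^{11} (Y_i - \bar{Y})^2 = 1374.727273$

hint – This is the same as SYY in #3

$SSR = \sum_{i=1}^{11} (\hat{Y}_i - \bar{Y})^2 = \hat{\beta}_1^2 \sum_{i=1}^{11} (X_i - \bar{X})^2$

$SSE = \sum_{i=1}^{11} (Y_i - \hat{Y}_i)^2$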
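As a cross-check on these definitions, the three sums of squares can be computed from the fitted values themselves. The sketch below is an illustration, assuming the eleven data pairs from the homework table and the least squares fit described earlier; it also verifies the identities SST = SSR + SSE and SSR = $\hat{\beta}_1^2 S_{XX}$ numerically. Variable names are illustrative.

```python
# Data from the homework table (X = cigarettes per capita in 1930,
# Y = lung cancer cases per 100,000 in 1950).
xcigs = [1300, 1100, 1100, 510, 500, 490, 480, 380, 300, 250, 230]
ycancer = [20, 46, 35, 25, 15, 24, 18, 17, 11, 9, 6]

n = len(xcigs)
x_bar = sum(xcigs) / n
y_bar = sum(ycancer) / n

# Least squares fit.
s_xx = sum((xi - x_bar) ** 2 for xi in xcigs)
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xcigs, ycancer))
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar
fitted = [intercept + slope * xi for xi in xcigs]

# Sums of squares for the analysis of variance.
sst = sum((yi - y_bar) ** 2 for yi in ycancer)                # total, corrected
ssr = sum((fi - y_bar) ** 2 for fi in fitted)                 # regression
sse = sum((yi - fi) ** 2 for yi, fi in zip(ycancer, fitted))  # error

print(f"SST = {sst:.6f}")
print(f"SSR = {ssr:.4f}")
print(f"SSE = {sse:.4f}")
print(f"SSR + SSE = {ssr + sse:.6f}")
```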

Excel Worksheet:

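Source              df            Sum of Squares                                               Mean Square    F-Statistic
Regression          1             SSR = $\sum_{i=1}^{n} (\hat{Y}_i - \bar{Y})^2$ = 747.4086
Error               (n-2) = 9     SSE = $\sum_{i=1}^{n} (Y_i - \hat{Y}_i)^2$
Total, corrected    (n-1) = 10    SST = $\sum_{i=1}^{n} (Y_i - \bar{Y})^2$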
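The remaining entries of an analysis of variance table follow from the sums of squares by division. The sketch below is an illustration under stated assumptions: it takes SST = 1374.727273 and SSR = 747.4086 as quoted in the solution, obtains SSE by subtraction, and uses the standard simple linear regression degrees of freedom (1 for regression, n-2 for error). The computed mean squares and F statistic are derived here and are not copied from the original worksheet.

```python
# Values quoted in the solution.
n = 11
sst = 1374.727273   # total, corrected sum of squares
ssr = 747.4086      # regression sum of squares

# Error sum of squares by subtraction (SST = SSR + SSE).
sse = sst - ssr

# Degrees of freedom for simple linear regression.
df_regression = 1
df_error = n - 2
df_total = n - 1

# Mean squares and overall F statistic.
msr = ssr / df_regression
mse = sse / df_error
f_stat = msr / mse

print(f"{'Source':<18}{'df':>4}{'SS':>14}{'MS':>12}{'F':>9}")
print(f"{'Regression':<18}{df_regression:>4}{ssr:>14.4f}{msr:>12.4f}{f_stat:>9.2f}")
print(f"{'Error':<18}{df_error:>4}{sse:>14.4f}{mse:>12.4f}")
print(f"{'Total, corrected':<18}{df_total:>4}{sst:>14.4f}")
```

The resulting F statistic is compared against an F distribution with 1 and 9 degrees of freedom.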


http://epitools.ausvet.com.au/content.php?page=f_dist

Uploaded by : Miraan Bahl
