In an area of Russia, records were kept on the relationship between the rainfall (in inches) and the yield of wheat (bushels per acre).

 

Rainfall (inches)

Yield(Bushels per acre)

11

42.3

14.6

52

13.7

61.3

23.2

73.9

28.3

92.1

10.7

44.3

9.4

44.7

23.7

77.2

24.2

80.3

 

 

 

1. Calculate an LSRE that predicts the yield of wheat from the amount of rainfall

2. Use your regression equation to predict the amount of wheat produced for a rainfall amount that is an example of extrapolation. Explain why you should not be confident of the result.

3. Calculate the correlation coefficient and the coefficient of determination and explain the meaning of each in the context of this situation.

4. Test to see, at the 5% level of significance, if the correlation is statistically significant

in Statistics Answers by

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
Anti-spam verification:
To avoid this verification in future, please log in or register.

1 Answer

 
The linear equation takes the form Y=A+BX.

The correlation coefficient, r, is 0.9778 approx, which implies a very good correlation between rainfall and yield. When r=1, the correlation is direct and perfect, so 0.9778, being close to 1, implies a good correlation. The coefficient of determination=r²=0.9561 approx and it is a measure of how good the statistical model is (the linear equation).

A=18.69, B=2.52 (approx). The derivation of these is explained below. The picture below shows the regression line in red as the best fit for the given data, represented by points on the graph.

 

Although the red line is the best fit it clearly does not pass through most of the points, so extrapolation cannot accurately predict yield, given a specific rainfall. It can only be used as a general indication of the yield. For example, no rainfall would still produce a yield according to the graph, whereas, in fact, there may be little or no yield without rain.

CORRELATION COEFFICIENT

First, we calculate the means of the X and Y values: mean X=158.8/9, mean Y=567.9/9.

Then we subtract mean X from each X value (call this difference p) and mean Y from each Y value (q). For each pair of differences, we find the product (pq). And we calculate the squares of each member of the pair (p² and q²). The next step is to sum the pair products (∑pq) and sum both sets of squares (∑p² and ∑q²). To save space and time, these calculations are omitted.

r=∑pq/√(∑p²×∑q²). Other versions of the formula are available.

LINEAR REGRESSION EQUATION

Using the table total lines we can calculate A and B:

B=(n∑XY-(∑X)(∑Y))/(n∑X²-(∑X)²) where n=9. Other versions of the formula are available.

B=(9(11050.17)-(158.8)(567.9))/(9(3211.16)-158.8²),

B=(99451.53-90182.52)/(28900.44-25217.44)=9269.01/3683=2.5167 approx.

A=(1/n)(∑Y-B∑X)=(1/9)(567.9-158.8B)=18.6942 approx.

Y=18.69+2.52X is the equation of best fit.

STATISTICAL SIGNIFICANCE

We use a t test where the test statistic t=r√((n-2)/(1-r²))=0.9778√(7/(1-0.9561))=12.33 approx.

Using the body of the t table to find the probability (p-value) associated with this t value at 7 DOF, we find that it is beyond 0.001 (0.1%) so it’s highly significant, certainly better than 5% significance. The model for correlating rainfall and wheat yield is highly significant.

by Top Rated User (1.2m points)

Related questions

1 answer
asked Aug 17, 2020 in Statistics Answers by anonymous | 1.3k views
1 answer
1 answer
2 answers
1 answer
asked Jan 26, 2014 in Other Math Topics by JESSA MAE CLAUD | 934 views
Welcome to MathHomeworkAnswers.org, where students, teachers and math enthusiasts can ask and answer any math question. Get help and answers to any math problem including algebra, trigonometry, geometry, calculus, trigonometry, fractions, solving expression, simplifying expressions and more. Get answers to math questions. Help is always 100% free!
87,516 questions
100,279 answers
2,420 comments
732,181 users