# An Analysis Report on Furphy Project Sample Assignment

**An Analysis Report on**

Furphy Project

**Introduction:**

Furphy Beer, is a Geelong (Australia) imagined littler scale refinery association with under fifteen years of contribution in mixing ale. Despite its limited operations in Melbourne and regional Victoria, the association has experienced a fast improvement in its creation and arrangements in the current years. In 2016, the association offered an explanation to fabricate its planning capacity to 3 million liters for every year to deal with the growing interest of its pale ale mix.

Furphy sells beer to pubs, bars, restaurants and bottle shops. Buyers buy the beer directly or buy from the sales representative. Because of the high amount of success rate Furphy is estimating a change in business trend in next years. By and by more than ever, Furphy organization needs to ensure a solid association with its client base. In development, they are expecting to set up a formal philosophy to gauge their beer production. This would help Furphy unequivocally wander future free market action and alter era needs in like way. To do an analysis process beautiful data has conduct an online survey for collecting information which can affect the sales of beer.

**Main Body:**

**Task 1:**

The mean value for intention to repurchase furphy beer is 7.665** .** The maximum value for intention to repurchase beer is 9.9/10 and minimum value is 4.3/10. The value of standard deviation is 0.893 which shows the little variation in intention to repurchase furphy beer. The value for Q1 is 7.1 and for Q3 it is 8.2. The value of inter quartile range is 1.1. The lower fence value and upper fence value for intention to repurchase furphy beer are 5.45 and 9.85. This shows that outliers exist in the collected database.

We are __95% confident__ that the __true average value of intention to repurchase furphy beer__ is from __7.54 to 7.79__.

From the available 200 records, only 101 clients recommend a furphy beer to others.

We are __95% confident__ that the __true average percentage of client would recommend furphy beer to others__ is from __43.57% to 57.43%__.

**Task 2.1:**

The correlation statistical technique is used for identifying the key variables which may influence the intention to repurchase furphy beer. The loyalty, dist_channel, quality, brand image, shipping speed and shipping cost has moderate relationship with the intention to repurchase furphy beer. And region, SM_Presence, order_fulfillment advert, comp_pricing have week relationship with intention to repurchase furphy beer. Cust_type and flex_price variables have very week relationship with dependent variable so that are not included in modle building process.

SM_Presence and Brand image has multicollinearity. Here in this analysis correlation value greater than 0.7 is consider for the multicollinearity. So as per the relationship with dependent variable SM_Presence is removed from the modle building process with the use of statistical technique. Order_fulfillment has multicollinearity with shipping speed and shipping cost. So as per the relationship with dependent variable order_fulfillment is removed from the modle building process with the use of statistical technique. Shipping speed has multicollinearity with shipping cost. So as per the relationship with dependent variable shipping cost is removed from the modle building process with the use of statistical technique.

The variable which have week relationship with dependent variable also included in model building process to check whether it influence intention to repurchase furphy beer or not.

So finally following are the potential variables which are used for the modle building process for predicting intention to repurchase furphy beer: __Loyalty, region, dist_channel, quality, advert, brand_image, comp_pricing, shipping_speed.__

**Task 2.2:**

The model building process to predict intension to repurchase furphy beer gone through four iterations. After completing four iteration the model is finalized.

After first iteration, there are four variables which have p-value greater than 0.05. In next iteration comp_pricing was removed for the modle building process because it has 0.706 p-value. After second iteration, there are three variables which have p-value greater than 0.05. In next iteration Advert was removed for the modle building process because it has 0.648 p-value. After third iteration, there are two variables which have p-value greater than 0.05. In next iteration region was removed for the modle building process because it has 0.361 p-value. After fourth iteration, there are zero variables which have p-value greater than 0.05. So, this is the final model to predict intention to repurchase furphy beer. Detail knowledge of this modle is as follows:

The final model has total five independent variables: ** Loyalty, Dist_channel, quality, brand_image, shipping_speed**. As per the F-test we can say that this

**and has some predictive power.**

__modle is significant__**percent of the variation in intention to repurchase furphy beer is explained by the variation in independent variables. Standard error for this model is 0.669. By analyzing residuals, it is clear that this model does not violate any key assumptions. The estimated regression equation for this model is as follows:**

__43.8%__**Repurchase_intension = 3.997 + (loyalty*0.280) + (dist_channel*0.227) + (quality*0.157) + (brand_image*0.203) + (Shipping_speed*0.182)**

We can say that if we do not consider independent variables then on average intension of repurchase furphy beer is 3.997.

Assuming that loyalty’s value is increased by 1 unit and no change in other independent variables can increase intension of repurchase furphy beer by 0.280.

Assuming that dist_channel’s value is directly and no change in other independent variables can increase intension of repurchase furphy beer by 0.227.

Assuming that quality’s value is increased by 1 unit and no change in other independent variables can increase intension of repurchase furphy beer by 0.157.

Assuming that brand_image’s value is increased by 1 unit and no change in other independent variables can increase intension of repurchase furphy beer by 0.203.

Assuming that shippig_speed’s value is increased by 1 unit and no change in other independent variables can increase intension of repurchase furphy beer by 0.182.

So, these are the final details of the model for predicting the intention to repurchase furphy beer.

**Task 2.3:**

Todd believes that if people have good perception of brand image then relationship of quality with intention to repurchase furphy beer would be stronger. To analyze this new modle is finalized where quality is independent variable and brand image is interacting variable. The interaction term is quality*brand_image. Detail knowledge of this modle is as follows:

All variables in the final model has p-value less than 0.05. So, we can say that at ** 5% significance** level there is

**between quality and brand image in explaining the intention to repurchase furphy beer.**

__moderating effect__As per the F-test we can say that this ** modle is significant** and has some predictive power.

**percent of the variation in intention to repurchase furphy beer is explained by the variation in independent variables. Standard error for this model is 0.723. By analyzing residuals, it is clear that this model does not violate any key assumptions.**

__34.5%__At ** low level of quality**, for

**intention to repurchase furphy beer is**

__high brand image__**than low brand image. Quality**

__high__**affect the intention to repurchase furphy for both high and low brand image. At high level of quality, intention to repurchase furphy is more for high brand image than the low brand image. Overall brand image interacts with quality and intention to repurchase furphy such that the**

__positively__**than low brand image.**

__relationship is significantly stronger for high brand image__The estimated regression equation for this model is as follows:

**Repurchase intension = 0.501 + (Quality*0.691) + (Brand_image*0.864) -(Quality*brand_image*0.069)**

So, these are the final details of the interaction model for predicting the intention to repurchase furphy beer.

**Task 3.1:**

To predict the likelihood of recommending furphy to others model building process gone through total 10 iterations. In first iteration, all independent variables are included in the model building process. In each iteration, as per the p-value and statistical significance variables were eliminated. And the in the end final model to predict the likelihood of recommending furphy to others has total four variables: ** Dist_channel, Quality, Brand_image, Shipping_speed.** The detail knowledge of this modle is as follows:

In the sample of 200 records 101 people would recommend furphy beer to others. Out of 101 records this model correctly classified 78 records as successful predictors. Overall classification accuracy is 76% for the model. As per the ** proportion of chance criteria** and

**we can say that this model is**

__rule of thumb__

__practically significant.__According to Chi-Sq test model is significant and has ** prediction power.** According to the

**method we can say that**

__logistic pseudo R-square__**variation in the dependent variable can be explained by this model. According to the**

__31.19%__**) method we can say that**

__R-square(CS__**variation in the dependent variable can be explained by this model. According to the**

__35.10%__**method we can say that**

__R-square(N)__**variation in the dependent variable can be explained by this model.**

__46.81%__The direction between all independent variable of this model and dependent variable is positive. One unit increase in Dist_channel can increase odds of recommend furphy beer to others by 163%. One unit increase in quality can increase odds of recommend furphy beer to others by 92%. One unit increase in Brand_image can increase odds of recommend furphy beer to others by 86%. One unit increase in shipping_speed can increase odds of recommend furphy beer to others by 219%. So, by this this detail we can say that model is __statistically significant.__

The logistic regression equation for this model is as follows:

__Recommend = -13.278 + (dist_channel*0.968) + (quality*0.654) + (brand_image*0.621) + (shipping_speed*1.159)__

**Task 3.2:**

For this task four variables are used for visualizing and interpreting predicted probabilities. Those variables are ** Dist_channel, quality, brand_image and shipping_speed**. By analyzing this model, we can say that model

**. The logistic regression equation for this model is as follows:**

__is practically and statistically significant____Recommend = -13.278 + (dist_channel*0.968) + (quality*0.654) + (brand_image*0.621) + (shipping_speed*1.159)__

Here value of shipping speed is neutral (5). Predicted probability is calculated for quality value from 1 to 10 and 1,5 and 10 for brand image which shows negative neutral and positive brand image. These calculations are done for people who buys beer through sales representative and for those who buys directly. So, interpretation of the result is as follows:

If ** quality is high** and

**then**

__brand image is positive__**of recommend furphy beer to others is**

__probability__**for both who buys directly and who buys through sales representative.**

__very high__If ** quality is high** but

**then**

__brand image is negative__**of recommend furphy beer to others is**

__probability__**for both who buys directly and who buys through sales representative.**

__very low__If ** quality is low** but

**then**

__brand image is positive__**of recommend furphy beer to others is**

__probability__**for both who buys directly and who buys through sales representative.**

__low__If ** brand image is neutral** then

**of recommend furphy beer to others is**

__probability__**for both who buys directly and who buys through sales representative.**

__increase as quality is increase__The ** probability** of recommend furphy beer to others is

**for those who**

__greater__**than for those**

__buys directly__**.**

__who buys through sales representative__So, from this result we can say that Todd’s assumptions are correct that brand image and quality define the success of furphy beer. So, if efforts and money are put in the direction of improvement of quality and brand image then it will increase the probability of recommend furphy beer to others.

**Task 4:**

The detail knowledge of a time series model to forecast furphy production of pale ale for four quarters of 2018.

By observing line chart which is included in appendices we can say this is a ** non-stationary time series **because it has

**This time series has three components like**

__a upward trend.__**,**

__seasonal component__**and**

__trend component__

__random component.__** To smooth out the seasonality** in the time series

**method is used because data has cycles in four quarters. Then**

__centered 4 moving average__**are calculated for each quarter. Then by multiplying index value with observed data**

__indices__**process take place in time series modle building process. The equation which is used to calculate trend in the given data is as follows:**

__deseasonalised____Y = 17.095*x + 1097.5__

Then forecast value is calculated by multiplying trend value with the index values. So, the forecast value for next four quarters are as follows:

__2018 Q1: Forecast Value = 1728.061 (litters)__

__2018 Q2: Forecast Value = 2128.077 (litters)__

__2018 Q3: Forecast Value = 1601.062 (litters)__

__2018 Q4: Forecast Value = 1767.855 (litters)__

The ** mean absolute percentage error** for this multiplicative time series modle is

**. This shows that there is this much amount of error in the forecast values.**

__3.7%__So, these are the final details of the time series model to forecast furphy production of pale ale for four quarters of 2018.

**Conclusion:**

By analyzing all the aspects of multiple regression model and logistic regression model it is clear that intension to repurchase beer and recommendation of furphy beer to others both aspects are highly reliable on the brand image and quality. So, if quality and brand image will improve then there is no doubt that sales of furphy beer will increase. Distribution channel and shipping speed also affect the dependent variables. So mainly these four are the variable which mostly influence the dependent variables.

**Appendices:**

Task 1: Summery measures

Task1: Confidence Interval

Task 1: Histogram

Task 1: Pivot table

Task 1: Confidence interval

Task 1: Bar chart

Task 2.1: Correlation matrix

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.1: Scatter plot

Task 2.2: Residual plot

Task 2.2: Residual plot

Task 2.2: Residual plot

Task 2.2: Residual plot

Task 2.2: Residual plot

Task 2.2: Normal Probability plot

Task 2.3: Residual plot

Task 2.3: Residual plot

Task 2.3: Residual plot

Task 2.3: Normal Probability plot

Task 2.3: Interaction Chart

Task 3.1: ROC Curve

Task 3.2: Probability line chart

Task 3.2: Probability line chart

Task 4: Time series line chart