Chat with us, powered by LiveChat Build a Linear Regression model on Boston Housing dataset, “Boston.csv” to predict median house values “MEDV” from the other variables - EssayAbode

Build a Linear Regression model on Boston Housing dataset, “Boston.csv” to predict median house values “MEDV” from the other variables

Case study #3 – PCA applied to Linear Regression – 20 points

Objective:

Build a Linear Regression model on Boston Housing dataset, “Boston.csv” to predict median house values “MEDV” from the other variables. Then build another model after carrying out Principal Component Analysis (PCA) and compare the performance of both models to draw conclusions about the efficacy of PCA.

Tasks:

Exploratory and preparatory 4 points

· Explore the data set and carry out EDA

· Separate target and predictor variables and split the data 70% – 30% (use random_state=42)

Build Linear regression model on original data and evaluate 4 points

· Build a linear regression model on the train data

· Calculate evaluation metrics, R-square, RMSE, MAE and MAPE on train and test data

Carrying out PCA 8 points

· Normalize (scale) the original data (only predictor variables)

· Carry out PCA and examine cumulative variance explained by PCs

· Select number of PCs that explain at least 85% variance

· Extract the chosen number of PCs and fit on scaled data (use random_state=42)

Linear regression on scaled data and comparison 4 points

· Construct another linear regression model – on PCA transformed data

· Evaluate performance and compare both models

· Conclusion about efficacy of PCA

 

Guidelines for submitting:

· Annotate your Jupyter Notebook, to explain your procedures, comments and conclusions

· After completion, run the Jupyter notebook from start to finish

· Download the notebook in HTML format and upload on CANVAS in assignment space.

Related Tags

Academic APA Assignment Business Capstone College Conclusion Course Day Discussion Double Spaced Essay English Finance General Graduate History Information Justify Literature Management Market Masters Math Minimum MLA Nursing Organizational Outline Pages Paper Presentation Questions Questionnaire Reference Response Response School Subject Slides Sources Student Support Times New Roman Title Topics Word Write Writing