• Home
  • Blog
  • Regularization models using Ridge and Lasso

Regularization models using Ridge and Lasso

0 comments

Overview and Rationale

In order to consolidate your
theoretical knowledge into technique and skills with practical and
applicational value, you will use the glmnet() package in R to implement
Ridge and LASSO functions to build linear and logistic models through
Ridge and LASSO regression over values of the regularization parameter
lambda.

Course Outcomes

This assignment is directly linked to the following key learning outcomes from the course syllabus:

Assignment Summary

Use the College dataset (https://rdrr.io/cran/ISLR/man/College.html)
from the ISLR library to build regularization models by using Ridge and
Lasso (least absolute shrinkage and selection operator). Predict
Grad.Rate for all models.

  1. Split the data into a train and
    test set – refer to the Feature_Selection_R.pdf document for information
    on how to split a dataset.

Ridge Regression

  1. Use the cv.glmnet function to estimate the lambda.min and lambda.1se values. Compare and discuss the values.
  2. Plot the results from the glmnet function provide an interpretation. What does this plot tell us?
  3. Fit a Ridge regression model against the training set and report on the coefficients. Is there anything interesting?
  4. Determine
    the performance of the fit model against the training set by
    calculating the root mean square error (RMSE). sqrt(mean((actual –
    predicted)^2)).
  5. Determine the performance of the fit model
    against the test set by calculating the root mean square error (RMSE).
    Is your model overfit?

LASSO

  1. Use the cv.glmnet function to estimate the lambda.min and lambda.1se values. Compare and discuss the values.
  2. Plot the results from the glmnet function provide an interpretation. What does this plot tell us?
  3. Fit
    a LASSO regression model against the training set and report on the
    coefficients. Do any coefficients reduce to zero? If so, which ones?
  4. Determine
    the performance of the fit model against the training set by
    calculating the root mean square error (RMSE). sqrt(mean((actual –
    predicted)^2)).
  5. Determine the performance of the fit model
    against the test set by calculating the root mean square error (RMSE).
    Is your model overfit?

Comparison

  1. Which model performed better and why? Is that what you expected?
  2. Refer
    to the Intermediate_Analytics_Feature_Selection_R.pdf document for how
    to perform stepwise selection and then fit a model. Did this model
    perform better or as well as Ridge regression or LASSO? Which method do
    you prefer and why?

Report

Refer to
the attached rubric for more details on the report. The report should
contain a well written cover/title page, introduction, body, conclusion,
and references. It must follow APA format and have at least 1000 words
(excluding title page and references page. All R code used for your
report should be included in an appendix at the end of the report.

Graphs,
figures, charts, and tables are very useful visual effects to
communicate your results and impress your readers. However, such items
should not be included in the report unless they are well described and
interpreted. Please use subtitles to make your assignment more reader
friendly as well.

Format & Guidelines

The report should follow the following format:

  • Title page
  • Introduction
  • Analysis
  • Conclusion/Interpretations
  • References

About the Author

Follow me


{"email":"Email address invalid","url":"Website address invalid","required":"Required field missing"}