Home
Blog
Write a report on a regression analysis of observational data, available in the OpenToronto Data Portal.

Write a report on a regression analysis of observational data, available in the OpenToronto Data Portal.

Daniel Kevins

0 comments

Go to this link to access some starter code (Assignment2.Rmd) and the Assignment2-Instructions on JupyterHub:

https://jupyter.utoronto.ca/hub/user-redirect/git-pull?repo=https%3A%2F%2Fgithub.com%2FSamanthaJoCaetano%2FSTA304-F21-Assignment2.git&urlpath=rstudio%2F&branch=masterLinks to an external site.

I can also provide the rmd file:

—

title: “Put an informative title in here”

author: “ADD YOUR NAME HERE – STUDENT NUMBER”

subtitle: Assignment 2

output:

pdf_document: default

—

“`{r, include=FALSE}

knitr::opts_chunk$set(warning = FALSE, message = FALSE)

“`

# Introduction

<Optional: You can also include a description of each section of this report as a last paragraph.>

# Data

## Data Collection Process

## Data Summary

“`{r, include = FALSE}

# Here you can load in and clean the data (you may need to do the cleaning in a separate R script).

# You may need additional chunks, in case you want to include some of the cleaning output.

# Notice that the include=FALSE means that the code, and its resulting output, in this chunk will not appear in the pdf.

“`

“`{r, include=FALSE}

# Use this to calculate some summary measures.

“`

“`{r, echo = TRUE}

# Use this to create some plots.

“`

All analysis for this report was programmed using `R version 4.1.1`.

# Methods

<Note: One dollar sign outside the LaTeX will leave the math notation in the line of text. Whereas, two dollar signs outside the LaTeX will put the math notation on its own line outside of the text. See examples below.>

**Example 1:**

The simple linear regression model is $y=beta_0 + beta_1x + epsilon$. Where $beta_0$ represents the intercept of the regression line….

**Example 2:**

The simple linear regression model is: $$y=beta_0 + beta_1x + epsilon$$. Where $beta_0$ represents the intercept of the regression line…

# Results

“`{r, include = FALSE}

# Here you can load in and clean the data.

# You may need additional chunks.

# I would recommend not including any of the Cleaning process output in the pdf – hence the “include = FALSE” at the start of the chunk.

“`

“`{r, include = FALSE}

# Here you can run a linear regression on your two variables of interest.

#lm(y~x, data = your_data) # This is for a simple linear regression.

“`

| $hat{beta}_0$ | 1.000 |

|————— | ——–|

| $hat{beta}_1$ | 2.000 |

“`{r, echo=FALSE}

# Use this to calculate generate a scatterplot of your variables if desired.

# You can use abline to overlay the scatterplot with the regression line (again, if desired).

“`

All analysis for this report was programmed using `R version 4.1.1`. I used the `glm()` function in base `R` to derive the estimates of a frquentist logistic regression in this section [4].

# Conclusions

## Weaknesses

## Next Steps

## Discussion

newpage

# Bibliography

1. Grolemund, G. (2014, July 16) *Introduction to R Markdown*. RStudio. [https://rmarkdown.rstudio.com/articles_intro.html](https://rmarkdown.rstudio.com/articles_intro.html). (Last Accessed: October 12, 2021)

2. Dekking, F. M., et al. (2005) *A Modern Introduction to Probability and Statistics: Understanding why and how.* Springer Science & Business Media.

3. Allaire, J.J., et. el. *References: Introduction to R Markdown*. RStudio. [https://rmarkdown.rstudio.com/docs/](https://rmarkdown.rstudio.com/docs/). (Last Accessed: October 12, 2021)

4. Peter Dalgaard. (2008) *Introductory Statistics with R, 2nd edition*.

About the Author

Daniel Kevins

Follow me