Each row represents a film produced by RKO Studios from 1929-1941: Column A shows the (sometimes abbreviated) name of the film; Column B shows whether the movie was a re-release (1) or not (0); Column C shows total production cost (in $1000s); Column D shows the domestic (USA) revenue of the film (in $1000s); Column E shows the foreign revenue of the film (in $1000s); Column F shows the Total Revenue (Cols D+E); Column G shows the total profits (Cols D + E – C – H); Column H shows the distribution costs of the film (in $1000s); and Column I shows the year of release. All information is attached.
A. Create a boxplot of production cost variable. Identify any and all outliers by name.
B. Calculate and report the best model of production cost (Yi = β0 + ε), considering only the films from this dataset
C. Calculate and report the best model of production cost using these RKO films from 1929-1941 as a representation of all film work at that time.
D. Fill in the table with data using the total revenue variable. For Column 1, treat this dataset as a population; for Column 2, treat this dataset as a sample. Replace all ?s with the correct statistical symbol.
undefined
Statistic |
Column 1 |
Column 2 |
μ or |
? = |
? = |
SS |
SS = |
SS = |
σ2 or s2 |
? = |
? = |
σ or s |
? = |
? = |
undefined


0 comments