Exam Instructions

Assigned: Monday May 4, 2020 (5:00pm)

Due: Uploaded to Canvas on Friday, May 8, 2020, before 11:59pm (Central Time). Upload your exam as a

single pdf file.

This is a takehome exam. You may not work (give or receive help) with any one else on this exam; that

includes other students, as well as friends, family, colleagues, faculty, or anyone else. If you have questions you should contact the professor. You may reference all of your course materials (as well as outside

textbooks or the internet). (However: for full credit you will generally need to give an answer that refers

to what you learned specifically in class. If you use ideas or material from outside class, you will need to

explain what you are doing, and they may not yield full credit.)

Note: statistics students have been caught cheating by working on takehome exams together and sadly have

been forced to leave our program and university. Any students caught cheating in such a manner will fail

the course and have a mark put on their academic record, as a minimum punishment. Being caught cheating

can lead to expulsion from the university.

General formatting guidelines:

The usual formatting rules:

• Your homework (HW) should be formatted to be easily readable by the grader.

• You may use knitr or Sweave in general to produce the code portions of the HW. However, the output from knitr/Sweave that you include should be only what is necessary to answer

the question, rather than just any automatic output that R produces. (You may thus need to avoid using default R functions if they output too much unnecessary material, and/or should

make use of invisible() or capture.output().)

– For example: for output from regression, the main things we would want to see are the estimates for each coefficient (with appropriate labels of course) together with the

computed OLS/linear regression standard errors and p-values.

• Code snippets that directly answer the questions can be included in your main homework document; ideally these should be preceded by comments or text at least explaining what

question they are answering. Extra code can be placed in an appendix.

• All plots produced in R should have appropriate labels on the axes as well as titles. Any plot should have explanation of what is being plotted given clearly in the accompanying text.

• Plots and figures should be appropriately sized, meaning they should not be too large, so that the page length is not too long. (The arguments fig.height and fig.width to knitr chunks can

achieve this.)

Questions

On this takehome exam, you will analyze three data sets. Find final-data.rsav on the course webpage. Load it into R by running load("path/final-data.rsav") where “path/final-data.rsav” is

replaced by the full path on your hard drive to the file final-data.rsav (the syntax for which is operating

system dependent). The file contains three data objects: dat1, dat2, and dat3. Each dataset corresponds

to a separate exam question. (Thus, this exam has three questions.) Your output should be in the following

format. (Points will be deducted if it is not.)

• The analysis for each question/dataset should begin on a new page and should have as label the name

of the dataset (“dat1”, “dat2”, or “dat3”).

• On the first page of output for each problem, you should first have a summary (labeled “Summary”)

that provides a succinct answer to the question.

E.g., for SARIMA modeling, provide the model chosen, whether any transformation was used, parameter estimates, standard errors, and p-values in that model. Specify explicitly if you exclude a

constant term. For example, “For the series Yt = X

1/2

, I chose an ARIMA(1, 2, 3) model, including the intercept term. The parameter estimates were ...”. If you believe the data cannot distinguish

between two (or more) models you should describe both (all) of them in this manner here.

• After the summary, should be an explanation (labeled “Explanation”). Here you should provide a

clear explanation of the methodology or reasoning that lead to your answer in the Summary above.

Refer to the output of your analysis, which will be below. This part is still succinct (and does not

include code output) but explains in words why you got the answer(s) you got.

For SARIMA modeling, the model selection and diagnostic techniques we have discussed in class

can be discussed here. You do not need to (and should not) provide an exhaustive list of all possible

models, but should rather provide explanation for which models were reasonable contenders (and

why), and which model (or models) were the best out of those contenders (and why).

• After the explanation is the “Output” you refer to in your summary. (The output may/will be plots or

output from various commands.) All of it should be clearly labeled or described. You do not need to

provide exhaustive output from every single command you have run, but you should include enough

to justify all the arguments you make in your summary.

Finally, in Questions 1 and 2 please refer to the original/raw (untransformed) time series as Xt

in your

descriptions and as xx in your code. Refer to any transformed series as Zt

in your descriptions and zz in

your code. In Question 3, the two series are named xx and yy and you should not rename them.

Dat1

For dat1, your job is to fit the best SARIMA(p, d, q) × (P, D, Q)s model you can to the dataset.

Dat2

Consider dat2. There are two parts to this question.

(A). Pretend we have a “baseline” model, which is AR(1) with φ1 = .7. Your job is to assess whether

the true spectral density of the data match the spectrum from this AR(1) model or not, specifically at

the following frequencies: ω = 0.027, 0.049, 0.25, 0.35, 0.45. That is, using (two-sided) confidence

intervals, test the hypotheses that the true f(ω) equals fAR(1)(ω) (with φ1 = .7) at the 5 omega

values given. (Do not adjust for multiple comparisons.)

(B). Remove just the final observation in the time series (so the length will decrease by just 1). Now again

answer the previous question – but do this just for ω = .027.

To format your answers, follow the same general formatting used for our analysis of SARIMA models: a

Summary (describe just what your result is for each test), Explanation (the broad overview of the methodology that led you to your summary results), and Output. (Note: you may use any R functions you wish for

this question, you do not need to do anything ’by hand’.)

Dat3

dat3 has two columns, yy and xx, which are both time series. Your job is to regress yy on xx using the

(“transfer modelling”) methodology we discussed in class.

QQ：99515681
WeChat：codinghelp
Email：99515681@qq.com
Work Time：8:00-23:00

Hots

Ghostwriter Cs1b Spring 2024 Tth Hw08h... 2024-04-19
Help With Managing Financial Risk Prob... 2024-04-19
Ghostwriter Cs 0449 – Project 5: /Dev/ 2024-04-19
Ghostwriter Elec 2141 Digital Circuit ... 2024-04-19
Help With Csc171 — Videogame Projecthe 2024-04-19
Help With Comp3411 Artificial Intellig 2024-04-19
Help With Stat3061: Random Processes &... 2024-04-19
Ghostwriter Accounting 452, Spring 202... 2024-04-19
Ghostwriter Finc5001 Foundations In Fi... 2024-04-19
Ghostwriter 7Ssmm712 – Topics In Appli 2024-04-19
Help With Com 337 - Film Studies For T... 2024-04-19
Ghostwriter Mes202tc - Digital Vlsi Sy... 2024-04-19
Ghostwriter Geography 2041B Distance S... 2024-04-19
Ghostwriter Ecos3006 International Tra... 2024-04-19
Help With Fit5225 2024 Sm1 Creating An... 2024-04-19
Help With Cit 593: Introduction To Com... 2024-04-19
Help With Math 4931: Take Home Examgho... 2024-04-19
Ghostwriter Csci 547|Info 533: Systems... 2024-04-19
Ghostwriter Cs536-S24 Intro To Pls And... 2024-04-19
Help With Fit5212 - Assignment 1Ghostw... 2024-04-19

Programming Assignment Help！