diff --git a/docs/GP_Practical_files/figure-html/unnamed-chunk-6-1.png b/docs/GP_Practical_files/figure-html/unnamed-chunk-6-1.png index a7c1e3f..a2200f6 100644 Binary files a/docs/GP_Practical_files/figure-html/unnamed-chunk-6-1.png and b/docs/GP_Practical_files/figure-html/unnamed-chunk-6-1.png differ diff --git a/docs/GP_Solutions.html b/docs/GP_Solutions.html index 6f308c9..6a615b0 100644 --- a/docs/GP_Solutions.html +++ b/docs/GP_Solutions.html @@ -254,7 +254,7 @@
Challenges
# Pulling the data from the NEON data base. 
 target <- readr::read_csv("https://data.ecoforecast.org/neon4cast-targets/ticks/ticks-targets.csv.gz", guess_max = 1e1)
-
Rows: 601 Columns: 5
+
Rows: 637 Columns: 5
 ── Column specification ────────────────────────────────────────────────────────
 Delimiter: ","
 chr  (3): site_id, variable, iso_week
@@ -756,8 +756,8 @@ 
X, Y, sometimes with bold or with subscripts – to denote the RVs. In contrast we use lower case letters, e.g. x, y, k, to denote the values that the RV takes. For instance, let's say that the heights of women at Virginia Tech are the RV, X, and X has a normal distribution with mean 62 inches and variance 6^2, i.e., X \sim \mathrm{N}(62,6^2). Say we then observe the heights of 3 individuals drawn from this distribution – we would write this as: x=( 60.3, 62.9, 63.8 ).

+

We usually use capital letters – e.g. X, Y, sometimes with bold or with subscripts – to denote the RVs. In contrast we use lower case letters, e.g. x, y, k, to denote the values that the RV takes. For instance, let's say that the heights of women at Virginia Tech are the RV, X, and X has a normal distribution with mean 62 inches and variance 6^2, i.e., X \sim \mathrm{N}(62,6^2). Say we then observe the heights of 3 individuals drawn from this distribution – we would write this as: x=( 69.2, 63.3, 61.3 ).



@@ -577,7 +577,7 @@

Probability Distributions in R

rnorm(3, mean=0, sd=1) ## random draws
-
[1] 1.410971 1.186608 0.194505
+
[1] -1.1472029 -0.2967273 -1.4990834
diff --git a/docs/Stats_review_files/figure-html/unnamed-chunk-7-1.png b/docs/Stats_review_files/figure-html/unnamed-chunk-7-1.png index 5f84ef0..9fc043c 100644 Binary files a/docs/Stats_review_files/figure-html/unnamed-chunk-7-1.png and b/docs/Stats_review_files/figure-html/unnamed-chunk-7-1.png differ diff --git a/docs/Stats_review_files/figure-html/unnamed-chunk-8-1.png b/docs/Stats_review_files/figure-html/unnamed-chunk-8-1.png index 224288c..bc3d2f8 100644 Binary files a/docs/Stats_review_files/figure-html/unnamed-chunk-8-1.png and b/docs/Stats_review_files/figure-html/unnamed-chunk-8-1.png differ diff --git a/docs/Stats_review_files/figure-html/unnamed-chunk-9-1.png b/docs/Stats_review_files/figure-html/unnamed-chunk-9-1.png index 70cf592..caf7ef7 100644 Binary files a/docs/Stats_review_files/figure-html/unnamed-chunk-9-1.png and b/docs/Stats_review_files/figure-html/unnamed-chunk-9-1.png differ diff --git a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-15-1.png b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-15-1.png index 69cdf4c..8c53272 100644 Binary files a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-15-1.png and b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-15-1.png differ diff --git a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-19-1.png b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-19-1.png index 0a58c91..e7c7a83 100644 Binary files a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-19-1.png and b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-19-1.png differ diff --git a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-20-1.png b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-20-1.png index 4373d14..f4dc586 100644 Binary files a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-20-1.png and b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-20-1.png differ diff --git a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-7-1.png b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-7-1.png index 3c41c0f..5cd07c7 100644 Binary files a/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-7-1.png and b/docs/VB_RegDiagTrans_files/figure-revealjs/unnamed-chunk-7-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln.html b/docs/VB_TimeDepData_practical_soln.html new file mode 100644 index 0000000..306669c --- /dev/null +++ b/docs/VB_TimeDepData_practical_soln.html @@ -0,0 +1,1355 @@ + + + + + + + + + + + +VectorByte Training 2024 - VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution) + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +

VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)

Author
Leah R. Johnson

Affiliation
Virginia Tech and VectorByte

Published
July 23, 2024


+
+

Overview and Instructions

+

The goal of this practical is to practice building models for time-dependent data using simple regression-based techniques. This includes incorporating possible transformations, trying out different time-dependent predictors (including lagged variables), and assessing model fit using diagnostic plots.

+


+
+
+

Guided example: Monthly average mosquito counts in Walton County, FL

+

The file Culex_erraticus_walton_covariates_aggregated.csv on the course website contains data on average monthly counts of mosquitoes (sample_value) in Walton County, FL, together with the monthly average maximum temperature (MaxTemp, in C) and precipitation (Precip, in inches) for each month from January 2015 through December 2017 (Month_Yr).

+
+

Exploring the Data

+

As always, we first want to take a look at the data, to make sure we understand it, and that we don’t have missing or weird values.

+
+
mozData<-read.csv("data/Culex_erraticus_walton_covariates_aggregated.csv")
+summary(mozData)
+
+
   Month_Yr          sample_value        MaxTemp          Precip      
+ Length:36          Min.   :0.00000   Min.   :16.02   Min.   : 0.000  
+ Class :character   1st Qu.:0.04318   1st Qu.:22.99   1st Qu.: 2.162  
+ Mode  :character   Median :0.73001   Median :26.69   Median : 4.606  
+                    Mean   :0.80798   Mean   :26.23   Mean   : 5.595  
+                    3rd Qu.:1.22443   3rd Qu.:30.70   3rd Qu.: 7.864  
+                    Max.   :3.00595   Max.   :33.31   Max.   :18.307  
+
+
+

We can see that the minimum observed average number of mosquitoes is zero, and the max is only about 3 (there are likely many zeros averaged over many days in the month). There don't appear to be any NAs in the data. In this case the dataset itself is small enough that we can print the whole thing to ensure it's complete:

+
+
mozData
+
+
   Month_Yr sample_value  MaxTemp       Precip
+1   2015-01  0.000000000 17.74602  3.303991888
+2   2015-02  0.018181818 17.87269 16.544265802
+3   2015-03  0.468085106 23.81767  2.405651215
+4   2015-04  1.619047619 26.03559  8.974406168
+5   2015-05  0.821428571 30.01602  0.567960943
+6   2015-06  3.005952381 31.12094  4.841342729
+7   2015-07  2.380952381 32.81130  3.849010353
+8   2015-08  1.826347305 32.56245  5.562845324
+9   2015-09  0.648809524 30.55155 10.409724627
+10  2015-10  0.988023952 27.22605  0.337750269
+11  2015-11  0.737804878 24.86768 18.306749680
+12  2015-12  0.142857143 22.46588  5.621475377
+13  2016-01  0.000000000 16.02406  3.550622029
+14  2016-02  0.020202020 19.42057 11.254680803
+15  2016-03  0.015151515 23.13610  4.785664728
+16  2016-04  0.026143791 24.98082  4.580424519
+17  2016-05  0.025252525 28.72884  0.053057634
+18  2016-06  0.833333333 30.96990  6.155417473
+19  2016-07  1.261363636 33.30509  4.496368193
+20  2016-08  1.685279188 32.09633 11.338749182
+21  2016-09  2.617142857 31.60575  2.868288451
+22  2016-10  1.212121212 29.14275  0.000000000
+23  2016-11  1.539772727 24.48482  0.005462681
+24  2016-12  0.771573604 20.46054 11.615521725
+25  2017-01  0.045454545 18.35473  0.000000000
+26  2017-02  0.036363636 23.65584  3.150710053
+27  2017-03  0.194285714 22.53573  1.430094952
+28  2017-04  0.436548223 26.15299  0.499381616
+29  2017-05  1.202020202 28.00173  6.580562663
+30  2017-06  0.834196891 29.48951 13.333939858
+31  2017-07  1.765363128 32.25135  7.493927035
+32  2017-08  0.744791667 31.86476  6.082113434
+33  2017-09  0.722222222 30.60566  4.631037395
+34  2017-10  0.142131980 27.73453 11.567112214
+35  2017-11  0.289772727 23.23140  1.195760473
+36  2017-12  0.009174312 18.93603  4.018254442
+
+
+
+
+

Plotting the data

+

First we’ll examine the data itself, including the predictors:

+
+
months<-dim(mozData)[1]
+t<-1:months ## counter for months in the data set
+par(mfrow=c(3,1))
+plot(t, mozData$sample_value, type="l", lwd=2, 
+     main="Average Monthly Abundance", 
+     xlab ="Time (months)", 
+     ylab = "Average Count")
+plot(t, mozData$MaxTemp, type="l",
+     col = 2, lwd=2, 
+     main="Average Maximum Temp", 
+     xlab ="Time (months)", 
+     ylab = "Temperature (C)")
+plot(t, mozData$Precip, type="l",
+     col="dodgerblue", lwd=2,
+     main="Average Monthly Precip", 
+     xlab ="Time (months)", 
+     ylab = "Precipitation (in)")

Visually, we notice that there may be a bit of clumping in the values for abundance (this is subtle) – in particular, since we have a lot of very small/nearly-zero counts, a transform, such as a square root, may spread out the abundances. It also looks like both the abundance and temperature data are more cyclical than the precipitation, and thus more likely to be related to each other. There's also not much visual indication of a trend, but it's usually worthwhile to consider one anyway. Replotting the abundance data with a transformation:

+
+
months<-dim(mozData)[1]
+t<-1:months ## counter for months in the data set
+plot(t, sqrt(mozData$sample_value), type="l", lwd=2, 
+     main="Sqrt Average Monthly Abundance", 
+     xlab ="Time (months)", 
+     ylab = "Average Count")

That looks a little bit better. I suggest we go with this for our response.
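If you want to double-check that choice, it's easy to compare the square root against another common transform for count-like data, such as log(x+1). This is a quick sketch (an editor's addition rather than part of the original solution), reusing the mozData object loaded above:

## compare two candidate transforms side by side;
## log1p(x) computes log(x + 1), which is safe for the zero counts
par(mfrow=c(1,2))
hist(sqrt(mozData$sample_value), main="sqrt transform",
     xlab="sqrt(average count)")
hist(log1p(mozData$sample_value), main="log(x+1) transform",
     xlab="log(average count + 1)")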

+
+
+

Building a data frame

+

Before we get into model building, we always want to build a data frame to contain all of the predictors that we want to consider, at the potential lags that we’re interested in. In the lecture we saw building the AR, sine/cosine, and trend predictors:

+
+
t <- 2:months ## to make building the AR1 predictors easier
+
+mozTS <- data.frame(
+  Y=sqrt(mozData$sample_value[t]), # transformed response
+  Yl1=sqrt(mozData$sample_value[t-1]), # AR1 predictor
+  t=t, # trend predictor
+  sin12=sin(2*pi*t/12), 
+  cos12=cos(2*pi*t/12) # periodic predictors
+  )
+
+

We will also put in the temperature and precipitation predictors, but we need to think about what an appropriate lag might be. If this were daily or weekly data, we'd probably want a fairly sizable lag – mosquitoes take a while to develop, so the number we see today is not likely related to today's temperature. However, since these data are aggregated across a whole month, as are the temperature and precipitation, the current month's values are likely to be useful. It's possible that last month's values may be useful too, so we'll add those in as well:

+
+
mozTS$MaxTemp<-mozData$MaxTemp[t] ## current temps
+mozTS$MaxTempl1<-mozData$MaxTemp[t-1] ## previous temps
+mozTS$Precip<-mozData$Precip[t] ## current precip
+mozTS$Precipl1<-mozData$Precip[t-1] ## previous precip
+
+

Thus, our full data frame:

+
+
summary(mozTS)
+
+
       Y               Yl1               t            sin12         
+ Min.   :0.0000   Min.   :0.0000   Min.   : 2.0   Min.   :-1.00000  
+ 1st Qu.:0.2951   1st Qu.:0.2951   1st Qu.:10.5   1st Qu.:-0.68301  
+ Median :0.8590   Median :0.8590   Median :19.0   Median : 0.00000  
+ Mean   :0.7711   Mean   :0.7684   Mean   :19.0   Mean   :-0.01429  
+ 3rd Qu.:1.1120   3rd Qu.:1.1120   3rd Qu.:27.5   3rd Qu.: 0.68301  
+ Max.   :1.7338   Max.   :1.7338   Max.   :36.0   Max.   : 1.00000  
+     cos12             MaxTemp        MaxTempl1         Precip      
+ Min.   :-1.00000   Min.   :16.02   Min.   :16.02   Min.   : 0.000  
+ 1st Qu.:-0.68301   1st Qu.:23.18   1st Qu.:23.18   1st Qu.: 1.918  
+ Median : 0.00000   Median :27.23   Median :27.23   Median : 4.631  
+ Mean   :-0.02474   Mean   :26.47   Mean   :26.44   Mean   : 5.660  
+ 3rd Qu.: 0.50000   3rd Qu.:30.79   3rd Qu.:30.79   3rd Qu.: 8.234  
+ Max.   : 1.00000   Max.   :33.31   Max.   :33.31   Max.   :18.307  
+    Precipl1     
+ Min.   : 0.000  
+ 1st Qu.: 1.918  
+ Median : 4.631  
+ Mean   : 5.640  
+ 3rd Qu.: 8.234  
+ Max.   :18.307  
+
+
+
+
head(mozTS)
+
+
          Y       Yl1 t         sin12         cos12  MaxTemp MaxTempl1
+1 0.1348400 0.0000000 2  8.660254e-01  5.000000e-01 17.87269  17.74602
+2 0.6841675 0.1348400 3  1.000000e+00  6.123234e-17 23.81767  17.87269
+3 1.2724180 0.6841675 4  8.660254e-01 -5.000000e-01 26.03559  23.81767
+4 0.9063270 1.2724180 5  5.000000e-01 -8.660254e-01 30.01602  26.03559
+5 1.7337683 0.9063270 6  1.224647e-16 -1.000000e+00 31.12094  30.01602
+6 1.5430335 1.7337683 7 -5.000000e-01 -8.660254e-01 32.81130  31.12094
+      Precip   Precipl1
+1 16.5442658  3.3039919
+2  2.4056512 16.5442658
+3  8.9744062  2.4056512
+4  0.5679609  8.9744062
+5  4.8413427  0.5679609
+6  3.8490104  4.8413427
+
+
+
+
+

Building a first model

+

We will first build a very simple model – just a trend – to practice building the model, checking diagnostics, and plotting predictions.

+
+
mod1<-lm(Y ~ t, data=mozTS)
+summary(mod1)
+
+

+Call:
+lm(formula = Y ~ t, data = mozTS)
+
+Residuals:
+     Min       1Q   Median       3Q      Max 
+-0.81332 -0.47902  0.03671  0.37384  0.87119 
+
+Coefficients:
+             Estimate Std. Error t value Pr(>|t|)    
+(Intercept)  0.904809   0.178421   5.071  1.5e-05 ***
+t           -0.007038   0.008292  -0.849    0.402    
+---
+Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
+
+Residual standard error: 0.4954 on 33 degrees of freedom
+Multiple R-squared:  0.02136,   Adjusted R-squared:  -0.008291 
+F-statistic: 0.7204 on 1 and 33 DF,  p-value: 0.4021
+
+
+

The model output indicates that this model is not useful – the trend is not significant and it only explains about 2% of the variability. Let’s plot the predictions:

+
+
## plot points and fitted lines
+plot(Y~t, data=mozTS, col=1, type="l")
+lines(t, mod1$fitted, col="dodgerblue", lwd=2)

Not good – we'll definitely need to try something else! Remember that since we're using a linear model, we should check our residual plots as usual, and then also plot the ACF of the residuals:

+
+
par(mfrow=c(1,3), mar=c(4,4,2,0.5))   
+
+## studentized residuals vs fitted
+plot(mod1$fitted, rstudent(mod1), col=1,
+     xlab="Fitted Values", 
+     ylab="Studentized Residuals", 
+     pch=20, main="trend only model")
+
+## qq plot of studentized residuals
+qqnorm(rstudent(mod1), pch=20, col=1, main="" )
+abline(a=0,b=1,lty=2, col=2)
+
+## histogram of studentized residuals
+hist(rstudent(mod1), col=1, 
+     xlab="Studentized Residuals", 
+     main="", border=8)

This doesn't look really bad, although the histogram might be a bit odd. Finally, the ACF:

+
+
acf(mod1$residuals)

This is where we can see that we definitely aren't capturing the pattern: there's substantial autocorrelation left at a 1-month lag, and again at around 6 months.
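If you'd like a formal test to go along with the visual read of the ACF (an addition here, not something the original solution used), the Ljung–Box test in base R checks whether the residual autocorrelations up to a chosen lag are jointly zero:

## small p-value = evidence of leftover autocorrelation in the residuals
Box.test(mod1$residuals, lag = 12, type = "Ljung-Box")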

+

Finally, to set up the comparisons below, we can extract the BIC for this model so that we can compare it with the other models that you'll build next.

+
+
n<-length(t)
+extractAIC(mod1, k=log(n))[2]
+
+
[1] -44.11057
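Since we'll extract the BIC the same way for every model below, it may be worth wrapping the call in a small helper (getBIC is a hypothetical convenience function, not part of the original solution):

## helper: BIC for an lm fit based on n observations
getBIC <- function(mod, n) extractAIC(mod, k = log(n))[2]
getBIC(mod1, n = length(t))   ## same value as above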
+
+
+
+
+
+

Build and compare your own models (Example solution)

+

Follow the procedure I showed for the model with a simple trend, and build at least 4 more models:

+
  1. one that contains an AR term
  2. one with the sine/cosine terms
  3. one with the environmental predictors
  4. one with a combination

Check diagnostics/model assumptions as you go. Then at the end compare all of your models via BIC. What is your best model by that metric? We’ll share among the group what folks found to be good models.

+

NOTE: The solutions I show below are examples of what one could do; your models might be a bit different.

+
+

Example Solution: AR1 model only

+
+
mod2<-lm(Y ~ Yl1, data=mozTS)
+summary(mod2)
+
+

+Call:
+lm(formula = Y ~ Yl1, data = mozTS)
+
+Residuals:
+    Min      1Q  Median      3Q     Max 
+-0.6338 -0.2173 -0.0678  0.2463  0.8675 
+
+Coefficients:
+            Estimate Std. Error t value Pr(>|t|)    
+(Intercept)   0.2410     0.1130   2.132   0.0405 *  
+Yl1           0.6899     0.1240   5.562 3.51e-06 ***
+---
+Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
+
+Residual standard error: 0.3598 on 33 degrees of freedom
+Multiple R-squared:  0.4839,    Adjusted R-squared:  0.4682 
+F-statistic: 30.94 on 1 and 33 DF,  p-value: 3.507e-06
+
+
+

The model is better than the original trend only model – the AR1 term explains about 48% of the variability. Let’s plot the predictions:

+
+
## plot points and fitted lines
+plot(Y~t, data=mozTS, col=1, type="l")
+lines(t, mod2$fitted, col=2, lwd=2)

Pretty good! Look at all of the diagnostic plots:

+
+
par(mfrow=c(1,3), mar=c(4,4,2,0.5))   
+
+## studentized residuals vs fitted
+plot(mod2$fitted, rstudent(mod2), col=2,
+     xlab="Fitted Values", 
+     ylab="Studentized Residuals", 
+     pch=20, main="AR 1 only model")
+
+## qq plot of studentized residuals
+qqnorm(rstudent(mod2), pch=20, col=2, main="" )
+abline(a=0,b=1,lty=2, col=1)
+
+## histogram of studentized residuals
+hist(rstudent(mod2), col=2, 
+     xlab="Studentized Residuals", 
+     main="", border=8)

Maybe one outlier, but not too bad.

+
+
acf(mod2$residuals)

We seem to have taken care of all of the autocorrelation, even at multiple lags!

+
+
n<-length(t)
+extractAIC(mod2, k=log(n))[2]
+
+
[1] -66.50482
+
+
+

The BIC is much lower – overall a much better model than the first one.

+
+
+

Example Solution: sine/cosine terms only

+
+
mod3<-lm(Y ~ sin12 + cos12, data=mozTS)
+summary(mod3)
+
+

+Call:
+lm(formula = Y ~ sin12 + cos12, data = mozTS)
+
+Residuals:
+     Min       1Q   Median       3Q      Max 
+-0.70116 -0.21655 -0.03611  0.19213  0.67992 
+
+Coefficients:
+            Estimate Std. Error t value Pr(>|t|)    
+(Intercept)  0.75706    0.05750  13.165 1.83e-14 ***
+sin12       -0.38804    0.08072  -4.807 3.48e-05 ***
+cos12       -0.34298    0.08192  -4.187 0.000207 ***
+---
+Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
+
+Residual standard error: 0.3399 on 32 degrees of freedom
+Multiple R-squared:  0.5533,    Adjusted R-squared:  0.5254 
+F-statistic: 19.82 on 2 and 32 DF,  p-value: 2.512e-06
+
+
+

The model is better than the original trend only model – it explains about 55% of the variability (we expect R^2 to increase as we have more predictors). Let’s plot the predictions:

+
+
## plot points and fitted lines
+plot(Y~t, data=mozTS, col=1, type="l")
+lines(t, mod3$fitted, col=3, lwd=2)

Pretty good! Look at all of the diagnostic plots:

+
+
par(mfrow=c(1,3), mar=c(4,4,2,0.5))   
+
+## studentized residuals vs fitted
+plot(mod3$fitted, rstudent(mod3), col=3,
+     xlab="Fitted Values", 
+     ylab="Studentized Residuals", 
+     pch=20, main="sin/cos only model")
+
+## qq plot of studentized residuals
+qqnorm(rstudent(mod3), pch=20, col=3, main="" )
+abline(a=0,b=1,lty=2, col=2)
+
+## histogram of studentized residuals
+hist(rstudent(mod3), col=3, 
+     xlab="Studentized Residuals", 
+     main="", border=8)

Maybe one outlier, but not too bad.

+
+
acf(mod3$residuals)

We seem to have taken care of the longer-lag autocorrelation, but there's still some left at lag 1.

+
+
n<-length(t)
+extractAIC(mod3, k=log(n))[2]
+
+
[1] -68.00597
+
+
+

This model is even better than the AR1 model. We’ll keep this in mind….

+
+
+

Example Solution: environmental predictors only

+

I’ll put in the predictors at the current time period. Since this is monthly averaged data we could probably do either current or lagged.

+
+
mod4<-lm(Y ~ MaxTemp + Precip, data=mozTS)
+summary(mod4)
+
+

+Call:
+lm(formula = Y ~ MaxTemp + Precip, data = mozTS)
+
+Residuals:
+     Min       1Q   Median       3Q      Max 
+-0.76043 -0.17925 -0.01671  0.15491  0.64193 
+
+Coefficients:
+             Estimate Std. Error t value Pr(>|t|)    
+(Intercept) -1.248452   0.323576  -3.858 0.000521 ***
+MaxTemp      0.075450   0.011641   6.481 2.72e-07 ***
+Precip       0.003928   0.011870   0.331 0.742852    
+---
+Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
+
+Residual standard error: 0.3344 on 32 degrees of freedom
+Multiple R-squared:  0.5676,    Adjusted R-squared:  0.5406 
+F-statistic:    21 on 2 and 32 DF,  p-value: 1.493e-06
+
+
+

The model is even better than the last – it explains about 57% of the variability, although Precip isn't significant, so we might want to consider dropping it.
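Dropping Precip is a one-line check if you're curious – a quick sketch (mod4b is my name for it; it was not fit in the original solution):

## temperature-only model; its BIC can be compared with the others below
mod4b <- lm(Y ~ MaxTemp, data = mozTS)
extractAIC(mod4b, k = log(length(t)))[2]

Let's plot the predictions: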

+
+
## plot points and fitted lines
+plot(Y~t, data=mozTS, col=1, type="l")
+lines(t, mod4$fitted, col=4, lwd=2)

Pretty good! Look at all of the diagnostic plots:

+
+
par(mfrow=c(1,3), mar=c(4,4,2,0.5))   
+
+## studentized residuals vs fitted
+plot(mod4$fitted, rstudent(mod4), col=4,
+     xlab="Fitted Values", 
+     ylab="Studentized Residuals", 
+     pch=20, main="weather model")
+
+## qq plot of studentized residuals
+qqnorm(rstudent(mod4), pch=20, col=4, main="" )
+abline(a=0,b=1,lty=2, col=2)
+
+## histogram of studentized residuals
+hist(rstudent(mod4), col=4, 
+     xlab="Studentized Residuals", 
+     main="", border=8)

Maybe one outlier again, but not too bad.

+
+
acf(mod4$residuals)

We seem to have taken care of most of the autocorrelation, except maybe a bit at lag 1.

+
+
n<-length(t)
+extractAIC(mod4, k=log(n))[2]
+
+
[1] -69.14372
+
+
+

Even better, although it's not much different from the sin/cos model.

+
+
+

Example Solution: AR1 plus sin/cos

+

Ok, now to combine things:

+
+
mod5<-lm(Y ~ Yl1 + sin12 + cos12, data=mozTS)
+summary(mod5)
+
+

+Call:
+lm(formula = Y ~ Yl1 + sin12 + cos12, data = mozTS)
+
+Residuals:
+     Min       1Q   Median       3Q      Max 
+-0.49092 -0.25028 -0.02153  0.17287  0.60748 
+
+Coefficients:
+            Estimate Std. Error t value Pr(>|t|)    
+(Intercept)  0.38035    0.12935   2.940 0.006148 ** 
+Yl1          0.49652    0.15681   3.166 0.003453 ** 
+sin12       -0.13417    0.10729  -1.251 0.220457    
+cos12       -0.29593    0.07386  -4.007 0.000358 ***
+---
+Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
+
+Residual standard error: 0.3002 on 31 degrees of freedom
+Multiple R-squared:  0.6625,    Adjusted R-squared:  0.6298 
+F-statistic: 20.28 on 3 and 31 DF,  p-value: 1.835e-07
+
+
+

This combined model is the best yet – together the AR1 and seasonal terms explain about 66% of the variability. Let's plot the predictions:

+
+
## plot points and fitted lines
+plot(Y~t, data=mozTS, col=1, type="l")
+lines(t, mod5$fitted, col=5, lwd=2)

Pretty good! Look at all of the diagnostic plots:

+
+
par(mfrow=c(1,3), mar=c(4,4,2,0.5))   
+
+## studentized residuals vs fitted
+plot(mod5$fitted, rstudent(mod5), col=5,
+     xlab="Fitted Values", 
+     ylab="Studentized Residuals", 
+     pch=20, main="AR1 + sin/cos model")
+
+## qq plot of studentized residuals
+qqnorm(rstudent(mod5), pch=20, col=5, main="" )
+abline(a=0,b=1,lty=2, col=2)
+
+## histogram of studentized residuals
+hist(rstudent(mod5), col=5, 
+     xlab="Studentized Residuals", 
+     main="", border=8)

That's really good!

+
+
acf(mod5$residuals)

We seem to have taken care of all of the autocorrelation!
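As a formal complement (again an editor's addition, not part of the original solution), the same Ljung–Box test from earlier can be used to double-check this:

## a large p-value is consistent with no remaining autocorrelation
Box.test(mod5$residuals, lag = 12, type = "Ljung-Box")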

+
+
n<-length(t)
+extractAIC(mod5, k=log(n))[2]
+
+
[1] -74.25862
+
+
+

And definitely the best so far. Just to compare more easily:

+
+
c(mod1 = extractAIC(mod1, k=log(n))[2],
+  mod2 = extractAIC(mod2, k=log(n))[2],
+  mod3 = extractAIC(mod3, k=log(n))[2],
+  mod4 = extractAIC(mod4, k=log(n))[2],
+  mod5 = extractAIC(mod5, k=log(n))[2])
+
+
     mod1      mod2      mod3      mod4      mod5 
+-44.11057 -66.50482 -68.00597 -69.14372 -74.25862 
+
+
+

We're looking for a difference of about 5 in BIC to call one model better than another. Model 5 is about 5 better than model 4, and models 2-4 are all roughly even. It may be that AR1 plus temperature would be even better, but it's easier to forecast with a sine/cosine than with temperature, so I went with that (a quick check is sketched below).
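The AR1-plus-temperature idea is quick to try if you're curious – a sketch (mod6 is my name for it; it was not fit in the original solution, so no BIC for it is reported above):

## AR1 term plus current-month temperature
mod6 <- lm(Y ~ Yl1 + MaxTemp, data = mozTS)
extractAIC(mod6, k = log(n))[2]  ## compare against mod5's -74.26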

+
+
+
+

Extra Practice

+

Imagine that you are missing a few months at random – how would you need to modify the analysis? Try it out by removing about 5 months, not at the beginning or end of the time series.
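One way to set this up, as a sketch (mozMiss, mozTS2, and mod5miss are my names, not part of the original solution; the key wrinkle is that deleting a month also invalidates the next month's AR1 predictor, which lm() handles by dropping incomplete rows):

## make 5 interior months missing at random, rebuild the lagged
## data frame, and refit; lm() silently drops rows containing NA
set.seed(1)
drop <- sample(5:30, 5)            # interior months only
mozMiss <- mozData
mozMiss$sample_value[drop] <- NA   # pretend these months are missing
t2 <- 2:nrow(mozMiss)
mozTS2 <- data.frame(
  Y     = sqrt(mozMiss$sample_value[t2]),
  Yl1   = sqrt(mozMiss$sample_value[t2-1]),
  sin12 = sin(2*pi*t2/12),
  cos12 = cos(2*pi*t2/12))
mod5miss <- lm(Y ~ Yl1 + sin12 + cos12, data = mozTS2)
summary(mod5miss)

Note that once rows are dropped, BIC values are based on fewer observations and are no longer directly comparable to the fits above.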

+ + +
+ +

Citation

BibTeX citation:
@online{r. johnson2024,
+  author = {R. Johnson, Leah},
+  title = {VectorByte {Methods} {Training:} {Regression} {Methods} for
+    {Time} {Dependent} {Data} (Practical - Solution)},
+  date = {2024-07-23},
+  langid = {en}
+}
+
For attribution, please cite this work as:
R. Johnson, Leah. 2024. “VectorByte Methods Training: Regression Methods for Time Dependent Data (Practical - Solution).” July 23, 2024.
+ +
+ + + + + \ No newline at end of file diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-10-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-10-1.png new file mode 100644 index 0000000..92d91fb Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-10-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-11-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-11-1.png new file mode 100644 index 0000000..021a990 Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-11-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-12-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-12-1.png new file mode 100644 index 0000000..d0e16aa Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-12-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-15-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-15-1.png new file mode 100644 index 0000000..d7f46cf Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-15-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-16-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-16-1.png new file mode 100644 index 0000000..d27c705 Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-16-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-17-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-17-1.png new file mode 100644 index 0000000..2a397ec Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-17-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-20-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-20-1.png new file mode 100644 index 0000000..d8ddef1 Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-20-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-21-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-21-1.png new file mode 100644 index 0000000..0bbae3e Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-21-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-22-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-22-1.png new file mode 100644 index 0000000..a0b050b Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-22-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-25-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-25-1.png new file mode 100644 index 0000000..fa1733b Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-25-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-26-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-26-1.png new file mode 100644 index 0000000..d485610 Binary files /dev/null and 
b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-26-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-27-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-27-1.png new file mode 100644 index 0000000..7122209 Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-27-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-3-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-3-1.png new file mode 100644 index 0000000..a1b87b6 Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-3-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-30-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-30-1.png new file mode 100644 index 0000000..5783c25 Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-30-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-31-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-31-1.png new file mode 100644 index 0000000..8fd6176 Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-31-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-32-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-32-1.png new file mode 100644 index 0000000..bb5160a Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-32-1.png differ diff --git a/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-4-1.png b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-4-1.png new file mode 100644 index 0000000..4c166ec Binary files /dev/null and b/docs/VB_TimeDepData_practical_soln_files/figure-html/unnamed-chunk-4-1.png differ diff --git a/docs/search.json b/docs/search.json index 8f37c19..76dfd5a 100644 --- a/docs/search.json +++ b/docs/search.json @@ -53,7 +53,7 @@ "href": "GP_Solutions.html", "title": "GP_Solutions", "section": "", - "text": "Libraries\n\nlibrary(mvtnorm)\nlibrary(laGP)\nlibrary(hetGP)\nlibrary(ggplot2)\n\n\n\nHetGP (sin wave eg)\n\n# Your turn\nset.seed(26)\nn <- 8 # number of points\nX <- matrix(seq(0, 2*pi, length= n), ncol=1) # build inputs \ny <- 5*sin(X) + rnorm(n, 0 , 2) # response with some noise\n\n# Predict on this set\nXX <- matrix(seq(-0.5, 2*pi + 0.5, length= 100), ncol=1)\n\n# Data visualization\nplot(X, y)\n\n\n\n\n\n\n\n# ------ Solutions ------------------------------\n\nhet_fit <- hetGP::mleHetGP(X, y)\nhet_pred <- predict(het_fit, XX)\n\nmean <- het_pred$mean\ns2 <- het_pred$sd2 + het_pred$nugs\n\nyy <- 5*sin(XX)\n\npar(mfrow = c(1, 1), mar = c(4, 4, 4, 1))\nplot(X, y, ylim = c(-10, 10))\nlines(XX, yy, col = 3)\nlines(XX, mean, col = 2)\nlines(XX, mean + 2 * sqrt(s2), col = 4)\nlines(XX, mean - 2 * sqrt(s2), col = 4)\n\n\n\n\n\n\n\n# You can check the nuggets (each one will be different)\nnugs <- het_pred$nugs\nsummary(nugs)\n\n Min. 1st Qu. Median Mean 3rd Qu. Max. \n 3.904 3.927 3.931 3.936 3.949 3.971 \n\n\n\n\nChallenges\nWe need to load the data and the functions\n\n# Pulling the data from the NEON data base. 
\ntarget <- readr::read_csv(\"https://data.ecoforecast.org/neon4cast-targets/ticks/ticks-targets.csv.gz\", guess_max = 1e1)\n\nRows: 601 Columns: 5\n── Column specification ────────────────────────────────────────────────────────\nDelimiter: \",\"\nchr (3): site_id, variable, iso_week\ndbl (1): observation\ndate (1): datetime\n\nℹ Use `spec()` to retrieve the full column specification for this data.\nℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.\n\n# transforms y\nf <- function(x) {\n y <- log(x + 1)\n return(y)\n}\n\n# This function back transforms the input argument\nfi <- function(y) {\n x <- exp(y) - 1\n return(x)\n}\n\n# This function tells us the iso-week number given the date\nfx.iso_week <- function(datetime){\n # Gives ISO-week in the format yyyy-w## and we extract the ##\n x1 <- as.numeric(stringr::str_sub(ISOweek::ISOweek(datetime), 7, 8)) # find iso week #\n return(x1)\n}\n\nfx.sin <- function(datetime, f1 = fx.iso_week){\n # identify iso week#\n x <- f1(datetime) \n # calculate sin value for that week\n x2 <- (sin(2*pi*x/106))^2 \n return(x2)\n}\n\n\nFit a GP Model for the location “SERC” i.e. site_number = 7.\nJust change site = 7\n\nsite_number <- 7 # (site_number = 4) for the other challenge\n\n# Obtaining site name\nsite_names <- unique(target$site_id)\n\n# Subsetting all the data at that location\ndf <- subset(target, target$site_id == site_names[site_number])\n\n# extracting only the datetime and obs columns\ndf <- df[, c(\"datetime\", \"observation\")]\n\n# Selecting a date before which we consider everything as training data and after this is testing data.\ncutoff = as.Date('2020-12-31')\ndf_train <- subset(df, df$datetime <= cutoff)\ndf_test <- subset(df, df$datetime > cutoff)\n\n# Setting up iso-week and sin wave predictors by calling the functions\nX1 <- fx.iso_week(df_train$datetime) # range is 1-53\nX2 <- fx.sin(df_train$datetime) # range is 0 to 1\n\n# Centering the iso-week by diving by 53\nX1c <- X1/ 53\n\n# We combine columns centered X1 and X2, into a matrix as our input space\nX <- as.matrix(cbind.data.frame(X1c, X2))\nhead(X)\n\n X1c X2\n[1,] 0.3584906 0.8150439\n[2,] 0.3962264 0.8974272\n[3,] 0.4528302 0.9782005\n[4,] 0.5094340 0.9991219\n[5,] 0.6226415 0.8587536\n[6,] 0.6792453 0.7150326\n\ny_obs <- df_train$observation\ny <- f(y_obs) # transform y\n\n# A very small value for stability\neps <- sqrt(.Machine$double.eps) \n \n# Priors for theta and g. \nd <- darg(list(mle=TRUE, min =eps, max=5), X)\ng <- garg(list(mle=TRUE, min = eps, max = 1), y)\n\n# Fitting a GP with our data, and some starting values for theta and g\ngpi <- newGPsep(X, y, d = 0.1, g = 1, dK = T)\n\n# Jointly infer MLE for all parameters\nmle <- jmleGPsep(gpi, drange = c(d$min, d$max), grange = c(g$min, g$max), \n dab = d$ab, gab= g$ab)\n\n# Create a grid from start date in our data set to one year in future (so we forecast for next season)\nstartdate <- as.Date(min(df$datetime))# identify start week\ngrid_datetime <- seq.Date(startdate, Sys.Date() + 365, by = 7) # create sequence\n\n# Build the input space for the predictive space (All weeks from 04-2014 to 07-2025)\nXXt1 <- fx.iso_week(grid_datetime)\nXXt2 <- fx.sin(grid_datetime)\n\n# Standardize\nXXt1c <- XXt1/53\n\n# Store inputs as a matrix\nXXt <- as.matrix(cbind.data.frame(XXt1c, XXt2))\n\n# Make predictions using predGP with the gp object and the predictive set\nppt <- predGPsep(gpi, XXt) \n\n# Now we store the mean as our predicted response i.e. 
density along with quantiles\nyyt <- ppt$mean\nq1t <- ppt$mean + qnorm(0.025,0,sqrt(diag(ppt$Sigma))) #lower bound\nq2t <- ppt$mean + qnorm(0.975,0,sqrt(diag(ppt$Sigma))) # upper bound\n\n# Back transform our data to original\ngp_yy <- fi(yyt)\ngp_q1 <- fi(q1t)\ngp_q2 <- fi(q2t)\n\n# Plot the observed points\nplot(as.Date(df$datetime), df$observation,\n main = paste(site_names[site_number]), col = \"black\",\n xlab = \"Dates\" , ylab = \"Abundance\",\n # xlim = c(as.Date(min(df$datetime)), as.Date(cutoff)),\n ylim = c(min(df_train$observation, gp_yy, gp_q1), max(df_train$observation, gp_yy, gp_q2)* 1.05))\n\n# Plot the testing set data \npoints(as.Date(df_test$datetime), df_test$observation, col =\"black\", pch = 19)\n\n# Line to indicate seperation between train and test data\nabline(v = as.Date(cutoff), lwd = 2)\n\n# Add the predicted response and the quantiles\nlines(grid_datetime, gp_yy, col = 4, lwd = 2)\nlines(grid_datetime, gp_q1, col = 4, lwd = 1.2, lty = 2)\nlines(grid_datetime, gp_q2, col = 4, lwd = 1.2, lty =2)\n\n\n\n\n\n\n\n# Obtain true observed values for testing set\nyt_true <- f(df_test$observation)\n\n# FInd corresponding predictions from our model in the grid we predicted on\nyt_pred <- yyt[which(grid_datetime %in% df_test$datetime)]\n\n# calculate RMSE\nrmse <- sqrt(mean((yt_true - yt_pred)^2))\nrmse\n\n[1] 0.9553652\n\n\n\n\nUse an environmental predictor in your model. Following is a function fx.green that creates the variable given the datetime and the location.\nHere is a snippet of the supporting file that you will use; You can look into the data.frame and try to plot ker for one site at a time and see what it yields.\n\nsource('code/df_spline.R') # sources the cript to make greenness predictor\nhead(df_green) # how the dataset looks\n\n site iso ker\n1 BLAN 1 0\n2 BLAN 2 0\n3 BLAN 3 0\n4 BLAN 4 0\n5 BLAN 5 0\n6 BLAN 6 0\n\n# The function to create the environmental predictor similar to iso-week and sin wave\nfx.green <- function(datetime, site, site_info = df_green){\n ker <- NULL\n iso <- fx.iso_week(datetime) # identify iso week\n df.iso <- cbind.data.frame(datetime, iso) # combine date with iso week\n sites.ker <- subset(site_info, site == site)[,2:3] # obtain kernel for location\n df.green <- df.iso %>% left_join(sites.ker, by = 'iso') # join dataframes by iso week\n ker <- df.green$ker # return kernel\n return(ker)\n}\n\n\nChoose a site\nset up X3 using fx_green\nScale X3\n\nSetting up the target dataframe\n\n# Obtaining site name\nsite_names <- unique(target$site_id)\n\n# Subsetting all the data at that location\ndf <- subset(target, target$site_id == site_names[site_number])\n\n# extracting only the datetime and obs columns\ndf <- df[, c(\"datetime\", \"observation\")]\n\n# Selecting a date before which we consider everything as training data and after this is testing data.\ncutoff = as.Date('2020-12-31')\ndf_train <- subset(df, df$datetime <= cutoff)\ndf_test <- subset(df, df$datetime > cutoff)\n\nAdding Greenness\n\n# Choose location \nsite_number = 7\ndf_green_site2 <- subset(df_green, site == site_names[site_number])\n\n# Setting up iso-week and sin wave predictors by calling the functions\nX1 <- fx.iso_week(df_train$datetime) # range is 1-53\nX2 <- fx.sin(df_train$datetime) # range is 0 to 1\n\n# you need datetime, site name and the df_green dataset.\nX3 <- fx.green(df_train$datetime, site = site_names[site_number], site_info = df_green_site2)\n\n# Centering the iso-week by diving by 53\nX1c <- X1/ 53\n\n# Scale X3\nX3c <- (X3 - min(X3))/ (max(X3)- 
min(X3))\n\n# We combine columns centered X1 and X2, into a matrix as our input space\nX <- as.matrix(cbind.data.frame(X1c, X2, X3c))\nhead(X)\n\n X1c X2 X3c\n[1,] 0.3584906 0.8150439 0.84585476\n[2,] 0.3962264 0.8974272 0.97749530\n[3,] 0.4528302 0.9782005 0.96806621\n[4,] 0.5094340 0.9991219 0.75535447\n[5,] 0.6226415 0.8587536 0.23957183\n[6,] 0.6792453 0.7150326 0.09822483\n\ny_obs <- df_train$observation\ny <- f(y_obs) # transform y\n\n# A very small value for stability\neps <- sqrt(.Machine$double.eps) \n \n# Priors for theta and g. \nd <- darg(list(mle=TRUE, min =eps, max=5), X)\ng <- garg(list(mle=TRUE, min = eps, max = 1), y)\n\n# Fitting a GP with our data, and some starting values for theta and g\ngpi <- newGPsep(X, y, d = 0.1, g = 1, dK = T)\n\n# Jointly infer MLE for all parameters\nmle <- jmleGPsep(gpi, drange = c(d$min, d$max), grange = c(g$min, g$max), \n dab = d$ab, gab= g$ab)\n\n# Create a grid from start date in our data set to one year in future (so we forecast for next season)\nstartdate <- as.Date(min(df$datetime))# identify start week\ngrid_datetime <- seq.Date(startdate, Sys.Date() + 365, by = 7) # create sequence\n\n# Build the input space for the predictive space (All weeks from 04-2014 to 07-2025)\nXXt1 <- fx.iso_week(grid_datetime)\nXXt2 <- fx.sin(grid_datetime)\nXXt3 <- fx.green(grid_datetime, site = site_names[site_nunber], site_info = df_green_site2)\n\n# Standardize\nXXt1c <- XXt1/53\nXXt3 <- (XXt3 - min(XXt3))/ (max(XXt3)- min(XXt3))\n\n# Store inputs as a matrix\nXXt <- as.matrix(cbind.data.frame(XXt1c, XXt2, XXt3))\n\n# Make predictions using predGP with the gp object and the predictive set\nppt <- predGPsep(gpi, XXt) \n\n# Now we store the mean as our predicted response i.e. density along with quantiles\nyyt <- ppt$mean\nq1t <- ppt$mean + qnorm(0.025,0,sqrt(diag(ppt$Sigma))) #lower bound\nq2t <- ppt$mean + qnorm(0.975,0,sqrt(diag(ppt$Sigma))) # upper bound\n\n# Back transform our data to original\ngp_yy <- fi(yyt)\ngp_q1 <- fi(q1t)\ngp_q2 <- fi(q2t)\n\n# Plot the observed points\nplot(as.Date(df$datetime), df$observation,\n main = paste(site_names[site_number]), col = \"black\",\n xlab = \"Dates\" , ylab = \"Abundance\",\n # xlim = c(as.Date(min(df$datetime)), as.Date(cutoff)),\n ylim = c(min(df_train$observation, gp_yy, gp_q1), max(df_train$observation, gp_yy, gp_q2)* 1.05))\n\n# Plot the testing set data \npoints(as.Date(df_test$datetime), df_test$observation, col =\"black\", pch = 19)\n\n# Line to indicate seperation between train and test data\nabline(v = as.Date(cutoff), lwd = 2)\n\n# Add the predicted response and the quantiles\nlines(grid_datetime, gp_yy, col = 4, lwd = 2)\nlines(grid_datetime, gp_q1, col = 4, lwd = 1.2, lty = 2)\nlines(grid_datetime, gp_q2, col = 4, lwd = 1.2, lty =2)\n\n\n\n\n\n\n\n# Obtain true observed values for testing set\nyt_true <- f(df_test$observation)\n\n# FInd corresponding predictions from our model in the grid we predicted on\nyt_pred <- yyt[which(grid_datetime %in% df_test$datetime)]\n\n# calculate RMSE\nrmse <- sqrt(mean((yt_true - yt_pred)^2))\nrmse\n\n[1] 0.9532001\n\n\n\n\nFit a GP Model for all the locations (More advanced).\n\n# GP function. This can be varied but easiest way is to just take in X, y, XX and return the predicted means and bounds. \n\ngpfit <- function(X, y , XXt){\n eps <- sqrt(.Machine$double.eps) \n \n # Priors for theta and g. 
\n d <- darg(list(mle=TRUE, min =eps, max=5), X)\n g <- garg(list(mle=TRUE, min = eps, max = 1), y)\n\n # Fitting a GP with our data, and some starting values for theta and g\n gpi <- newGPsep(X, y, d = 0.1, g = 1, dK = T)\n\n # Jointly infer MLE for all parameters\n mle <- jmleGPsep(gpi, drange = c(d$min, d$max), grange = c(g$min, g$max), \n dab = d$ab, gab= g$ab)\n\n ppt <- predGPsep(gpi, XXt) \n\n # Now we store the mean as our predicted response i.e. density along with quantiles\n yyt <- ppt$mean\n q1t <- ppt$mean + qnorm(0.025,0,sqrt(diag(ppt$Sigma))) #lower bound\n q2t <- ppt$mean + qnorm(0.975,0,sqrt(diag(ppt$Sigma))) # upper bound\n\n # Back transform our data to original\n gp_yy <- fi(yyt)\n gp_q1 <- fi(q1t)\n gp_q2 <- fi(q2t)\n \n return(list(mean = gp_yy, s2 = diag(ppt$Sigma), q1 = gp_q1, q2 = gp_q2))\n}\n\n\nsite_number <- 7 # (site_number = 4) for the other challenge\n\n# Obtaining site name\nsite_names <- unique(target$site_id)\n\n# extracting only the datetime and obs columns\ndf <- target[, c(\"datetime\", \"site_id\", \"observation\")]\n\ncutoff = as.Date('2020-12-31')\n\n# This was always be prediction set\nstartdate <- as.Date(min(df$datetime))# identify start week\ngrid_datetime <- seq.Date(startdate, Sys.Date() + 365, by = 7) # create sequence\n\n# You can pre process to have y transformed or have it in the loop.\nrmse <- matrix(nrow = length(site_names), ncol = 1) # if rmse\n\nfor(i in 1:length(site_names)){\n \n df_site <- subset(df, site_id == site_names[i])\n \n # cutoff for sites\n df_train <- subset(df_site, df_site$datetime <= cutoff)\n df_test <- subset(df_site, df_site$datetime > cutoff)\n \n df_green_site <- subset(df_green, site == site_names[i])\n \n X1 <- fx.iso_week(df_train$datetime) # range is 1-53\n X2 <- fx.sin(df_train$datetime) # range is 0 to 1\n X3 <- fx.green(df_train$datetime, site = site_names[site_number], site_info = df_green_site) # optional add\n\n X1c <- X1/ 53\n X3c <- (X3 - min(X3))/ (max(X3)- min(X3))\n X <- as.matrix(cbind.data.frame(X1c, X2, X3c))\n\n y_obs <- df_train$observation # only at this location\n y <- f(y_obs) # transform y\n\n XXt1 <- fx.iso_week(grid_datetime)\n XXt2 <- fx.sin(grid_datetime)\n XXt3 <- fx.green(grid_datetime, site = site_names[site_nunber], site_info =\n df_green_site)\n\n # Standardize\n XXt1c <- XXt1/53\n XXt3 <- (XXt3 - min(XXt3))/ (max(XXt3)- min(XXt3))\n XXt <- as.matrix(cbind.data.frame(XXt1c, XXt2, XXt3))\n \n fit <- gpfit(X = X, y = y, XX = XXt)\n \n # Make plots\n plot(as.Date(df_site$datetime), df_site$observation,\n main = paste(site_names[i]), col = \"black\",\n xlab = \"Dates\" , ylab = \"Abundance\",\n # xlim = c(as.Date(min(df$datetime)), as.Date(cutoff)),\n ylim = c(min(df_train$observation, df_test$observation, fit$q1),\n max(df_train$observation,df_test$observation, fit$q2)* 1.05))\n\n points(as.Date(df_test$datetime), df_test$observation, col =\"black\", pch = 19)\n abline(v = as.Date(cutoff), lwd = 2)\n\n # Add the predicted response and the quantiles\n lines(grid_datetime, fit$mean, col = 4, lwd = 2)\n lines(grid_datetime, fit$q1, col = 4, lwd = 1.2, lty = 2)\n lines(grid_datetime, fit$q2, col = 4, lwd = 1.2, lty =2)\n \n \n yt_true <- f(df_test$observation)\n yt_pred <- f(fit$mean[which(grid_datetime %in% df_test$datetime)])\n\n # calculate RMSE\n rmse[i, ] <- sqrt(mean((yt_true - yt_pred)^2))\n}\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nrownames(rmse) <- site_names\nprint(rmse)\n\n [,1]\nBLAN 
1.1173576\nKONZ 0.7779042\nLENO 0.6851390\nORNL 0.9205952\nOSBS 1.2057049\nSCBI 0.8610973\nSERC 0.9532001\nTALL 0.8883320\nUKFS 0.9521605" + "text": "Libraries\n\nlibrary(mvtnorm)\nlibrary(laGP)\nlibrary(hetGP)\nlibrary(ggplot2)\n\n\n\nHetGP (sin wave eg)\n\n# Your turn\nset.seed(26)\nn <- 8 # number of points\nX <- matrix(seq(0, 2*pi, length= n), ncol=1) # build inputs \ny <- 5*sin(X) + rnorm(n, 0 , 2) # response with some noise\n\n# Predict on this set\nXX <- matrix(seq(-0.5, 2*pi + 0.5, length= 100), ncol=1)\n\n# Data visualization\nplot(X, y)\n\n\n\n\n\n\n\n# ------ Solutions ------------------------------\n\nhet_fit <- hetGP::mleHetGP(X, y)\nhet_pred <- predict(het_fit, XX)\n\nmean <- het_pred$mean\ns2 <- het_pred$sd2 + het_pred$nugs\n\nyy <- 5*sin(XX)\n\npar(mfrow = c(1, 1), mar = c(4, 4, 4, 1))\nplot(X, y, ylim = c(-10, 10))\nlines(XX, yy, col = 3)\nlines(XX, mean, col = 2)\nlines(XX, mean + 2 * sqrt(s2), col = 4)\nlines(XX, mean - 2 * sqrt(s2), col = 4)\n\n\n\n\n\n\n\n# You can check the nuggets (each one will be different)\nnugs <- het_pred$nugs\nsummary(nugs)\n\n Min. 1st Qu. Median Mean 3rd Qu. Max. \n 3.904 3.927 3.931 3.936 3.949 3.971 \n\n\n\n\nChallenges\nWe need to load the data and the functions\n\n# Pulling the data from the NEON data base. \ntarget <- readr::read_csv(\"https://data.ecoforecast.org/neon4cast-targets/ticks/ticks-targets.csv.gz\", guess_max = 1e1)\n\nRows: 637 Columns: 5\n── Column specification ────────────────────────────────────────────────────────\nDelimiter: \",\"\nchr (3): site_id, variable, iso_week\ndbl (1): observation\ndate (1): datetime\n\nℹ Use `spec()` to retrieve the full column specification for this data.\nℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.\n\n# transforms y\nf <- function(x) {\n y <- log(x + 1)\n return(y)\n}\n\n# This function back transforms the input argument\nfi <- function(y) {\n x <- exp(y) - 1\n return(x)\n}\n\n# This function tells us the iso-week number given the date\nfx.iso_week <- function(datetime){\n # Gives ISO-week in the format yyyy-w## and we extract the ##\n x1 <- as.numeric(stringr::str_sub(ISOweek::ISOweek(datetime), 7, 8)) # find iso week #\n return(x1)\n}\n\nfx.sin <- function(datetime, f1 = fx.iso_week){\n # identify iso week#\n x <- f1(datetime) \n # calculate sin value for that week\n x2 <- (sin(2*pi*x/106))^2 \n return(x2)\n}\n\n\nFit a GP Model for the location “SERC” i.e. 
site_number = 7.\nJust change site = 7\n\nsite_number <- 7 # (site_number = 4) for the other challenge\n\n# Obtaining site name\nsite_names <- unique(target$site_id)\n\n# Subsetting all the data at that location\ndf <- subset(target, target$site_id == site_names[site_number])\n\n# extracting only the datetime and obs columns\ndf <- df[, c(\"datetime\", \"observation\")]\n\n# Selecting a date before which we consider everything as training data and after this is testing data.\ncutoff = as.Date('2020-12-31')\ndf_train <- subset(df, df$datetime <= cutoff)\ndf_test <- subset(df, df$datetime > cutoff)\n\n# Setting up iso-week and sin wave predictors by calling the functions\nX1 <- fx.iso_week(df_train$datetime) # range is 1-53\nX2 <- fx.sin(df_train$datetime) # range is 0 to 1\n\n# Centering the iso-week by diving by 53\nX1c <- X1/ 53\n\n# We combine columns centered X1 and X2, into a matrix as our input space\nX <- as.matrix(cbind.data.frame(X1c, X2))\nhead(X)\n\n X1c X2\n[1,] 0.3584906 0.8150439\n[2,] 0.3962264 0.8974272\n[3,] 0.4528302 0.9782005\n[4,] 0.5094340 0.9991219\n[5,] 0.6226415 0.8587536\n[6,] 0.6792453 0.7150326\n\ny_obs <- df_train$observation\ny <- f(y_obs) # transform y\n\n# A very small value for stability\neps <- sqrt(.Machine$double.eps) \n \n# Priors for theta and g. \nd <- darg(list(mle=TRUE, min =eps, max=5), X)\ng <- garg(list(mle=TRUE, min = eps, max = 1), y)\n\n# Fitting a GP with our data, and some starting values for theta and g\ngpi <- newGPsep(X, y, d = 0.1, g = 1, dK = T)\n\n# Jointly infer MLE for all parameters\nmle <- jmleGPsep(gpi, drange = c(d$min, d$max), grange = c(g$min, g$max), \n dab = d$ab, gab= g$ab)\n\n# Create a grid from start date in our data set to one year in future (so we forecast for next season)\nstartdate <- as.Date(min(df$datetime))# identify start week\ngrid_datetime <- seq.Date(startdate, Sys.Date() + 365, by = 7) # create sequence\n\n# Build the input space for the predictive space (All weeks from 04-2014 to 07-2025)\nXXt1 <- fx.iso_week(grid_datetime)\nXXt2 <- fx.sin(grid_datetime)\n\n# Standardize\nXXt1c <- XXt1/53\n\n# Store inputs as a matrix\nXXt <- as.matrix(cbind.data.frame(XXt1c, XXt2))\n\n# Make predictions using predGP with the gp object and the predictive set\nppt <- predGPsep(gpi, XXt) \n\n# Now we store the mean as our predicted response i.e. 
density along with quantiles\nyyt <- ppt$mean\nq1t <- ppt$mean + qnorm(0.025,0,sqrt(diag(ppt$Sigma))) #lower bound\nq2t <- ppt$mean + qnorm(0.975,0,sqrt(diag(ppt$Sigma))) # upper bound\n\n# Back transform our data to original\ngp_yy <- fi(yyt)\ngp_q1 <- fi(q1t)\ngp_q2 <- fi(q2t)\n\n# Plot the observed points\nplot(as.Date(df$datetime), df$observation,\n main = paste(site_names[site_number]), col = \"black\",\n xlab = \"Dates\" , ylab = \"Abundance\",\n # xlim = c(as.Date(min(df$datetime)), as.Date(cutoff)),\n ylim = c(min(df_train$observation, gp_yy, gp_q1), max(df_train$observation, gp_yy, gp_q2)* 1.05))\n\n# Plot the testing set data \npoints(as.Date(df_test$datetime), df_test$observation, col =\"black\", pch = 19)\n\n# Line to indicate seperation between train and test data\nabline(v = as.Date(cutoff), lwd = 2)\n\n# Add the predicted response and the quantiles\nlines(grid_datetime, gp_yy, col = 4, lwd = 2)\nlines(grid_datetime, gp_q1, col = 4, lwd = 1.2, lty = 2)\nlines(grid_datetime, gp_q2, col = 4, lwd = 1.2, lty =2)\n\n\n\n\n\n\n\n# Obtain true observed values for testing set\nyt_true <- f(df_test$observation)\n\n# FInd corresponding predictions from our model in the grid we predicted on\nyt_pred <- yyt[which(grid_datetime %in% df_test$datetime)]\n\n# calculate RMSE\nrmse <- sqrt(mean((yt_true - yt_pred)^2))\nrmse\n\n[1] 0.9553652\n\n\n\n\nUse an environmental predictor in your model. Following is a function fx.green that creates the variable given the datetime and the location.\nHere is a snippet of the supporting file that you will use; You can look into the data.frame and try to plot ker for one site at a time and see what it yields.\n\nsource('code/df_spline.R') # sources the cript to make greenness predictor\nhead(df_green) # how the dataset looks\n\n site iso ker\n1 BLAN 1 0\n2 BLAN 2 0\n3 BLAN 3 0\n4 BLAN 4 0\n5 BLAN 5 0\n6 BLAN 6 0\n\n# The function to create the environmental predictor similar to iso-week and sin wave\nfx.green <- function(datetime, site, site_info = df_green){\n ker <- NULL\n iso <- fx.iso_week(datetime) # identify iso week\n df.iso <- cbind.data.frame(datetime, iso) # combine date with iso week\n sites.ker <- subset(site_info, site == site)[,2:3] # obtain kernel for location\n df.green <- df.iso %>% left_join(sites.ker, by = 'iso') # join dataframes by iso week\n ker <- df.green$ker # return kernel\n return(ker)\n}\n\n\nChoose a site\nset up X3 using fx_green\nScale X3\n\nSetting up the target dataframe\n\n# Obtaining site name\nsite_names <- unique(target$site_id)\n\n# Subsetting all the data at that location\ndf <- subset(target, target$site_id == site_names[site_number])\n\n# extracting only the datetime and obs columns\ndf <- df[, c(\"datetime\", \"observation\")]\n\n# Selecting a date before which we consider everything as training data and after this is testing data.\ncutoff = as.Date('2020-12-31')\ndf_train <- subset(df, df$datetime <= cutoff)\ndf_test <- subset(df, df$datetime > cutoff)\n\nAdding Greenness\n\n# Choose location \nsite_number = 7\ndf_green_site2 <- subset(df_green, site == site_names[site_number])\n\n# Setting up iso-week and sin wave predictors by calling the functions\nX1 <- fx.iso_week(df_train$datetime) # range is 1-53\nX2 <- fx.sin(df_train$datetime) # range is 0 to 1\n\n# you need datetime, site name and the df_green dataset.\nX3 <- fx.green(df_train$datetime, site = site_names[site_number], site_info = df_green_site2)\n\n# Centering the iso-week by diving by 53\nX1c <- X1/ 53\n\n# Scale X3\nX3c <- (X3 - min(X3))/ (max(X3)- 
min(X3))\n\n# We combine columns centered X1 and X2, into a matrix as our input space\nX <- as.matrix(cbind.data.frame(X1c, X2, X3c))\nhead(X)\n\n X1c X2 X3c\n[1,] 0.3584906 0.8150439 0.84585476\n[2,] 0.3962264 0.8974272 0.97749530\n[3,] 0.4528302 0.9782005 0.96806621\n[4,] 0.5094340 0.9991219 0.75535447\n[5,] 0.6226415 0.8587536 0.23957183\n[6,] 0.6792453 0.7150326 0.09822483\n\ny_obs <- df_train$observation\ny <- f(y_obs) # transform y\n\n# A very small value for stability\neps <- sqrt(.Machine$double.eps) \n \n# Priors for theta and g. \nd <- darg(list(mle=TRUE, min =eps, max=5), X)\ng <- garg(list(mle=TRUE, min = eps, max = 1), y)\n\n# Fitting a GP with our data, and some starting values for theta and g\ngpi <- newGPsep(X, y, d = 0.1, g = 1, dK = T)\n\n# Jointly infer MLE for all parameters\nmle <- jmleGPsep(gpi, drange = c(d$min, d$max), grange = c(g$min, g$max), \n dab = d$ab, gab= g$ab)\n\n# Create a grid from start date in our data set to one year in future (so we forecast for next season)\nstartdate <- as.Date(min(df$datetime))# identify start week\ngrid_datetime <- seq.Date(startdate, Sys.Date() + 365, by = 7) # create sequence\n\n# Build the input space for the predictive space (All weeks from 04-2014 to 07-2025)\nXXt1 <- fx.iso_week(grid_datetime)\nXXt2 <- fx.sin(grid_datetime)\nXXt3 <- fx.green(grid_datetime, site = site_names[site_nunber], site_info = df_green_site2)\n\n# Standardize\nXXt1c <- XXt1/53\nXXt3 <- (XXt3 - min(XXt3))/ (max(XXt3)- min(XXt3))\n\n# Store inputs as a matrix\nXXt <- as.matrix(cbind.data.frame(XXt1c, XXt2, XXt3))\n\n# Make predictions using predGP with the gp object and the predictive set\nppt <- predGPsep(gpi, XXt) \n\n# Now we store the mean as our predicted response i.e. density along with quantiles\nyyt <- ppt$mean\nq1t <- ppt$mean + qnorm(0.025,0,sqrt(diag(ppt$Sigma))) #lower bound\nq2t <- ppt$mean + qnorm(0.975,0,sqrt(diag(ppt$Sigma))) # upper bound\n\n# Back transform our data to original\ngp_yy <- fi(yyt)\ngp_q1 <- fi(q1t)\ngp_q2 <- fi(q2t)\n\n# Plot the observed points\nplot(as.Date(df$datetime), df$observation,\n main = paste(site_names[site_number]), col = \"black\",\n xlab = \"Dates\" , ylab = \"Abundance\",\n # xlim = c(as.Date(min(df$datetime)), as.Date(cutoff)),\n ylim = c(min(df_train$observation, gp_yy, gp_q1), max(df_train$observation, gp_yy, gp_q2)* 1.05))\n\n# Plot the testing set data \npoints(as.Date(df_test$datetime), df_test$observation, col =\"black\", pch = 19)\n\n# Line to indicate seperation between train and test data\nabline(v = as.Date(cutoff), lwd = 2)\n\n# Add the predicted response and the quantiles\nlines(grid_datetime, gp_yy, col = 4, lwd = 2)\nlines(grid_datetime, gp_q1, col = 4, lwd = 1.2, lty = 2)\nlines(grid_datetime, gp_q2, col = 4, lwd = 1.2, lty =2)\n\n\n\n\n\n\n\n# Obtain true observed values for testing set\nyt_true <- f(df_test$observation)\n\n# FInd corresponding predictions from our model in the grid we predicted on\nyt_pred <- yyt[which(grid_datetime %in% df_test$datetime)]\n\n# calculate RMSE\nrmse <- sqrt(mean((yt_true - yt_pred)^2))\nrmse\n\n[1] 0.9532001\n\n\n\n\nFit a GP Model for all the locations (More advanced).\n\n# GP function. This can be varied but easiest way is to just take in X, y, XX and return the predicted means and bounds. \n\ngpfit <- function(X, y , XXt){\n eps <- sqrt(.Machine$double.eps) \n \n # Priors for theta and g. 
\n d <- darg(list(mle=TRUE, min = eps, max = 5), X)\n g <- garg(list(mle=TRUE, min = eps, max = 1), y)\n\n # Fitting a GP with our data, and some starting values for theta and g\n gpi <- newGPsep(X, y, d = 0.1, g = 1, dK = TRUE)\n\n # Jointly infer MLE for all parameters\n mle <- jmleGPsep(gpi, drange = c(d$min, d$max), grange = c(g$min, g$max), \n dab = d$ab, gab = g$ab)\n\n ppt <- predGPsep(gpi, XXt) \n\n # Free the C-side GP object; important when fitting many GPs in a loop\n deleteGPsep(gpi)\n\n # Now we store the mean as our predicted response i.e. density along with quantiles\n yyt <- ppt$mean\n q1t <- ppt$mean + qnorm(0.025, 0, sqrt(diag(ppt$Sigma))) # lower bound\n q2t <- ppt$mean + qnorm(0.975, 0, sqrt(diag(ppt$Sigma))) # upper bound\n\n # Back transform our predictions to the original scale\n gp_yy <- fi(yyt)\n gp_q1 <- fi(q1t)\n gp_q2 <- fi(q2t)\n \n return(list(mean = gp_yy, s2 = diag(ppt$Sigma), q1 = gp_q1, q2 = gp_q2))\n}\n\n\nsite_number <- 7 # (use site_number = 4 for the other challenge)\n\n# Obtaining site names\nsite_names <- unique(target$site_id)\n\n# Extracting the datetime, site_id and obs columns\ndf <- target[, c(\"datetime\", \"site_id\", \"observation\")]\n\ncutoff = as.Date('2020-12-31')\n\n# This will always be the prediction set\nstartdate <- as.Date(min(df$datetime)) # identify start week\ngrid_datetime <- seq.Date(startdate, Sys.Date() + 365, by = 7) # create sequence\n\n# You can pre-process to have y transformed, or do it in the loop.\nrmse <- matrix(nrow = length(site_names), ncol = 1) # to store the RMSE for each site\n\nfor(i in 1:length(site_names)){\n \n df_site <- subset(df, site_id == site_names[i])\n \n # cutoff for sites\n df_train <- subset(df_site, df_site$datetime <= cutoff)\n df_test <- subset(df_site, df_site$datetime > cutoff)\n \n df_green_site <- subset(df_green, site == site_names[i])\n \n X1 <- fx.iso_week(df_train$datetime) # range is 1-53\n X2 <- fx.sin(df_train$datetime) # range is 0 to 1\n X3 <- fx.green(df_train$datetime, site = site_names[i], site_info = df_green_site) # optional addition\n\n X1c <- X1/ 53\n X3c <- (X3 - min(X3))/ (max(X3)- min(X3))\n X <- as.matrix(cbind.data.frame(X1c, X2, X3c))\n\n y_obs <- df_train$observation # only at this location\n y <- f(y_obs) # transform y\n\n XXt1 <- fx.iso_week(grid_datetime)\n XXt2 <- fx.sin(grid_datetime)\n XXt3 <- fx.green(grid_datetime, site = site_names[i], site_info =\n df_green_site)\n\n # Scale, as with the training inputs\n XXt1c <- XXt1/53\n XXt3 <- (XXt3 - min(XXt3))/ (max(XXt3)- min(XXt3))\n XXt <- as.matrix(cbind.data.frame(XXt1c, XXt2, XXt3))\n \n fit <- gpfit(X = X, y = y, XXt = XXt)\n \n # Make plots\n plot(as.Date(df_site$datetime), df_site$observation,\n main = paste(site_names[i]), col = \"black\",\n xlab = \"Dates\", ylab = \"Abundance\",\n # xlim = c(as.Date(min(df$datetime)), as.Date(cutoff)),\n ylim = c(min(df_train$observation, df_test$observation, fit$q1),\n max(df_train$observation, df_test$observation, fit$q2)* 1.05))\n\n points(as.Date(df_test$datetime), df_test$observation, col = \"black\", pch = 19)\n abline(v = as.Date(cutoff), lwd = 2)\n\n # Add the predicted response and the quantiles\n lines(grid_datetime, fit$mean, col = 4, lwd = 2)\n lines(grid_datetime, fit$q1, col = 4, lwd = 1.2, lty = 2)\n lines(grid_datetime, fit$q2, col = 4, lwd = 1.2, lty = 2)\n \n \n yt_true <- f(df_test$observation)\n yt_pred <- f(fit$mean[which(grid_datetime %in% df_test$datetime)]) # back to the transformed scale\n\n # Calculate RMSE\n rmse[i, ] <- sqrt(mean((yt_true - yt_pred)^2))\n}\n\n\nrownames(rmse) <- site_names\nprint(rmse)\n\n [,1]\nBLAN 
1.1173576\nKONZ 0.7779042\nLENO 0.6851390\nORNL 0.9205952\nOSBS 1.2057049\nSCBI 0.8610973\nSERC 0.9532001\nTALL 1.2216955\nUKFS 2.0705090" }, { "objectID": "Stats_review_soln.html", @@ -811,6 +811,62 @@ "section": "References", "text": "References\n\n\n\n\n\n\n\n\nBinois, Mickael, Robert B Gramacy, and Mike Ludkovski. 2018. “Practical Heteroscedastic Gaussian Process Modeling for Large Simulation Experiments.” Journal of Computational and Graphical Statistics 27 (4): 808–21.\n\n\nGramacy, Robert B. 2020. Surrogates: Gaussian Process Modeling, Design, and Optimization for the Applied Sciences. Chapman; Hall/CRC.\n\n\nThomas, R Quinn, Carl Boettiger, Cayelan C Carey, Michael C Dietze, Leah R Johnson, Melissa A Kenney, Jason S Mclachlan, et al. 2022. “The NEON Ecological Forecasting Challenge.” Authorea Preprints." }, + { + "objectID": "VB_TimeDepData_practical_soln.html#exploring-the-data", + "href": "VB_TimeDepData_practical_soln.html#exploring-the-data", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Exploring the Data", + "text": "Exploring the Data\nAs always, we first want to take a look at the data, to make sure we understand it, and that we don’t have missing or weird values.\n\nmozData<-read.csv(\"data/Culex_erraticus_walton_covariates_aggregated.csv\")\nsummary(mozData)\n\n Month_Yr sample_value MaxTemp Precip \n Length:36 Min. :0.00000 Min. :16.02 Min. : 0.000 \n Class :character 1st Qu.:0.04318 1st Qu.:22.99 1st Qu.: 2.162 \n Mode :character Median :0.73001 Median :26.69 Median : 4.606 \n Mean :0.80798 Mean :26.23 Mean : 5.595 \n 3rd Qu.:1.22443 3rd Qu.:30.70 3rd Qu.: 7.864 \n Max. :3.00595 Max. :33.31 Max. :18.307 \n\n\nWe can see that the minimum observed average number of mosquitoes is zero, and the max is only 3 (there are likely many zeros averaged over many days in the month). There don’t appear to be any NAs in the data. 
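A quick programmatic check (a minimal base-R sketch on the mozData frame loaded above) should return a zero count for each of the four columns:\n\ncolSums(is.na(mozData))\n\n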
In this case the dataset itself is small enough that we can print the whole thing to ensure it’s complete:\n\nmozData\n\n Month_Yr sample_value MaxTemp Precip\n1 2015-01 0.000000000 17.74602 3.303991888\n2 2015-02 0.018181818 17.87269 16.544265802\n3 2015-03 0.468085106 23.81767 2.405651215\n4 2015-04 1.619047619 26.03559 8.974406168\n5 2015-05 0.821428571 30.01602 0.567960943\n6 2015-06 3.005952381 31.12094 4.841342729\n7 2015-07 2.380952381 32.81130 3.849010353\n8 2015-08 1.826347305 32.56245 5.562845324\n9 2015-09 0.648809524 30.55155 10.409724627\n10 2015-10 0.988023952 27.22605 0.337750269\n11 2015-11 0.737804878 24.86768 18.306749680\n12 2015-12 0.142857143 22.46588 5.621475377\n13 2016-01 0.000000000 16.02406 3.550622029\n14 2016-02 0.020202020 19.42057 11.254680803\n15 2016-03 0.015151515 23.13610 4.785664728\n16 2016-04 0.026143791 24.98082 4.580424519\n17 2016-05 0.025252525 28.72884 0.053057634\n18 2016-06 0.833333333 30.96990 6.155417473\n19 2016-07 1.261363636 33.30509 4.496368193\n20 2016-08 1.685279188 32.09633 11.338749182\n21 2016-09 2.617142857 31.60575 2.868288451\n22 2016-10 1.212121212 29.14275 0.000000000\n23 2016-11 1.539772727 24.48482 0.005462681\n24 2016-12 0.771573604 20.46054 11.615521725\n25 2017-01 0.045454545 18.35473 0.000000000\n26 2017-02 0.036363636 23.65584 3.150710053\n27 2017-03 0.194285714 22.53573 1.430094952\n28 2017-04 0.436548223 26.15299 0.499381616\n29 2017-05 1.202020202 28.00173 6.580562663\n30 2017-06 0.834196891 29.48951 13.333939858\n31 2017-07 1.765363128 32.25135 7.493927035\n32 2017-08 0.744791667 31.86476 6.082113434\n33 2017-09 0.722222222 30.60566 4.631037395\n34 2017-10 0.142131980 27.73453 11.567112214\n35 2017-11 0.289772727 23.23140 1.195760473\n36 2017-12 0.009174312 18.93603 4.018254442" }, + { + "objectID": "VB_TimeDepData_practical_soln.html#plotting-the-data", + "href": "VB_TimeDepData_practical_soln.html#plotting-the-data", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Plotting the data", + "text": "Plotting the data\nFirst we’ll examine the data itself, including the predictors:\n\nmonths<-dim(mozData)[1]\nt<-1:months ## counter for months in the data set\npar(mfrow=c(3,1))\nplot(t, mozData$sample_value, type=\"l\", lwd=2, \n main=\"Average Monthly Abundance\", \n xlab =\"Time (months)\", \n ylab = \"Average Count\")\nplot(t, mozData$MaxTemp, type=\"l\",\n col = 2, lwd=2, \n main=\"Average Maximum Temp\", \n xlab =\"Time (months)\", \n ylab = \"Temperature (C)\")\nplot(t, mozData$Precip, type=\"l\",\n col=\"dodgerblue\", lwd=2,\n main=\"Average Monthly Precip\", \n xlab =\"Time (months)\", \n ylab = \"Precipitation (in)\")\n\n\nVisually, we notice that there may be a bit of clumping in the abundance values (this is subtle) – in particular, since we have a lot of very small/nearly-zero counts, a transform, such as a square root, may spread the abundances out. It also looks like both the abundance and temperature data are more cyclical than the precipitation, and thus more likely to be related to each other. Visually there’s also not much indication of a trend, but it’s usually worthwhile to consider one anyway. 
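One quick numeric check on these visual impressions (a sketch using base R’s cor on the columns above):\n\ncor(mozData$sample_value, mozData$MaxTemp) ## abundance vs. temperature\ncor(mozData$sample_value, mozData$Precip) ## abundance vs. precipitation\n\nIf the cyclical patterns really do line up, the first correlation should be clearly the larger of the two. 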
Replotting the abundance data with a transformation:\n\nmonths<-dim(mozData)[1]\nt<-1:months ## counter for months in the data set\nplot(t, sqrt(mozData$sample_value), type=\"l\", lwd=2, \n main=\"Sqrt Average Monthly Abundance\", \n xlab =\"Time (months)\", \n ylab = \"Average Count\")\n\n\nThat looks a little bit better. I suggest we go with this for our response." }, + { + "objectID": "VB_TimeDepData_practical_soln.html#building-a-data-frame", + "href": "VB_TimeDepData_practical_soln.html#building-a-data-frame", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Building a data frame", + "text": "Building a data frame\nBefore we get into model building, we always want to build a data frame containing all of the predictors that we want to consider, at the potential lags that we’re interested in. In the lecture we saw how to build the AR, sine/cosine, and trend predictors:\n\nt <- 2:months ## to make building the AR1 predictors easier\n\nmozTS <- data.frame(\n Y=sqrt(mozData$sample_value[t]), # transformed response\n Yl1=sqrt(mozData$sample_value[t-1]), # AR1 predictor\n t=t, # trend predictor\n sin12=sin(2*pi*t/12), \n cos12=cos(2*pi*t/12) # periodic predictors\n )\n\nWe will also put in the temperature and precipitation predictors. But we need to think about what might be an appropriate lag. If this were daily or weekly data, we’d probably want a fairly sizable lag – mosquitoes take a while to develop, so the number we see today is not likely related to the temperature today. However, since these data are aggregated across a whole month, as are the temperature/precipitation, the current month’s values are likely to be useful. It’s also possible that last month’s values matter, so we’ll add those in as well:\n\nmozTS$MaxTemp<-mozData$MaxTemp[t] ## current temps\nmozTS$MaxTempl1<-mozData$MaxTemp[t-1] ## previous temps\nmozTS$Precip<-mozData$Precip[t] ## current precip\nmozTS$Precipl1<-mozData$Precip[t-1] ## previous precip\n\nThus our full dataframe:\n\nsummary(mozTS)\n\n Y Yl1 t sin12 \n Min. :0.0000 Min. :0.0000 Min. : 2.0 Min. :-1.00000 \n 1st Qu.:0.2951 1st Qu.:0.2951 1st Qu.:10.5 1st Qu.:-0.68301 \n Median :0.8590 Median :0.8590 Median :19.0 Median : 0.00000 \n Mean :0.7711 Mean :0.7684 Mean :19.0 Mean :-0.01429 \n 3rd Qu.:1.1120 3rd Qu.:1.1120 3rd Qu.:27.5 3rd Qu.: 0.68301 \n Max. :1.7338 Max. :1.7338 Max. :36.0 Max. : 1.00000 \n cos12 MaxTemp MaxTempl1 Precip \n Min. :-1.00000 Min. :16.02 Min. :16.02 Min. : 0.000 \n 1st Qu.:-0.68301 1st Qu.:23.18 1st Qu.:23.18 1st Qu.: 1.918 \n Median : 0.00000 Median :27.23 Median :27.23 Median : 4.631 \n Mean :-0.02474 Mean :26.47 Mean :26.44 Mean : 5.660 \n 3rd Qu.: 0.50000 3rd Qu.:30.79 3rd Qu.:30.79 3rd Qu.: 8.234 \n Max. : 1.00000 Max. :33.31 Max. :33.31 Max. :18.307 \n Precipl1 \n Min. : 0.000 \n 1st Qu.: 1.918 \n Median : 4.631 \n Mean : 5.640 \n 3rd Qu.: 8.234 \n Max. 
:18.307 \n\nhead(mozTS)\n\n Y Yl1 t sin12 cos12 MaxTemp MaxTempl1\n1 0.1348400 0.0000000 2 8.660254e-01 5.000000e-01 17.87269 17.74602\n2 0.6841675 0.1348400 3 1.000000e+00 6.123234e-17 23.81767 17.87269\n3 1.2724180 0.6841675 4 8.660254e-01 -5.000000e-01 26.03559 23.81767\n4 0.9063270 1.2724180 5 5.000000e-01 -8.660254e-01 30.01602 26.03559\n5 1.7337683 0.9063270 6 1.224647e-16 -1.000000e+00 31.12094 30.01602\n6 1.5430335 1.7337683 7 -5.000000e-01 -8.660254e-01 32.81130 31.12094\n Precip Precipl1\n1 16.5442658 3.3039919\n2 2.4056512 16.5442658\n3 8.9744062 2.4056512\n4 0.5679609 8.9744062\n5 4.8413427 0.5679609\n6 3.8490104 4.8413427" }, + { + "objectID": "VB_TimeDepData_practical_soln.html#building-a-first-model", + "href": "VB_TimeDepData_practical_soln.html#building-a-first-model", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Building a first model", + "text": "Building a first model\nWe will first build a very simple model – just a trend – to practice building the model, checking diagnostics, and plotting predictions.\n\nmod1<-lm(Y ~ t, data=mozTS)\nsummary(mod1)\n\n\nCall:\nlm(formula = Y ~ t, data = mozTS)\n\nResiduals:\n Min 1Q Median 3Q Max \n-0.81332 -0.47902 0.03671 0.37384 0.87119 \n\nCoefficients:\n Estimate Std. Error t value Pr(>|t|) \n(Intercept) 0.904809 0.178421 5.071 1.5e-05 ***\nt -0.007038 0.008292 -0.849 0.402 \n---\nSignif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1\n\nResidual standard error: 0.4954 on 33 degrees of freedom\nMultiple R-squared: 0.02136, Adjusted R-squared: -0.008291 \nF-statistic: 0.7204 on 1 and 33 DF, p-value: 0.4021\n\n\nThe model output indicates that this model is not useful – the trend is not significant and it only explains about 2% of the variability. Let’s plot the predictions:\n\n## plot points and fitted lines\nplot(Y~t, data=mozTS, col=1, type=\"l\")\nlines(t, mod1$fitted, col=\"dodgerblue\", lwd=2)\n\n\nNot good – we’ll definitely need to try something else! Remember that since we’re using a linear model for this, we should check our residual plots as usual, and then also plot the acf of the residuals:\n\npar(mfrow=c(1,3), mar=c(4,4,2,0.5)) \n\n## studentized residuals vs fitted\nplot(mod1$fitted, rstudent(mod1), col=1,\n xlab=\"Fitted Values\", \n ylab=\"Studentized Residuals\", \n pch=20, main=\"trend only model\")\n\n## qq plot of studentized residuals\nqqnorm(rstudent(mod1), pch=20, col=1, main=\"\" )\nabline(a=0,b=1,lty=2, col=2)\n\n## histogram of studentized residuals\nhist(rstudent(mod1), col=1, \n xlab=\"Studentized Residuals\", \n main=\"\", border=8)\n\n\nThis doesn’t look too bad, although the histogram might be a bit weird. Finally, the acf:\n\nacf(mod1$residuals)\n\n\nThis is where we can see that we definitely aren’t able to capture the pattern. 
There’s substantial autocorrelation left at a 1 month lag, and around 6 months.\nFinally, moving forward, we can extract the BIC for this model so that we can compare it with the other models that you’ll build next.\n\nn<-length(t)\nextractAIC(mod1, k=log(n))[2]\n\n[1] -44.11057" }, + { + "objectID": "VB_TimeDepData_practical_soln.html#example-solution-ar1-model-only", + "href": "VB_TimeDepData_practical_soln.html#example-solution-ar1-model-only", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Example Solution: AR1 model only", + "text": "Example Solution: AR1 model only\n\nmod2<-lm(Y ~ Yl1, data=mozTS)\nsummary(mod2)\n\n\nCall:\nlm(formula = Y ~ Yl1, data = mozTS)\n\nResiduals:\n Min 1Q Median 3Q Max \n-0.6338 -0.2173 -0.0678 0.2463 0.8675 \n\nCoefficients:\n Estimate Std. Error t value Pr(>|t|) \n(Intercept) 0.2410 0.1130 2.132 0.0405 * \nYl1 0.6899 0.1240 5.562 3.51e-06 ***\n---\nSignif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1\n\nResidual standard error: 0.3598 on 33 degrees of freedom\nMultiple R-squared: 0.4839, Adjusted R-squared: 0.4682 \nF-statistic: 30.94 on 1 and 33 DF, p-value: 3.507e-06\n\n\nThe model is better than the original trend-only model – the AR1 term explains about 48% of the variability. Let’s plot the predictions:\n\n## plot points and fitted lines\nplot(Y~t, data=mozTS, col=1, type=\"l\")\nlines(t, mod2$fitted, col=2, lwd=2)\n\n\nPretty good! Look at all of the diagnostic plots:\n\npar(mfrow=c(1,3), mar=c(4,4,2,0.5)) \n\n## studentized residuals vs fitted\nplot(mod2$fitted, rstudent(mod2), col=2,\n xlab=\"Fitted Values\", \n ylab=\"Studentized Residuals\", \n pch=20, main=\"AR 1 only model\")\n\n## qq plot of studentized residuals\nqqnorm(rstudent(mod2), pch=20, col=2, main=\"\" )\nabline(a=0,b=1,lty=2, col=1)\n\n## histogram of studentized residuals\nhist(rstudent(mod2), col=2, \n xlab=\"Studentized Residuals\", \n main=\"\", border=8)\n\n\nMaybe one outlier, but not too bad.\n\nacf(mod2$residuals)\n\n\nWe seem to have taken care of all of the autoregression, even at multiple lags!\n\nn<-length(t)\nextractAIC(mod2, k=log(n))[2]\n\n[1] -66.50482\n\n\nBIC is much lower – overall a much better model than the first one." }, + { + "objectID": "VB_TimeDepData_practical_soln.html#example-solution-sinecosine-terms-only", + "href": "VB_TimeDepData_practical_soln.html#example-solution-sinecosine-terms-only", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Example Solution: sine/cosine terms only", + "text": "Example Solution: sine/cosine terms only\n\nmod3<-lm(Y ~ sin12 + cos12, data=mozTS)\nsummary(mod3)\n\n\nCall:\nlm(formula = Y ~ sin12 + cos12, data = mozTS)\n\nResiduals:\n Min 1Q Median 3Q Max \n-0.70116 -0.21655 -0.03611 0.19213 0.67992 \n\nCoefficients:\n Estimate Std. Error t value Pr(>|t|) \n(Intercept) 0.75706 0.05750 13.165 1.83e-14 ***\nsin12 -0.38804 0.08072 -4.807 3.48e-05 ***\ncos12 -0.34298 0.08192 -4.187 0.000207 ***\n---\nSignif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1\n\nResidual standard error: 0.3399 on 32 degrees of freedom\nMultiple R-squared: 0.5533, Adjusted R-squared: 0.5254 \nF-statistic: 19.82 on 2 and 32 DF, p-value: 2.512e-06\n\n\nThe model is better than the original trend-only model – it explains about 55% of the variability (we expect R^2 to increase as we add more predictors). 
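Since a sine plus a cosine at the same frequency is just one shifted sinusoid, we can also recover the amplitude and phase of the fitted seasonal cycle from these two coefficients (a quick sketch using the fitted model object):\n\nb <- coef(mod3)[c(\"sin12\", \"cos12\")]\nsqrt(sum(b^2)) ## amplitude of the seasonal cycle (about 0.52 here)\natan2(b[2], b[1]) ## phase shift, in radians\n\n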
Let’s plot the predictions:\n\n## plot points and fitted lines\nplot(Y~t, data=mozTS, col=1, type=\"l\")\nlines(t, mod3$fitted, col=3, lwd=2)\n\n\nPretty good! Look at all of the diagnostic plots:\n\npar(mfrow=c(1,3), mar=c(4,4,2,0.5)) \n\n## studentized residuals vs fitted\nplot(mod3$fitted, rstudent(mod3), col=3,\n xlab=\"Fitted Values\", \n ylab=\"Studentized Residuals\", \n pch=20, main=\"sin/cos only model\")\n\n## qq plot of studentized residuals\nqqnorm(rstudent(mod3), pch=20, col=3, main=\"\" )\nabline(a=0,b=1,lty=2, col=2)\n\n## histogram of studentized residuals\nhist(rstudent(mod3), col=3, \n xlab=\"Studentized Residuals\", \n main=\"\", border=8)\n\n\nMaybe one outlier, but not too bad.\n\nacf(mod3$residuals)\n\n\nWe seem to have taken care of the longer-lag autocorrelation, but there’s still some lag 1 left.\n\nn<-length(t)\nextractAIC(mod3, k=log(n))[2]\n\n[1] -68.00597\n\n\nThis model is even better than the AR1 model. We’ll keep this in mind…" }, + { + "objectID": "VB_TimeDepData_practical_soln.html#example-solution-environmental-predictors-only", + "href": "VB_TimeDepData_practical_soln.html#example-solution-environmental-predictors-only", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Example Solution: environmental predictors only", + "text": "Example Solution: environmental predictors only\nI’ll put in the predictors at the current time period. Since this is monthly averaged data we could probably do either current or lagged.\n\nmod4<-lm(Y ~ MaxTemp + Precip, data=mozTS)\nsummary(mod4)\n\n\nCall:\nlm(formula = Y ~ MaxTemp + Precip, data = mozTS)\n\nResiduals:\n Min 1Q Median 3Q Max \n-0.76043 -0.17925 -0.01671 0.15491 0.64193 \n\nCoefficients:\n Estimate Std. Error t value Pr(>|t|) \n(Intercept) -1.248452 0.323576 -3.858 0.000521 ***\nMaxTemp 0.075450 0.011641 6.481 2.72e-07 ***\nPrecip 0.003928 0.011870 0.331 0.742852 \n---\nSignif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1\n\nResidual standard error: 0.3344 on 32 degrees of freedom\nMultiple R-squared: 0.5676, Adjusted R-squared: 0.5406 \nF-statistic: 21 on 2 and 32 DF, p-value: 1.493e-06\n\n\nThe model is even better than the last – it explains about 57% of the variability, although Precip isn’t significant and we might want to consider dropping it. Let’s plot the predictions:\n\n## plot points and fitted lines\nplot(Y~t, data=mozTS, col=1, type=\"l\")\nlines(t, mod4$fitted, col=4, lwd=2)\n\n\nPretty good! 
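Since Precip wasn’t significant, one natural side check is a temperature-only model, compared via BIC (a quick sketch; mod4b is a name introduced here just for illustration):\n\nmod4b <- lm(Y ~ MaxTemp, data=mozTS)\nn <- length(t)\nextractAIC(mod4b, k=log(n))[2]\n\nIf the BIC drops, the simpler model is preferable; either way, we continue with mod4 here. 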
Look at all of the diagnostic plots:\n\npar(mfrow=c(1,3), mar=c(4,4,2,0.5)) \n\n## studentized residuals vs fitted\nplot(mod4$fitted, rstudent(mod4), col=4,\n xlab=\"Fitted Values\", \n ylab=\"Studentized Residuals\", \n pch=20, main=\"weather model\")\n\n## qq plot of studentized residuals\nqqnorm(rstudent(mod4), pch=20, col=4, main=\"\" )\nabline(a=0,b=1,lty=2, col=2)\n\n## histogram of studentized residuals\nhist(rstudent(mod4), col=4, \n xlab=\"Studentized Residuals\", \n main=\"\", border=8)\n\n\nMaybe one outlier again, but not too bad.\n\nacf(mod4$residuals)\n\n\nWe seem to have taken care of all of the autoregression, except maybe a bit of AR1.\n\nn<-length(t)\nextractAIC(mod4, k=log(n))[2]\n\n[1] -69.14372\n\n\nEven better, although it’s not much different from the sin/cos model." }, + { + "objectID": "VB_TimeDepData_practical_soln.html#example-solution-ar1-plus-sincos", + "href": "VB_TimeDepData_practical_soln.html#example-solution-ar1-plus-sincos", + "title": "VectorByte Methods Training: Regression Methods for Time Dependent Data (practical - solution)", + "section": "Example Solution: AR1 plus sin/cos", + "text": "Example Solution: AR1 plus sin/cos\nOk, now to combine things:\n\nmod5<-lm(Y ~ Yl1 + sin12 + cos12, data=mozTS)\nsummary(mod5)\n\n\nCall:\nlm(formula = Y ~ Yl1 + sin12 + cos12, data = mozTS)\n\nResiduals:\n Min 1Q Median 3Q Max \n-0.49092 -0.25028 -0.02153 0.17287 0.60748 \n\nCoefficients:\n Estimate Std. Error t value Pr(>|t|) \n(Intercept) 0.38035 0.12935 2.940 0.006148 ** \nYl1 0.49652 0.15681 3.166 0.003453 ** \nsin12 -0.13417 0.10729 -1.251 0.220457 \ncos12 -0.29593 0.07386 -4.007 0.000358 ***\n---\nSignif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1\n\nResidual standard error: 0.3002 on 31 degrees of freedom\nMultiple R-squared: 0.6625, Adjusted R-squared: 0.6298 \nF-statistic: 20.28 on 3 and 31 DF, p-value: 1.835e-07\n\n\nThis combined model is the best yet – together the AR1 and seasonal terms explain about 66% of the variability. Let’s plot the predictions:\n\n## plot points and fitted lines\nplot(Y~t, data=mozTS, col=1, type=\"l\")\nlines(t, mod5$fitted, col=5, lwd=2)\n\n\nPretty good! Look at all of the diagnostic plots:\n\npar(mfrow=c(1,3), mar=c(4,4,2,0.5)) \n\n## studentized residuals vs fitted\nplot(mod5$fitted, rstudent(mod5), col=5,\n xlab=\"Fitted Values\", \n ylab=\"Studentized Residuals\", \n pch=20, main=\"AR1 + sin/cos model\")\n\n## qq plot of studentized residuals\nqqnorm(rstudent(mod5), pch=20, col=5, main=\"\" )\nabline(a=0,b=1,lty=2, col=2)\n\n## histogram of studentized residuals\nhist(rstudent(mod5), col=5, \n xlab=\"Studentized Residuals\", \n main=\"\", border=8)\n\n\nThat’s really good!\n\nacf(mod5$residuals)\n\n\nWe seem to have taken care of all of the autoregression!\n\nn<-length(t)\nextractAIC(mod5, k=log(n))[2]\n\n[1] -74.25862\n\n\nAnd definitely the best so far. Just to compare more easily:\n\nc(mod1 = extractAIC(mod1, k=log(n))[2],\n mod2 = extractAIC(mod2, k=log(n))[2],\n mod3 = extractAIC(mod3, k=log(n))[2],\n mod4 = extractAIC(mod4, k=log(n))[2],\n mod5 = extractAIC(mod5, k=log(n))[2])\n\n mod1 mod2 mod3 mod4 mod5 \n-44.11057 -66.50482 -68.00597 -69.14372 -74.25862 \n\n\nWe’re looking for a difference of about 5 to determine whether a model is better. Model 5 is about 5 better than model 4, and models 2–4 are all about even. AR1 plus temperature might be even better, but it’s easier to forecast with a sine/cosine than with temperature, so I went for that…."
+ }, { "objectID": "GP_Practical.html", "href": "GP_Practical.html",