Probability, Computing and Simulation in Statistics Fall 2018
This project is maintained by ST541-Fall2018
Links are either to open resources, or canvas (only accessible to registered students).
Pre-lecture reading:
Read the article: Putin’s Peaks: Russian Election Data Revisited. Kobak, Dmitry, Sergey Shpilkin, and Maxim S. Pshenichnikov. n.d. Significance 15 (3): 8–9
Be prepared to answer the following questions in class:
What feature of Figure 1 do the authors argue is evidence for election fraud?
For a particular year, say 2018, how do the authors measure the statistical significance of the feature in Figure 1? What statistic do they calculate? What do they compare that statistic to?
Pre-lecture reading:
Be prepared to answer:
Pre-lecture reading:
Functions in R4DS 19.1 through and including 19.3 (If you are on a roll keep reading…)
Be prepared to answer:
Pre-lecture reading:
Wilson G, Bryan J, Cranston K, Kitzes J, Nederbragt L, et al. (2017) Good enough practices in scientific computing. PLOS Computational Biology 13(6): e1005510. https://doi.org/10.1371/journal.pcbi.1005510
a minimum set of tools and techniques that we believe every researcher can and should consider adopting.
Read the Author summary, Overview, Introduction and Software sections.
You may also want to review some things we touched on in class about functions in R:
Section 1.2 Introduction (page 8) up to and including page 10, in Bootstrap Methods: A Guide for Practitioners and Researchers by Michael R. Chernick
Be prepared to answer the following questions:
Usually when thinking about constructing an estimate for a paramater, we make some assumptions about the population distribution. What does the bootstrap replace the population distribution with? Why is this a good replacement?
Why do we need a Monte Carlo approach?
If you’ve seen the bootstrap before, you might continue reading to the end on the Introduction, paying attention to the reasons the bootstrap might not work.