Your semester project assignment is due at the end of the semester and is submitted as a link on Blackboard.
We will discuss this project across the semester. As an overview, you will be demonstrating that you can conduct a reproducible analysis, which is an analysis of data that is independently verifiable. For example, someone else could obtain your data and code and independently reproduce your analysis.
You will complete three related parts.
Reproducible Report: Obtain open-data from an existing psych paper, load the data in R, and attempt to reproduce the statistical analysis that the original authors reported.
APA paper: Learn how to use the
papaja package that allows you to compile .Rmd files to APA style manuscripts in pdf form. Then, write a short APA-style research report that describes your reproducible analysis.
Simulation based power analysis: Include a simulation based power analysis at the end of your APA paper.
Here are a few tips for finding a psych paper with open data. Most important, for this assignment you do not need to re-analyze all of the data from a particular paper. Many papers have multiple experiments, and multiple analyses, including analyses you may not be familiar with. You can restrict your re-analysis to a portion of the paper. For example, you might only re-analyze the results from one experiment, and perhaps only the results relevant to one of the tests reported for the experiment. You can limit your re-analyses to tests that have been covered in lecture or lab.
The data you find could be in many different formats. It should be possible to load it into R and transform the data into the format/organization that you need to complete the analysis.
Focus on a single analysis that was relevant to one of the research questions. For example, if the analysis involved several t-tests:
The concept of a reproducible report is that someone else could exactly reproduce your analysis given your report. It is easy to make reproducible reports using R markdown. If you write your report in an .Rmd file, and that file includes your scripts for loading and analyzing the data, then by sharing your .rmd file, other people can exactly reproduce your report.
Your report should include the following (the points add up to 10 for part 1).
A brief description of the research question and experiment (with citation to the paper, and link to find the data) (3 points)
The R code chunks necessary to complete the re-analysis (3 points).
A write-up of your re-analysis results. (3 points)
A brief discussion of whether you were successful or not. (1 point)
In part 2, you will learn how to use the papaja package to create APA style manuscripts using R markdown. We will discuss how to use papaja in class. You will create a new .rmd file using the papaja template, and then transfer your reproducible report into this format. You will write very brief sections for:
Again, the purpose here is not to write a full-length APA paper, but to get some experience with using the papaja package.
In part 3 you will add a simulation-based power analysis to your APA-style manuscript. Specifically, you should report a graph showing a power-curve for the design. We will discuss how to conduct simulation based power analyses in class.
The following should be included in the general discussion of your APA-paper (from part 2).
To give you a better idea of what I am looking for I completed the project myself.
And the source code is located in this github repository https://github.com/CrumpLab/psyc7709Lab/tree/master/semester_project.
This repository contains: