Final Protocol Project
Overview & Requirements for Final Deliverable
A protocol is like a recipe - it tells you what ingredients, tools, and steps are necessary to produce an end result. So just as you would follow a recipe to bake a cake, you might also follow a protocol to archive, share, or reuse data. In Data Curation I you created a simple protocol for a single dataset. In Data Curation II we have significantly more knowledge and skills to develop a more robust curation protocol. Over the course of the quarter you will complete a group or individual project focused on developing a robust protocol for curating a collection of open data. The topic, the audience, the type of data, and the curation tools required are all up for you to decide (note: examples provided below).
Goals
- Gain experience curating open data “in the wild” (e.g. on the web).
- Demonstrate competency in producing or identifying existing best practices for curating various types of data.
- Practice creating structured and unstructured documentation that improves the accessibility and usability of data.
- Develop a protocol that shows off your talents and can act as a portfolio for future professional activity.
Final Deliverable
Your final deliverable will include the following components. Each week you will complete an assignment that helps you think through each of these components:
- Project Pitch: to help you find a project group (groups are recommended but not required)
- Statement of Work: Project definition, scope, and audience
- Users and User Stories
- Collection Policies
- Transformations and Data Quality
- Metadata Application Profile
- Licensing
- At least, read through the Repository Architectures Optional Assignment
You will be required to produce a written report that coherently brings together each of the pieces described above. As stated above, each week you will complete an assignment that helps you practice and produce a portion of your report. You will likely refine portions of your assignments (even after they have been submitted and graded) after receiving feedback from me, or after learning new information. You should then, with minimal additional effort (depending on the level of refining necessary), be able to simply combine these assignments into one single document and hand it in (along with your data) for a complete protocol. Additional information that you choose to include is left to your discretion. The only requirement is to include - at minimum - the elements above.
Form of Final Deliverable
Your final deliverable can take many forms. You can, as the examples below demonstrate, use Gitbook and Github to deposit data in an open platform and supplement the collection with written documentation. You could use Github without Gitbook and just include the documentation as deposited files (or create a website like this one). You could also pitch another idea. I am completely open to innovations with this assignment - the only requirements are that you engage with the concepts we are discussing in class, and try to somehow implement these in a real world setting.
Presentation
In Week 10 you will be given the opportunity to present a 5-10 minute overview of your protocol. This is optional but highly encouraged. Presentations should explain design choices around each aspect of your final deliverable. How formal you make the presentation (slides, prepared talking points, as a musical, etc.) is up to your group. I understand that presentations of this type can sometimes be difficult for a variety of reasons. As such, I will not grade these informal presentations but I highly encourage them for three reasons:
- Many of you will be expected to present regularly in your jobs and all practice is good practice.
- You can share the presentation as part of a portfolio.
- I may better understand your final protocol after viewing your presentation.
Example Protocols
Here are some examples of previous Data Curation 2 protocols:
- Flattened Fauna: Repository and Project Report
- VaxStats: Repository and Project Report
- R3 Recycling: Repository and Project Report. (Note the lead author on this repository used it as a portfolio for her current job as a data curator.)