Schedule

Event

Date

Description

Course Material
Module

03/29/2021
Monday

Week 1: Course Overview and Introduction
[video] [content] [exercise]
Required Reading:
- Course Content
Due

03/30/2021 23:59
Tuesday

Sign Up for 15-minute Meet & Greet
Where:
- Canvas Calendar (see Course Overview Lecture)
- (This isn’t a hard deadline - just be aware that sign-ups are happening)
Due

04/04/2021 23:59
Sunday

Introduce Yourself on Canvas
Where:
- Canvas Discussion Board
Due

04/04/2021 23:59
Sunday

Read Assignment Overview
Module

04/05/2021
Monday

Week 2: Tables, Trees, & Triples
[video] [content] [exercise]
Required Readings:
Suggested Readings:
- Abiteboul, S., Buneman, P., & Suciu, D. (2000). Data on the Web: from relations to semistructured data and XML
Due

04/11/2021 23:59
Sunday

Week 2 Exercise Due
Where:
- Canvas Discussion Board
Due

04/11/2021 23:59
Sunday

Post Data Pitch
Module

04/12/2021
Monday

Week 3: Tidy Data
[video] [content] [exercise]
Required Readings:
Suggested Readings:
Due

04/18/2021 23:59
Sunday

Week 3 Exercise Due
Where:
- Canvas Discussion Board
Due

04/18/2021 23:59
Sunday

Submit Statement of Work
Module

04/19/2021
Monday

Week 4: Data Integration
[lecture] [guest lecture] [content] [exercise]
Required Readings:
- Course Content
Highly Recommended Readings:
- Whong, Chris (2020) “Taming the MTA’s Unruly Turnstile Data”
- Wikipedia article on data integration
Optional Readings:
- Halevy, A., Rajaraman, A., & Ordille, J. (2006, September). Data integration: The teenage years. In Proceedings of the 32nd International Conference on Very Large Data Bases (pp. 9-16)
- Abiteboul, S., Buneman, P., & Suciu, D. (2000). Data on the Web: from relations to semistructured data and XML. Morgan Kaufmann
Due

04/25/2021 23:59
Sunday

Week 4 Exercise Due
Where:
- Canvas Discussion Board
Due

04/25/2021 23:59
Sunday

Submit Users and Use Cases
Module

04/26/2021
Monday

Week 5: Data Packaging
[video] [content] [exercise]
Required Readings:
- Course Content
- Bechhofer, S., De Roure, D., Gamble, M., Goble, C., & Buchan, I. (2010). Research objects: Towards exchange and reuse of digital knowledge
- Skim this list of projects and tools for data packaging: Google Sheet or PDF
- Neylon (2017) Packaging data
Pick one of the following to read or review in depth:
Due

05/02/2021 23:59
Sunday

Submit Collection Policies
Module

05/03/2021
Monday

Week 6: Repository Architectures
[video] [content]
Required Readings:
- Course Content
- Description of digital libraries (cyberinfrastructure) from the National Science Foundation program
- This post from the IQSS staff at Harvard’s Dataverse provides an excellent table comparing existing data repository services. Pay attention to the categories being compared and how they relate to the affordances of the software
- Fallaw, C., Dunham, E., … (2016). Overly honest data repository development. Code4Lib
Review documentation for just one repository platform listed below (be sure to also look at an example of the platform’s deployment):
- Samvera (Open-source repository for universities and institutional repositories)
- Dataverse (Open-source repository for social science data)
  - About
  - Documentation
  - Example deployments https://data.qdr.syr.edu/ and https://dataverse.tdl.org/
  - See the QDR CoreTrustSeal documentation for more details on how Dataverse is configured
- Fedora (Open-source repository with semantic capabilities - often used by science repositories)
- CKAN (Open-source data repository - often used for civic data)
  - About
  - Documentation
  - Example deployments https://data.gov.au/ and Data.gov
  - Some additional info on Data.gov.au’s CKAN
- Clowder (Open-source repository for long-tail data)
Suggested Readings:
Module

05/10/2021
Monday

Week 7: Data Acquisition, Search, and Discovery
[video] [content] [exercise]
Required Readings:
Suggested Readings:
Case Study (Optional):
- A Data-Driven Approach to Appraisal and Selection at a Domain Data Repository.
Due

05/16/2021 23:59
Sunday

Week 7 Exercise Due
Where:
- Canvas Discussion Board
Due

05/16/2021 23:59
Sunday

Submit Transformations and Quality Assignment
Module

05/17/2021
Monday

Week 8: Metadata Application Profiles
[video] [content] [exercise]
Required Readings:
- Course Content
- Application profiles:
  - Heery, R., & Patel, M. (2000). Application profiles: mixing and matching metadata schemas. Ariadne, (25)
  - The Singapore Framework for Application Profiles. Note: this is currently under revision by DCMI. You can catch up on their work here (and also see an example of use cases in the wild)
- Some examples of metadata application profiles:
Suggested Readings:
- Hebron, T. K. (2018). Extending and Adapting Metadata Audit Tools for Mountain West Digital Library Members. Code4Lib Journal, (41)
- Curado Malta, M., Bermúdez Sabel, H., Baptista, A. A., & González-Blanco García, E. (2018). Validation of a metadata application profile domain model
- Stein, A., & Dunham, E. (2018). Meaningful Data Sharing: Developing the Illinois Data Bank Metadata Framework. Journal of Library Metadata, 18(2), 59-83
Due

05/23/2021 23:59
Sunday

Submit Metadata Application Profile
Module

05/24/2021
Monday

Week 9: Linked Data
[video] [content]
Required Readings:
- Course Content
- Allemang, D., & Hendler, J. (2011). Semantic web for the working ontologist: effective modeling in RDFS and OWL. Second Edition
  - Read Chapter 1 for an introduction to Semantic Web (SW) concepts. If you are interested, Chapter 2 gives a bit more detail on how the SW works, and Chapter 3 introduces RDF and knowledge modeling.
- Ontology Development 101 (Noy and McGuinness)
  - Read Sections 1 and 2 (3 and 4 are optional)
  - Note: this is a classic formulation of what an ontology is and how to create one. The software they reference in building out the example is called Protégé (free at https://protege.stanford.edu/). If you are really keen, you can follow along. (For reference, this short list from Wikipedia is quite helpful.)
- Ontology for Data Science
- Semantic Web for the Legal Domain
Suggested Readings:
- ARL White Paper on Wikidata: Opportunities and Recommendations (2019)
Due

05/30/2021 23:59
Sunday

Submit Licensing Assignment
Module

05/31/2021
Monday

Week 10: Emerging Topics
[content]
Required Reading:
- Course Content
Due

06/03/2021 23:59
Thursday

Optional Repository Assignment
Due

06/06/2021 23:59
Sunday

Submit Final Protocol