DHWI Wiki:Data Curation for Digital Humanists
Welcome to the main page for the Data Curation for Digital Humanists course!
There is no required software for this course. The primary materials for this course will be derived from the DH Curation Guide.
Class Location: 3132 Tawes Fine Arts Building, 3rd floor
List of Participants
Trevor Munoz Twitter
Dorothea Salo Twitter
Tanya Clement Twitter
Patricia Hswe Twitter
Lydia Zvyagintseva Twitter
Carol Chiodo Twitter
Laura Estill Twitter
Ed Summers Twitter
Harriett Green Twitter
Craig Milberg Twitter
Chelcie Rowell Twitter
Lara Otis
Adam Tate
Kate Doubler Twitter
Caitlin Christian-Lamb
Giorgio Guzzetta Twitter
Sharon Marcus Twitter
Zotero collection of articles mentioned in lectures
DHWI Data Curation Zotero Collection
Class session slides and links
Session 1: Introduction to Data Curation
Trevor's slides for this session (PDF)
Session 2: Stories of horror!
Dorothea's slides for this session (PDF)
- Dorothea's running linklist of horror stories
- Recovering Eyebeam's Archive
- How Toy Story 2 Nearly Vanished
- Data sharing and management snafu in three short acts
Our Tales of Terror whiteboard
Session 3: Participant introductions
No slides
Session 4: Lifecycles and Communities
Trevor's slides for this session
Session 5: Data representation
Trevor's slides for this session
Session 6: Models
Dorothea's slides for this session (PDF)
- The OAIS Reference Model: Introductory Guide by Brian Lavoie (PDF)
- OAIS Reference Models
- DCC Lifecycle Model
Session 7: Digital Forensics
- Digital Curation Workstation Description of the workstation MITH is using to capture data from old media.
- BitCurator home page
Session 8: The technology environment
Dorothea's slides for this session (PDF)
- Dataverse Network
- FOSS4LIB software-review site
- DCC Archiving and Preservation tools list (aka "the gigantor list of everything ever")
- DCC's larger list of data-curation tools (more lists from DCC!)
Session 9: Data Economies
Trevor's slides for this session(PDF)
Session 10: Collections
Trevor's slides for this session(PDF)
Session 11: Data Integration
Trevor's slides for this session(PDF)
Session 12: Data and the law
Dorothea's slides for this session (PDF)
- Wikipedia entry on Bridgeman Art Library vs. Corel Corp.
- Fair Use in Online Video
- Digital Copyright Slider (pretty great rule-of-thumb gauge for whether something US-created is under copyright)
- Licensing Open Data: a Practical Guide (UK-based, so make allowances, but also a lot of solid common sense applicable internationally.)
- Data (and the Creative Commons) Makes the point that you can't CC-license something that isn't copyright-eligible to begin with.
- ARL Code of Best Practices in Fair Use for Academic and Research Libraries (Many of these are applicable to likely DH projects; they may also be a starting point for a similar code by and for DH.)
Good people to follow on copyright issues (plug them into your search engine of choice): Nancy Sims, James Grimmelmann (especially for Google Books/Hathi Trust), Peter Hirtle, Kevin Smith (at Duke), Pamela Samuelson.
Dorothea's Infallible Method for Knowing For Sure That a Use is Fair
- Copy a copyrighted work.
- Get sued by a legitimate owner of copyright in the work.
- Assert fair use as your defense in court.
- Win the case.
Session 13: Risk mitigation and auditing
- TRAC Checklist (PDF)
- NEH Office of Digital Humanities Data Management Plan guidelines
- DRAMBORA website
- Data Curation Profiles website
- Data Curation Profiles worksheet
- How Much of the Web is Archived?
Session 14: Data Management Planning
Trevor's slides for this session
- DMP Online
- DMP Tool
- An example data management plan
- The UK Data Archive's chart of optimal formats for long-term data preservation. Very helpful when creating your DMP
Session 17: Current awareness
- Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works
- Research Data Curation Bibliography
DH journals
Library journals (OA only)
- International Journal of Digital Curation
- Journal of E-Science Librarianship
- Journal of Librarianship and Scholarly Communication
- Journal of Digital Information
- D-Lib Magazine
- Ariadne
- College and Research Libraries
Weblogs
- The Signal (Library of Congress)
- DataPub (California Digital Library)
- Digital POWRR
- DSHR's Blog (for serious preservation/storage nerds; good thing to point IT staff to)
- DH Now
- Dan Cohen's blog
- Library Loon
- California Digital Library
Copyright/scholarly communication/OA weblogs
- Copyright Librarian (Nancy Sims)
- Scholarly Communications @ Duke (Kevin Smith)
- The Laboratorium (James Grimmelmann; great for Google Books/Hathi Trust analysis)
- Library Babel Fish (Barbara Fister)
Twitter folks
Conferences
Further Training
* Digital Humanities Data Curation Institute