Education and Training

24 Sep 2019
Kirsten Keister

OpenITI AOCP: The Open Islamicate Texts Initiative Arabic-script OCR Catalyst Project

With generous funding from The Andrew W. Mellon Foundation, OpenITI AOCP will create a new digital text production pipeline for Persian and Arabic texts. OpenITI AOCP will catalyze the digitization of the Persian and Arabic written traditions by addressing the central technical and organizational impediments stymying the development of improved OCR for Arabic-script languages.

8 Mar 2018
Raffaele Viglianti


coreBuilder is an open-source, web-based visual environment for authoring stand-off markup. The tool aims to make stand-off techniques more approachable for Text Encoding Initiative (TEI) projects that deal with multidimensional representations of text, without substantially disrupting the workflows already familiar to TEI encoders.

20 Nov 2017
Stephanie Sapienza

Using the Digital to Engage Archival Radio Collections: A Panel and Workshop

This panel and workshop, planned in conjunction with the 2017 Radio Preservation Task Force Conference, focused on innovative workflows for crowdsourcing linked data to build a web of data that can bridge collective heritage. Panelists discussed their work and research in crowdsourcing or linked open data for radio collections. A Wikidata workshop followed, demonstrating how Wikidata can be used to connect archival radio collections to a broader web-based community of knowledge.