This README.txt file was generated on 2019-12-23 by Matthew D. Lincoln

#
# General instructions for completing README:
# For sections that are non-applicable, mark as N/A (do not delete any sections).
# Please leave all commented sections in README (do not delete any text).
#

-------------------
GENERAL INFORMATION
-------------------

1. Title of Dataset: Frankenstein Variorum Annotations

This dataset contains HTML editions of Mary Shelley's Frankenstein, paired with a JSON file containing scholarly annotations keyed to particular locations in each edition.

The HTML editions were prepared as part of the Pittsburgh research team’s contribution to the Frankenstein Variorum Project. The 1818, Thomas, and 1831 HTML files were converted with small revisions to Jack Lynch's Pennsylvania Electronic edition by Elisa Beshero-Bondar of the University of Pittsburgh at Greensburg. Rikk Mulligan of Carnegie Mellon University supervised the OCR of the 1823 text and Elisa Beshero-Bondar converted this to HTML with corrections. Rikk Mulligan of Carnegie Mellon University. We are grateful for consultation on this process from Raffaele Viglianti, David J. Birnbaum, Wendell Piez, and Neil Fraistat.

Scholarly annotations were authored by Jon Klancher, Avery Wiscomb, Jack Quirk, and Steve Gotzler.

Matthew Lincoln led the technical effort to harvest annotation data from Hypothes.is and align it to collated TEI XML used in the variorum edition.


#
# Authors: Include contact information for at least the
# first author and corresponding author (if not the same),
# specifically email address, phone number (optional, but preferred), and institution.
# Contact information for all authors is preferred.
#

2. Author Information
<create a new entry for each additional author>

Corresponding Author Contact Information
    Name: Elisa Beshero-Bondar
    Institution: University of Pittsbrugh, Greensburg
    Email: Elisa ebb8@pitt.edu

Author Contact Information (if applicable)
    Name: John Klancher
    Institution: Carnegie Mellon University
    Email: jk2@andrew.cmu.edu

---------------------
DATA & FILE OVERVIEW
---------------------

#
# Directory of Files in Dataset: List and define the different
# files included in the dataset. This serves as its table of
# contents.
#

Directory of Files:

- Frankenstein_1818.html
- Frankenstein_1823.html
- Frankenstein_1831.html
- Frankenstein_Thom.html

These HTML files represent a digital edition prepared for the Frankenstein Bicentennial project, commemorating the 200th anniversary of the first published edition of Frankenstein; or, the Modern Prometheus in 1818.

Date these documents were first produced: 2017-05-07. Amended: 2018-03-18.

This edition of the 1818 text is part of the Pittsburgh research team’s contribution to the Bicentennial Frankenstein Project, and is prepared by Elisa Beshero-Bondar of the University of Pittsburgh at Greensburg and Rikk Mulligan of Carnegie Mellon University. We are grateful for consultation from Wendell Piez, David J. Birnbaum, and Raffaele Viglianti, as well as Neil Fraistat and Dave Rettenmaier. This edition’s stages of development are stored and documented in the Pittsburgh_Frankenstein GitHub repository: <https://github.com/FrankensteinVariorum/fv-collation>

These files represent each edition prior to collation processing. Comparison data is not marked in these documents, but each is a distinct edition in its own right suitable for reading and annotation.

- hypothesis.json

JSON harvested from the API for <https://hypothes.is>, the annotation service used by authors to annotate specific locations within each HTML file. The file is formatted as a JSON-lines file, with one JSON object per line, representing one annotation each. Key fields:

- uri: the HTML document this annotation targets. For example, "https://frankensteinvariorum.github.io/fv-collation/Frankenstein_1831.html" corresponds to the HTML file Frankenstein_1831.html in this repository
- text: the content of the annotation
- tags: category tags applied by the annotations team to this tag
- target.[0].selector: an array of selectors used to identify the location in the HTML file the annotation points to. Includes text quote selectors, a range selector based on HTML elements (e.g. "/p[16]" to indicate the 17th `p` element in the HTML document), and an integer text position selector.

--------------------------
METHODOLOGICAL INFORMATION
--------------------------

#
# Software: If specialized software(s) generated your data or
# are necessary to interpret it, please provide for each (if
# applicable): software name, version, system requirements,
# and developer.
#If you developed the software, please provide (if applicable):
#A copy of the software’s binary executable compatible with the system requirements described above.
#A source snapshot or distribution if the source code is not stored in a publicly available online repository.
#All software source components, including pointers to source(s) for third-party components (if any)

1. Software-specific information:

Name: Hypothesis
Version:
System Requirements:
Open Source? (Y/N): Y

(if available and applicable)
Executable URL:
Source Repository URL: https://github.com/hypothesis
Developer:
Product URL: https://hypothes.is/
Software source components:


#
# Dates of Data Collection: List the dates and/or times of
# data collection.
#

3. Date of data collection (single date, range, approximate date):

2017-06-01 to 2019-12-15