CSV Validation for Metadata Wrangling

dc.contributor.authorWestgard, Joshua A.
dc.date.accessioned2015-11-20T17:57:23Z
dc.date.available2015-11-20T17:57:23Z
dc.date.issued2015-06-04
dc.descriptionA lightning talk delivered at the Library Research and Innovative Practice Forum, McKeldin Library, June 4, 2015. The tool described is available at http://www.github.com/jwestgard/csv-validate/.en_US
dc.description.abstractThis lightning talk describes a Python script for the validation of CSV files against arbitrary sets of rules specified in a schema file. The motivation for creating the tool was that CSV (comma-separated values) files have become a de facto standard for moving data between systems, and for any sort of batch ingest process. But CSV data can be messy, and often there are problems that appear only when the data is being loaded, after it is out of the hands of the librarians who have created the data and into the hands of systems staff. The tool is intended to empower data creators to validate CSV files against the requirements of the systems for which the data are being prepared, so that they can correct any problems themselves before sending the data along the pipeline.en_US
dc.identifierhttps://doi.org/10.13016/M2999Z
dc.identifier.urihttp://hdl.handle.net/1903/17169
dc.language.isoen_USen_US
dc.relation.isAvailableAtLibrary Research & Innovative Practice Forum
dc.relation.isAvailableAtDigital Repository at the University of Maryland
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md)
dc.titleCSV Validation for Metadata Wranglingen_US
dc.typePresentationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
westgard-csv-validation.pdf
Size:
403 KB
Format:
Adobe Portable Document Format
Description:
presentation slides in PDF