パッケージの詳細

@zazuko/rdf-parser-csvw

zazuko5.1kMIT0.17.0

CSV on the Web parser

rdf, rdf-ext, csv, csvw

readme

rdf-parser-csvw

A CSV on the Web parser with RDFJS Stream interface.

⛓️‍💥 Fork of rdf-ext/rdf-parser-csvw converted to ESM, including some tweaks and fixes

Usage

The package exports the parser as a class, so an instance must be created before it can be used. The .import method, as defined in the RDFJS specification, must be called to do the actual parsing. It expects a stream of strings. The method will return a stream which emits the parsed quads.

The constructor accepts an options object with the following optional keys:

  • metadata: Use the metadata to convert the CSV to RDF. The metadata must be given as a Dataset using the CSV on the Web ontology. This options is required.
  • baseIRI: Use the IRI to create Named Nodes. The value must be a String. This options is required.
  • factory: Use an alternative RDFJS data factory. By default the reference implementation us used.
  • timezone: Use an alternative timezone to parse date and time values. The value must be given as a String as defined in the Luxon documentation. By default local will be used.
  • relaxColumnCount: Don't throw an error if a row has a column count which doesn't match the headers column count.
  • skipLinesWithError: Skip lines with error instead of throwing an error and stop parsing. This is mainly useful for debugging and should not be used in production environments.
  • skipEmptyLines: Ignore empty lines in the CSV file.
  • trimHeaders: Remove whitespace from the beginning and end of the column names.

It's also possible to pass options as second argument to the .import method. The options from the constructor and the .import method will be merged together.

Compliance with the CSVW specification

Dates and times

  • Date/Time formats are generally supported with possible issues around the timezone specifier X and the exact formatting of the timezone offset.
  • RFC2822 dates are supported via explict 'RFC2822' datatype format (case insensitive).
  • Time formats are implemented but the output will always be normalised to UTC.

更新履歴

@zazuko/rdf-parser-csvw

0.17.0

Minor Changes

  • 815e65d: This version attempts to improve the support for date/time formats
  • fc3547d: Require the RDF/JS factory to be an environment with clownface
  • 815e65d: When producing xsd:dateTime(Offset) literals, will remove the milliseconds
  • 815e65d: Dates in the RFC2822 format are no longer implicitly supported. Use custom datatype format = 'RFC2822' to interpret cell values as timestamps in that format

0.16.5

Patch Changes

  • e63e327: Given an invalid date, the would produce unexpected literals "Invalid Date"^^xsd:dateTime

0.16.4

Patch Changes

  • b539998: Include types for uri-templates

0.16.3

Patch Changes

  • 320b4e0: Allow DatasetCore passed to ObjectParserTransform

0.16.2

Patch Changes

  • 36d0da8: skipEmptyLines was still not there

0.16.1

Patch Changes

  • 6ef1200: The skipEmptyLines parameter was missing at the top interface level

0.16.0

Minor Changes

  • ce15e5b: Update code to ESM
  • 3f35bad: By default, the parser now skip empty lines (re zazuko/cube-creator#1495). This behavior can be configured with a new option skipEmptyLines

Patch Changes

  • 9555d76: Version forked from rdf-ext
  • 9607d66: Invalid literals would have sometimes been produced where a datatype URI was used a language tag
  • ce15e5b: Added TypeScript declarations
  • 26e76ec: Add an option to trim column names (re zazuko/cube-creator#1232)
  • ce15e5b: Update RDF/JS data model to v2
  • ce15e5b: Updated readable-stream to v4