Untyped Serializer
The untyped serializer lets this library read and write data without defining low-level column data or classes. This is especially useful for use cases such as:
Reading data from a file whose schema is not known in advance (e.g. Parquet file viewers, generic utilities).
Writing Parquet file converters, e.g. from Parquet to JSON.
Motivation
A single codebase for the class serializer and the untyped dictionary serializer.
Deserialization produces JSON-like structures in memory, which can be written back to a JSON file as is.
The Row API is legacy code that is somewhat buggy and very hard to evolve and fix.
Usage
In this API, everything is a Dictionary<string, object>. For a simple use case, with the following schema:
you can write data like this:
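As a minimal sketch of the row shape only: the example below assumes a hypothetical two-column layout with an integer `id` and a string `city` (the column names, sample values, and the `UntypedRowsSketch` class are illustrative and not taken from the library). It uses nothing but the .NET standard library, so the Parquet write call itself is not part of it. Each row is a plain dictionary keyed by column name, and `System.Text.Json` shows that such rows map directly onto JSON.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

public static class UntypedRowsSketch
{
    // Hypothetical sample data: one dictionary per row, keyed by column name.
    // The "id" and "city" columns are assumptions made for illustration.
    public static List<Dictionary<string, object>> BuildRows()
    {
        return new List<Dictionary<string, object>>
        {
            new Dictionary<string, object> { ["id"] = 1, ["city"] = "London" },
            new Dictionary<string, object> { ["id"] = 2, ["city"] = "Derby" },
        };
    }

    // Rows in this shape are already JSON-like, so they serialize
    // to a JSON array of objects without any intermediate classes.
    public static string ToJson(List<Dictionary<string, object>> rows)
    {
        return JsonSerializer.Serialize(rows);
    }

    public static void Main()
    {
        Console.WriteLine(ToJson(BuildRows()));
    }
}
```

A collection of dictionaries in this shape is what would be handed to the untyped serializer together with a matching schema; the exact method to call should be taken from the tests referenced below rather than from this sketch.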
For more examples, see ParquetSerializerTests.cs in the codebase. The documentation will evolve as this API becomes more stable.