Code Block | ||||
---|---|---|---|---|
| ||||
{ // Location url: string, sourceKey: string, communityIds: [ ObjectId ], // The gzipped content stored in the Lucene index (eg after the harvest processing pipeline) gzip_content: binary, gzip_len: integer, // Optional content: gzip_raw_content: binary, // The original text, before the processing pipeline (but after Tika for PDFs etc) gzip_raw_len: integer, gzip_md_content: binary, // The compressed document metadata object gzip_md_len: integer } |
Notes
Note that the "binary" type serializes to byte[].
...