Re: Make COPY extendable in order to support Parquet and other formats - Mailing list pgsql-hackers

From Andrew Dunstan
Subject Re: Make COPY extendable in order to support Parquet and other formats
Date
Msg-id cad1cec1-a148-c488-bf51-5821cc1a9b16@dunslane.net
Whole thread Raw
In response to Re: Make COPY extendable in order to support Parquet and other formats  (Andres Freund <andres@anarazel.de>)
Responses Re: Make COPY extendable in order to support Parquet and other formats
List pgsql-hackers
On 2022-06-23 Th 21:45, Andres Freund wrote:
> Hi,
>
> On 2022-06-23 11:38:29 +0300, Aleksander Alekseev wrote:
>>> I know little about parquet - can it support FROM STDIN efficiently?
>> Parquet is a compressed binary format with data grouped by columns
>> [1]. I wouldn't assume that this is a primary use case for this
>> particular format.
> IMO decent COPY FROM / TO STDIN support is crucial, because otherwise you
> can't do COPY from/to a client. Which would make the feature unusable for
> anybody not superuser, including just about all users of hosted PG.
>

+1


Note that Parquet puts the metadata at the end of each file, which makes
it nice to write but somewhat unfriendly for streaming readers, which
would have to accumulate the whole file in order to process it.


cheers


andrew


--
Andrew Dunstan
EDB: https://www.enterprisedb.com




pgsql-hackers by date:

Previous
From: Tom Lane
Date:
Subject: Re: Unify DLSUFFIX on Darwin
Next
From: Matthias van de Meent
Date:
Subject: Pre-installed index access methods cannot be manually installed.