Ok I am figuring out things as I go along. Seems like the first step is to follow the step in Building Arrow C++
git clone https://github.com/apache/arrow.git
and this will build the parquet library used in here
Now I just need to come up with the table in
parquet::arrow::WriteTable(table, arrow::default_memory_pool(), outfile, 3));
and turn that into a function so I can use CxxWrap.jl.
Now, it seems like a few things still need to be done.
- Write a Julia DataFrame into arrow blob structure (potentially leveraging Arrow.jl)
- Write the C++ function using CxxWrap.jl that calls the parquet write function (yet to be writer) to write the arrow blob into parquet file
These are notes for me in case I forget. Also for other to let me know if I am sort of on the right track.