From Parquet Exports
How to Ingest
A simple guide to ingesting Parquet files of Farcaster data.
Ingestion code is available in this GitHub repo. Clone the repo onto a server with a large disk and you should be importing in no time.
Reach out to us for credentials to try it out.
- Install Homebrew:
- Install Amazon's command line tool:
- Configure Amazon's command line tool:
- Set this new profile to be the default (or you can use --profile ... on all of your aws commands):
- List all the archive exports:
You’ll see some output that will look something like this (the timestamps will likely be different):
The filename format is ${DATABASE}-${TABLE}-${START_TIME}-${END_TIME}.parquet. The timestamps bound the updated_at column.
You probably want to fetch the latest versions of each table the first time you build your database.
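To pick the latest version of each table from a listing, one option is to parse the filenames and keep the file with the greatest END_TIME per table. The following is a minimal sketch, not the ingestion repo's code. It assumes the timestamps are plain integers and that table names contain no hyphens; the helper names (ExportFile, parse_export_name, latest_per_table) and the "exampledb" database name are hypothetical.

```python
from typing import Dict, Iterable, NamedTuple, Optional, Tuple


class ExportFile(NamedTuple):
    database: str
    table: str
    start_time: int
    end_time: int


def parse_export_name(filename: str) -> Optional[ExportFile]:
    """Parse ${DATABASE}-${TABLE}-${START_TIME}-${END_TIME}.parquet.

    Assumption: timestamps are integers and the table name has no hyphens,
    so the last three hyphen-separated fields are table, start, and end.
    Returns None for names that do not match.
    """
    name = filename.rsplit("/", 1)[-1]  # drop any directory or key prefix
    if not name.endswith(".parquet"):
        return None
    parts = name[: -len(".parquet")].rsplit("-", 3)
    if len(parts) != 4:
        return None
    database, table, start, end = parts
    if not (start.isdecimal() and end.isdecimal()):
        return None
    return ExportFile(database, table, int(start), int(end))


def latest_per_table(filenames: Iterable[str]) -> Dict[Tuple[str, str], str]:
    """Map (database, table) to the filename with the greatest end time."""
    best: Dict[Tuple[str, str], Tuple[int, str]] = {}
    for filename in filenames:
        info = parse_export_name(filename)
        if info is None:
            continue  # skip anything that is not an export file
        key = (info.database, info.table)
        if key not in best or info.end_time > best[key][0]:
            best[key] = (info.end_time, filename)
    return {key: filename for key, (_, filename) in best.items()}
```

Feed it the filenames from the archive listing, then download only the values of the returned mapping.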
- List all the incremental exports:
- List all the files for a specific time range:
- Download a specific file:
- Download all the tables for a specific time range:
- Use the Parquet CLI:
- Check some data:
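The time-range steps above can also be done client-side once you have a listing of filenames. This is a rough sketch under stated assumptions: timestamps are integers in the same unit as the range you pass in, a file is selected when its [START_TIME, END_TIME) window overlaps the requested range, and the function name exports_in_range and the "exampledb" database name are hypothetical.

```python
from typing import Iterable, List


def exports_in_range(filenames: Iterable[str], range_start: int, range_end: int) -> List[str]:
    """Return export files whose time window overlaps [range_start, range_end).

    Assumes the ${DATABASE}-${TABLE}-${START_TIME}-${END_TIME}.parquet format
    with integer timestamps; names that do not match are ignored.
    """
    selected = []
    for filename in filenames:
        name = filename.rsplit("/", 1)[-1]  # drop any directory or key prefix
        if not name.endswith(".parquet"):
            continue
        parts = name[: -len(".parquet")].rsplit("-", 3)
        if len(parts) != 4:
            continue
        start_text, end_text = parts[2], parts[3]
        if not (start_text.isdecimal() and end_text.isdecimal()):
            continue
        start, end = int(start_text), int(end_text)
        # Half-open interval overlap test.
        if start < range_end and end > range_start:
            selected.append(filename)
    return sorted(selected)


if __name__ == "__main__":
    listing = [
        "exampledb-casts-100-200.parquet",
        "exampledb-casts-200-300.parquet",
        "exampledb-casts-300-400.parquet",
    ]
    for name in exports_in_range(listing, 150, 250):
        print(name)
```

The overlap test is a design choice. If you only want files whose window exactly matches one incremental export, compare the start and end values for equality instead.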