VERTNET SNAPSHOTS CREATION PROCESS:
- VertNet assembles its datastore into BigQuery, a Google tool, with which we run queries over the complete VertNet index.
- From these queries we create taxon-based snapshots - Amphibia, Aves, Fishes*, Mammalia, and Reptilia (additional datasets may be produced in the future).
- These snapshots are converted into CSV (comma-separated) text files and compressed into .zip files.
- VertNet generates a metadata** file that describes the contents contained within each zipped dataset.
- Finally, we bundle up the metadata and the dataset and upload them to CyVerse.
- Yes, we understand that “fish” is a paraphyletic assemblage, but for the sake of simplicity and to avoid a ton of “fish-related resources” we have bundled everything to “fish”.
** The metadata we create is in the same format we use for any other resource in VertNet.