After 4 years of work, I am pleased to say that two publications have emanated from this work:
Low budget analysis of Direct-To-Consumer genomic testing familial data
Crowdsourced direct-to-consumer genomic analysis of a family quartet
I am pleased to say that the latter article has been featured by BMC Genomes as one of the most influential articles published in the journal in 2015.
You can find all of the Corpasome data in figshare. Given that the process of obtaining the data and analysing them has been incremental, there are two URLs you need to access should you want to use our genome data (released under a CC0 license).
The raw dataset for my parents and sister trio, together with the VCF file for the quartet (Mother, Father, Daughter, Son) are here:
The Son (mine) raw data is available here:
The figure below shows all the data that is available. In red I show some “anomalies” to distinguish data versions or the fact that my Aunt’s data is only 23andMe and she has sadly passed away.