Manuel Corpas' Blog

Genomes, Web 2.0 and Bioethics

Bye Sanger, Hello TGAC

After 3.5 exciting years at the Wellcome Trust Sanger Institute, working as senior developer for the DECIPHER database, it was time to start a new venture. As of February 13th, I am the Project Leader of Plant and Animal Genomes at The Genome Analysis Centre (TGAC). TGAC is specialized in the study of plants, microbes and animal genomes with the view to facilitate the development of new genomics-based biology in academic and commercial sectors.

In my new role at TGAC I will be leading the group of computational biologists working on plant and animal genome analyses within the Genome Analysis Team. Our aims will include the organization of the analysis of sequence generated at TGAC, engaging with internal and external collaborators, nationally and internationally. A lot of the work will focus on (but will not solely be restricted to) the analysis of RNA-seq, ChIP-seq and Bisulphite sequencing data for the purposes of understanding how genes are regulated.

Coming Opportunities

I will be expecting to have openings in my group in the near future for student projects (Masters and PhD) as well as research associates and technicians. Meanwhile, if you are interested in joining or simply discussing ideas or potential projects in the the broad areas of transcriptomics, epigenomics and gene regulation, you are always welcome to drop me a line.

Filed under: Bioinformatics, Opinion, Personal, , , , , , , , , ,

A Personal View of Personal Genomics

In this interview I talk about my experiences in analyzing my genome and the genomes of my family. Here I also introduce the motivation and future plans for the analysis of my personal exome (i.e. all coding regions in my genome).

Some further information regarding my family experience of personal genomics can be found in a recent publication.

Filed under: Opinion, Personal, , , , ,

My Personal Exome Now Publicly Released

After 5 months of having performed the sequencing of my personal exome, I now make it available to the community for public use. I release it under a CC BY-SA 3.0 license, giving you permission to use this data in any way, as long as it provides attribution to the source and it is shared under a similar license.

What is an exome?

An exome is the ~1% of my genome that encodes for proteins.

Why do I release my personal exome?

When my family and myself made our genotypes available through the Internet, we immediately received results from researchers around the world who took our data for analysis and came back with interesting results. As a result of this, we have been able to learn much about ourselves. I have reported this in a previous entry on this blog entitled “Benefits for Publishing Family Genomes on the Internet“. I now follow the same principle: if I make my exome available for people to analyse it, I can expect that some researchers may come back with interesting results.

What data do I actually release?

I release my 4 FastQ files that were given to me by my sequencing provider. This is the same kind of information that 23andMe gives in their current exome analysis offer. This information basically consists of raw reads that need to be aligned to a reference assembly. Once aligned, interesting variation data can be inferred.

What do I ask in return?

Nothing. I do appeal though to the good will of potential users to report back to me anything interesting they might find.

How big are the files?

They are huge. On average they are about 0.6 Gb per file and I have 4 of these. That means that it can take several hours for each file to be downloaded. Be patient!

Where can I get them?

Here:

  1. File 1
  2. File 2
  3. File 3
  4. File 4

Read the rest of this entry »

Filed under: Genomics, Personal, Personal Genomes, , ,

A Genome Blogger Manifesto

Have you ever wondered why some people have no reparation in sharing their genetic profiles? Why do they openly talk about something supposedly so private? I believe that no contradiction exists between wanting to protect one’s privacy yet sharing one’s genomic data with the world. I am more concerned about the information that Facebook collects about my profile than my genome data (provided that I live in a country where there I public health).

Sharing and comparing one’s genome with other personal genomes is a matter of necessity if one is to shed light on the meaning of one’s personal DNA.

This is why I became a genome blogger myself. Why one should be constrained by the information that genomic test reports provide? No personal genome analysis report can ever be complete, they will always be influenced by the biases of whomever is providing such a report.

*   *   *

Although no formal document seems to have been produced on what the core values for genome blogging should be yet, core beliefs driving personal genome-sharing should be made explicit. Here I present an initial and inherently imperfect first attempt to put in writing of what I believe genome blogger values could be. I do not expect every fellow blogger to agree with them, but I hope that at least they inspire some debate. These are not a fixed set of rules; on the contrary, I expect this thinking to evolve with the genomics technology itself. I base some of the ideas below on Marcus Wohlsen’s ‘Biopunk’ book, Meredith Patterson’s ‘biopunk manifesto’, Misha Angrist’s ‘Here is a human being’ book and Pekka Himanen’s ‘Hacker’s ethics’ book.

Core Values for Genome Blogging

  1. Intelligent exploration, experimentation and trial to push the boundaries of knowledge are a right for ordinary people. The days in which genetic science was only done by university professors or people working in corporate labs are now over. Now everyone should have the power and legitimacy to be able to discover, develop and find new things about their own genome data. Read the rest of this entry »

Filed under: Bioinformatics, Computational Bioethics, Genomics, Personal, Personal Genomes, , , , , , , , ,

Getting My Genome Sequencing Done (Part II)

The purpose of this exercise is to demonstrate how to get one’s genome sequenced to those without previous experience. I will build on part one of this story, in which I explained how I tried to contact several companies who might sequence my genome. In that post there was one company I did not mention on purpose, the BGI. This post is going to be mainly about my story on how I sequenced all my gene coding regions (also known as exome) with BGI’s help, for which I am very grateful. The BGI was in fact the only company that replied to me and was interested in supporting my adventures in direct-to-consumer personal genomics.

I was able to have personal contact with the BGI in June 2011, when I, and my collaborator Rafael Jimenez, presented them our plans for developing a free open source platform for self-analysis of personal genomes. This story is a key part of that enterprise in which we have already invested so much. I hope that interested readers may find some of these experiences helpful, especially if they are thinking of sequencing their genomes.

Our initial goal with the BGI was to get the sequencing of my family’s genomes for free. After several weeks of negotiation, it was clear that the best they could offer me was their cheapest deal, a personal exome sequencing for $999. This in itself was a big success for me, largely because sequencing a single genome is not profitable for a company, as I dicovered when I spoke to Complete Genomics who would only consider a minimum of 8 genomes per experiment. At current prices this would be unaffordable even with sponsorship. Second, with contact with the BGI already established, there was the possibility that my whole family might be able to have our genomes sequenced sooner rather than later.

Even though I did not have the money to be able to pay for my exome, I optimistically accepted BGI’s offer, and decided on sequencing my own exome. It was clear that whilst I would be able to send them a sample any time, they would only do the sequencing as soon they received the money. It turned out that the money arrived to Hong Kong, where BGI’s bank account is located, before the sample. In less than three days I found a generous undisclosed Spanish sponsor who agreed to pay in full the sequencing of my exome. The currency to be paid in was euros, so the final sum was 712 euros. Apart from that, I myself paid the cost of sending the sample, out of my own pocket, using DHL. The total cost for this was £45.95.

DHL Sample Shipment Receipt

Finding the right kind of DNA sample to be sent was the difficult bit. BGI initially asked for either purified DNA or a blood sample. Neither of these options were feasible for me, because here in the UK is next to impossible to extract one’s blood if it’s not for health-related purposes. Sending DNA was not an option either because I am not a wet-lab scientist and did not have easy access to DNA purification kits. Besides, for this exercise, in which I am trying to emulate an ordinary citizen with no lab access, those options would not be feasible anyway. The solution was Oragene Kits from DNA Genotek. If you are a 23andMe customer, you will know that you spit saliva into a test tube which then gets shipped to the US. Oragene kits contain a stabilizing solution that breaks down all DNA degradation enzymes naturally contained in cells. Their DNA yields from saliva are as good as from blood samples. One can send these Oragene kits at room temperature with no need to keep the sample cool with ice, as it is the case if one sends blood.

Manuel spitting in Oragene kit for DNA extraction from saliva (photo taken by Sito Torres)

I found on the DNA Genotek website that they provide 4 sample tubes for anyone to try at no cost. I sent them a request for free sample kits and the following day they called me on the phone to check what may plans were. I did not have to pay any money at all and in less than a week they had sent me enough spit tubes for me and for my parents (we hope they will be sequenced in the near future), all the way from Canada. I would like to take this chance to recommend DNA Genotek’s sample kits; they have worked really well for me.

I still had to negotiate with the BGI the shipment of saliva (instead of blood) and also the extraction of the DNA. Very kindly the sales people from the BGI agreed to try a saliva protocol for DNA extraction for free. In all respects, I am very grateful to the BGI for the impeccable service I have received. There was one last obstacle that had to be sorted before I sent my saliva sample. The Hong Kong government requires an import license for biological materials, blood and saliva included. Fortunately the sample management team at the BGI took care of this. We calculated that the sample would arrive during the first week of August (2011). The license stated the sample arrival date would be around the 1st of August. It actually reached Hong Kong slightly sooner but this did not seem to be a problem for the customs authorities.

Finally on the 6th of September I received an email from the BGI project manager who informed me that my exome was ready for download. The delivery date in fact was almost a month earlier than the one initially agreed, so from that point of view I am also grateful to the BGI. I downloaded 4 files from their ftp site taking a total of ~2.7 Giga Bytes of data, which is less than I thought. However, it still took me 4 hours to download the whole thing (even though I had a fast Internet connection). These files are in FASTQ format. Here is the first read of one of the files:

@FCD044UACXX:4:1101:1778:2233#CAGATCAT/2
CTTGGATAAAAGGAAAATTTTGAACATGGGACCCTGACTTAATGAATATTTGAATGTTNNNNNNNNNNNNNNNNNNNNNTCAAGTGTCGT
+
__aeccacgggce`ffheghhhiiiiihhhbfc_fghfhifhdhgihbghedb[egfeBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB

This is the end of the series on Getting My Genome Sequencing Done. There will be a new series coming up called Getting my Genome Analysis Done.

Filed under: Genomics, Personal, , , , ,

Why Do I Want to Be a Scientist?

After starting to read a book that seems quite promising, “Survival Skills for Scientists“, by Federico Rosei and Tudor Johnston, one of their first questions they invite the reader to think about is “Why do you want to be a scientist”. I thought that perhaps my personal motivation and driving reasons that have made me take this career path would be worth sharing in Manuel Corpas’ Blog. So here I write all that comes to mind and you, gentle reader, are encouraged to share your reasons for wanting to be a scientist in the comments section below if you so wish.

I want to be a scientist because:

  • I love the thrill of discovery, innovation and feeling a pioneer in the field that interests me (personal/clinical genome research).
  • I love to learn and understand the mysteries of the information encoded in the molecules of living organisms, particularly humans.
  • I’d love to be able to pave the way for people to access the information about their own genetic makeup, help them interpret it and share it with others if they so wish.
  • I want to understand the meaning of the genetic code, how much of my personal traits are inherited and how much I can alter them.
  • I want to understand how DNA is able to affect clinical traits and cause disease.

In order to “know myself“, as the greek aphorism inscribed in the Temple of Apollo at Delphi recommended, a good starting point is to understand the world I live in in its finest detail, which is the task of a scientist. I believe there is a part in human nature that makes us all want to be scientists. Perhaps you should allow yourself discover this side of your persona if you have not done so already.

Filed under: Personal, , , , , ,

Follow via Twitter

Disclaimer

Any views expressed here are the author's alone and do not necessarily form part of the official positions of his employer.

Creative Commons License
Unless otherwise stated, this work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.

Recently Tweeted

Follow

Get every new post delivered to your Inbox.

Join 211 other followers