<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Manuel Corpas&#039; Blog &#187; Technology</title>
	<atom:link href="http://manuelcorpas.com/category/technology/feed/" rel="self" type="application/rss+xml" />
	<link>http://manuelcorpas.com</link>
	<description>Genomes, Web 2.0 and Bioethics</description>
	<lastBuildDate>Tue, 07 Feb 2012 04:46:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='manuelcorpas.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://1.gravatar.com/blavatar/30b9f4f8115fc52af0bb4d7d67d33f7d?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Manuel Corpas&#039; Blog &#187; Technology</title>
		<link>http://manuelcorpas.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://manuelcorpas.com/osd.xml" title="Manuel Corpas&#039; Blog" />
	<atom:link rel='hub' href='http://manuelcorpas.com/?pushpress=hub'/>
		<item>
		<title>Scientific Announcements Don&#8217;t Get Noticed Where They Should</title>
		<link>http://manuelcorpas.com/2011/12/08/scientific-announcements-dont-reach-where-they-shoul/</link>
		<comments>http://manuelcorpas.com/2011/12/08/scientific-announcements-dont-reach-where-they-shoul/#comments</comments>
		<pubDate>Thu, 08 Dec 2011 10:40:46 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Databases]]></category>
		<category><![CDATA[iAnn]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[announcements]]></category>
		<category><![CDATA[curation]]></category>
		<category><![CDATA[dissemination]]></category>
		<category><![CDATA[scientific societies]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=1228</guid>
		<description><![CDATA[Wouldn&#8217;t it be nice if the event you are trying to promote needed to be posted only once? What if  there was a central repository for dissemination of announcements that was accessible and permanently up-to-date?  Wouldn&#8217;t it be great if your blog or website could show relevant professional announcements without having to enter them? Unfortunately, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=1228&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Wouldn&#8217;t it be nice if the event you are trying to promote needed to be posted only once? What if  there was a central repository for dissemination of announcements that was accessible and permanently up-to-date?  Wouldn&#8217;t it be great if your blog or website could show relevant professional announcements without having to enter them?</p>
<p>Unfortunately, people around the world are still trapped in the paper-based office paradigm when wanting to disseminate announcement information. Again and again they post their announcement to different places knowing that it will only reach a partial share of all potentially interested readers. They add data and clog online databases as no centralized repository is available for posting or getting information. Despite the great number of hours of work lost by millions of people trying to post, scientific organizations have been extremely slow to embrace community-shared announcement curation.</p>
<p>We (<a href="http://www.rajido.com/" target="_blank">Rafael Jimenez</a> and I) are promoting the creation of a community of <a href="http://www.itico.org/node/3" target="_blank">organizations</a> and <a href="http://www.itico.org/node/16" target="_blank">people</a> to lead <strong><a href="http://www.itico.org/node/4" target="_blank">iAnn</a></strong>, a centralized collaboration platform that coordinates curation efforts among scientific organizations. iAnn increases access to announcements through its dissemination tools, which have been designed specifically to integrate posts across many different websites with minimal effort. iAnn allows you to post your event, course, piece of news only once to a central repository, which is then disseminated seamlessly to relevant scientific organizations or websites according to keywords, dates or geographical location.</p>
<p>If you think <strong>iAnn</strong> is of interest to you please contact me (see contact information on the right) or wait for future developments that are about to come in <em>Manuel Corpas&#8217; Blog</em>. Currently we are in a development phase for the project and would like to hear from potential users or scientific organizations if they have any thoughts or suggestions on the matter. Our aim is to change the way anyone posts and finds relevant information about any given professional field. <strong>iAnn</strong> promises to help many users keep up-to-date with relevant announcements more effortlessly. Perhaps from now on websites will be better able to have most of the events, courses, seminars, news, etc. that users would expect to find in them.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/1228/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/1228/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/1228/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/1228/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/1228/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/1228/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/1228/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/1228/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/1228/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/1228/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/1228/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/1228/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/1228/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/1228/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=1228&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2011/12/08/scientific-announcements-dont-reach-where-they-shoul/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>
	</item>
		<item>
		<title>Genomic Technologies in the Clinic: Challenges and Opportunities</title>
		<link>http://manuelcorpas.com/2011/10/09/genomic-technologies-in-the-clinic-challenges-and-opportunities/</link>
		<comments>http://manuelcorpas.com/2011/10/09/genomic-technologies-in-the-clinic-challenges-and-opportunities/#comments</comments>
		<pubDate>Sun, 09 Oct 2011 19:09:45 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Opinion]]></category>
		<category><![CDATA[Human Genome]]></category>
		<category><![CDATA[CNV]]></category>
		<category><![CDATA[Next Gen Sequencing]]></category>
		<category><![CDATA[SNP]]></category>
		<category><![CDATA[dbSNP]]></category>
		<category><![CDATA[Exome]]></category>
		<category><![CDATA[Whole Genome Sequencing]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=1170</guid>
		<description><![CDATA[Next Generation Sequencing (NGS) offers the promise of revolutionizing our ability to diagnose genetic disorders. Fuelled by the exponential decrease in the cost of sequencing, NGS can now be outsourced, making it accessible to labs with modest budgets. A personal exome (the sum of all coding regions in a genome) is currently priced at $999 [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=1170&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Next Generation Sequencing (NGS) offers the promise of revolutionizing our ability to diagnose genetic disorders. Fuelled by the exponential decrease in the cost of sequencing, NGS can now be outsourced, making it accessible to labs with modest budgets. A personal exome (the sum of all coding regions in a genome) is currently priced at $999 by some providers. Although not as comprehensive as whole genome sequencing, exomes provide the ability to shed light on the origin of causative mutations lying on genes.</p>
<div id="attachment_1173" class="wp-caption aligncenter" style="width: 310px"><a href="http://corpasfoo.files.wordpress.com/2011/10/exome_sequencing_workflow_1b.png"><img class="size-medium wp-image-1173 " title="Exome_Sequencing_workflow_1b" src="http://corpasfoo.files.wordpress.com/2011/10/exome_sequencing_workflow_1b.png?w=300&#038;h=253" alt="" width="300" height="253" /></a><p class="wp-caption-text">Exome Sequencing</p></div>
<address>                                    (by SarahKusala, <a href="http://creativecommons.org/licenses/by/3.0/deed.en" target="_blank">CC-BY 3.0</a>)</address>
<p>Getting the raw sequence data is the easy part. The challenging part is to extract and interpret clinically the genomic variation found in the raw data. The extraction of variants from raw NGS data can be influenced by many factors such as the sequence read depth, the alignment of reads and the variant calling algorithm. If one is to find the variants that may be of clinical relevance, filtering is required. This filtering may be performed by comparing genome data against data from the “normal” variation found in the <a href="http://www.1000genomes.org/" target="_blank">1000genomes project</a> and <a href="http://www.ncbi.nlm.nih.gov/projects/SNP/" target="_blank">dbSNP</a>. Depending on the length of the mutation, there are three main kinds of variants: SNPs, indels and CNVs. SNPs constitute single point mutations (one DNA base), Indels insertions or deletions of up to about 1Kb and CNVs deletions or duplications from 1Kb to many megabases long.</p>
<p>It is well known, however, that many SNPs fall into locations that are far from genes, yet they can cause phenotypic effects. But assuming that one is looking at coding regions, many pieces of software have been developed to predict the effects of SNP mutations: stop codons, missense mutations and frameshifts.</p>
<p>Indels and CNVs are slightly harder to interpret clinically. CNVs can encompass many genes and their phenotypic effect cannot be clearly established unless several patients have been observed with a similar CNV. It is not uncommon for a normal individual to carry hundreds of indels and CNVs.</p>
<h2>Challenges</h2>
<p>One of the most important challenges in the clinic when implementing genomics is going to be how to deal with the huge amounts of data produced. There is going to be a great number of patients sequenced, all of them producing a huge number of genomic features of unknown significance. Given that in order to confidently interpret a rare variant it is needed to have evidence from several patients, it is not surprising that another big challenge is how this information is going to be shared. A lot more data about a patient means that the chances of personal identification are increased even if this information is anonymous. Thinking about a few routinely carried out tests today, it is possible to uniquely identify a person only with a handful of SNPs. Imagine when one possesses thousands of genomic variants from one patient.</p>
<p>Moreover, if this data is to be shared, a big challenge is going to be how it is going to be compared. Different labs have different Quality Control (QC) standards and different platforms. Each sequencing run may have different read depths and different levels of confidence in terms of whether a called variant is true. Another issue will be how the annotation of phenotypes will be carried out. There are phenotypic ontologies like the <a href="http://www.human-phenotype-ontology.org/index.php/hpo_home.html" target="_blank">Human Phenotype Ontology</a>, that allows a reasonably complete set of clinical descriptions. Nevertheless there is no guarantee that phenotypic descriptions even using the same ontology will have the same level of annotation. All these factors are going to need consideration when interpreting NGS in the clinic.</p>
<p>One of the main hurdles impairing the access of NGS to the clinic can also be the health system in the country. The UK seems to have been able for now to put together many state funded clinical labs to work together. Unfortunately, this would be unthinkable in countries like Spain, where instead of 1 unique health system, there are 17, as many as autonomous regions there are. Sequencing technologies require a lot of different sectors coordinating together in order to set up the appropriate platforms that guarantee the access of the technology, its proper interpretation and the protection of the patient&#8217;s privacy.</p>
<h2>Opportunities</h2>
<p>The other side of the coin is that these technologies are going to become increasingly affordable, not just for the rich countries but also for the emerging. The accessibility of this technology will make it ubiquitous in many labs around the world, not just to those looking for diagnosis of patients with genomic disorders. Expect sequencing routinely performed for cancer tissues and even at birth. Based on current estimates, it is likely that by 2020 there will be hundreds of millions of genomes sequenced.</p>
<h2>Conclusion</h2>
<p>Sequencing is going to revolutionize clinical practice. The degree to which it will revolutionize it depends on how we harness the challenges described above. There will be technical problems but also institutional ones that are more problematic to solve. The race for harnessing NGS in the clinical setting is on.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/1170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/1170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/1170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/1170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/1170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/1170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/1170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/1170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/1170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/1170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/1170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/1170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/1170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/1170/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=1170&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2011/10/09/genomic-technologies-in-the-clinic-challenges-and-opportunities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>

		<media:content url="http://corpasfoo.files.wordpress.com/2011/10/exome_sequencing_workflow_1b.png?w=300" medium="image">
			<media:title type="html">Exome_Sequencing_workflow_1b</media:title>
		</media:content>
	</item>
		<item>
		<title>myKaryoView: First Open Source Visualization Software for 23andMe Data</title>
		<link>http://manuelcorpas.com/2010/09/01/mykaryoview-first-open-source-visualization-software-for-23andme-data/</link>
		<comments>http://manuelcorpas.com/2010/09/01/mykaryoview-first-open-source-visualization-software-for-23andme-data/#comments</comments>
		<pubDate>Wed, 01 Sep 2010 20:53:44 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[23andMe]]></category>
		<category><![CDATA[DAS]]></category>
		<category><![CDATA[dbSNP]]></category>
		<category><![CDATA[myKaryoView]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[SNP]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=710</guid>
		<description><![CDATA[Following my previous post on the First Publicly Available Genome Via DAS I would like to present an open source software that Rafael Jimenez and myself have developed for visualization of genomic data. Here we have it configured to display 23andMe data as a test case. We call it myKaryoView and it is available for [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=710&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_715" class="wp-caption alignleft" style="width: 73px"><a title="myKaryoView Logo" href="http://corpasfoo.files.wordpress.com/2010/09/mykaryoview-tiny.png"><img class="size-full wp-image-715    " style="border:0 none;margin:0;" title="myKaryoView Logo" src="http://corpasfoo.files.wordpress.com/2010/09/mykaryoview-tiny.png?w=406" alt=""   /></a><p class="wp-caption-text">myKaryoView Logo</p></div>
<p>Following my previous post on the <a href="http://manuelcorpas.com/2010/08/26/first-publicly-available-personal-genome-via-das/">First Publicly Available Genome Via DAS</a> I would like to present an open source software that <a href="http://www.rajido.com/">Rafael Jimenez</a> and myself have developed for visualization of genomic data. Here we have it configured to display 23andMe data as a test case. We call it myKaryoView and it is available for free use and download. Its website is located at the following address:</p>
<p style="text-align:left;"><a href="http://mykaryoview.com">http://mykaryoview.com</a></p>
<p>myKaryoView works in most contemporary browsers without lengthy installations and uses publicly available data distributed throughout the Internet via <a href="http://www.biodas.org/wiki/Main_Page">DAS</a>. This means that there is no need to hold the data locally and that it is capable of visualizing any data as long as it is available via DAS. In order to visualize 23andMe data, myKaryoView requires the set up of a DAS source, which currently limits myKaryoView&#8217;s usage to those familiar with this technology. However, configuration and addition of sources are extremely simple and the amount of data able to display is limited only to the time of request completion and data rendering.</p>
<p>Here we show myKaryoView to display personal genomics data with a dummy <a href="http://mykaryoview.com:9000/das/mykaryoview/features?segment=1:1,2000000">23andMe genome data source</a>. This source is based on real 23andMe results data from my own genome, randomly modified in a manner that is irrecognizably different.</p>
<p>The myKaryoView website shows an implementation that allows search of genome data via gene name or genome coordinates. For example, type in the search box 1:2000000,6000000 and hit &#8220;Submit Query&#8221;.</p>
<p style="text-align:center;">
<div id="attachment_711" class="wp-caption aligncenter" style="width: 416px"><a href="http://corpasfoo.files.wordpress.com/2010/09/picture-8.png"><img class="size-full wp-image-711  " title="myKaryoView" src="http://corpasfoo.files.wordpress.com/2010/09/picture-8.png?w=406&#038;h=163" alt="" width="406" height="163" /></a><p class="wp-caption-text">myKaryoView Zoom and Chromosome views. </p></div>
<p>The figure above shows results of that query, with two tracks containing the source from 23andMe with dummy data plus genes for a subchromosomal region in chromosome 1, Start: 2000000, End: 60000000. Gene names and SNP data and are shown in red and blue respectively. Different color shades indicate the density of annotation for any given point. If the “Gene Names” data track name is clicked, a popup window appears with a link “Display Original Data Source” that allows the download of the raw data from its DAS source. Any feature can be clicked for retrieval of specific information contained in the DAS source. Here a blue SNP mark is clicked and a popup window appears describing the selected SNP and a link to its corresponding dbSNP entry.</p>
<p><a href="http://mykaryoview.com/mykaryoview_doc.pdf">A simple manual</a> explaining how to install and configure myKaryoView to show different data sources is provided from the website. myKaryoView is still in beta testing and any feedback is welcome. We have some plans for the near future for myKaryoView, which we will reveal in due time. Meanwhile I hope you find it interesting and useful.</p>
<p>By the way, the claim that this is the First Open Source Visualization for 23andMe data is, of course, arguable.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/710/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=710&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/09/01/mykaryoview-first-open-source-visualization-software-for-23andme-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>

		<media:content url="http://corpasfoo.files.wordpress.com/2010/09/mykaryoview-tiny.png" medium="image">
			<media:title type="html">myKaryoView Logo</media:title>
		</media:content>

		<media:content url="http://corpasfoo.files.wordpress.com/2010/09/picture-8.png" medium="image">
			<media:title type="html">myKaryoView</media:title>
		</media:content>
	</item>
		<item>
		<title>First Publicly Available Personal Genome Via DAS</title>
		<link>http://manuelcorpas.com/2010/08/26/first-publicly-available-personal-genome-via-das/</link>
		<comments>http://manuelcorpas.com/2010/08/26/first-publicly-available-personal-genome-via-das/#comments</comments>
		<pubDate>Thu, 26 Aug 2010 00:10:15 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[23andMe]]></category>
		<category><![CDATA[DAS]]></category>
		<category><![CDATA[DAS Registry]]></category>
		<category><![CDATA[dbSNP]]></category>
		<category><![CDATA[DIYgenomics]]></category>
		<category><![CDATA[personal genomics]]></category>
		<category><![CDATA[SNP]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=658</guid>
		<description><![CDATA[You may have heard stories about some well known people to have released their genome for public use. I would like to convince you that now you don&#8217;t have to have a lot of money or being a public figure in order to do that. Companies like 23andMe and Navigenics provide the ability to get [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=658&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>You may have heard stories about some well known people to have released their genome for public use. I would like to convince you that now you don&#8217;t have to have a lot of money or being a public figure in order to do that. Companies like <a href="https://www.23andme.com/">23andMe</a> and <a href="http://www.navigenics.com/">Navigenics</a> provide the ability to get one&#8217;s genome tested for not a lot of money and get the results via a password protected website. The problem is that our current understanding of what these results mean are rather limited on their own. Thus having open collaboration platforms for citizen science   using genomic data may be a step forward in helping understand one&#8217;s genetic testing results. Initiatives like <a href="http://www.diygenomics.org/">DIYgenomics</a> are already working on this concept.</p>
<p>You may wonder why making one&#8217;s genome released is useful. The answer is, in practical terms it is  not. However, the concept of being able to do that I consider it to be a very  interesting one. After all, one’s genome data on its own is hardly informative, but  when compared with  information like known genes, pathways or even other  people’s genomes, it  becomes much more interesting and opens up the  possibility for real  discoveries.</p>
<p>With this post I hope to prove that genomes can now be put on the  web in a standard format like the <a href="http://www.biodas.org/wiki/DAS/1">Distributed Annotated System</a> (DAS) where people can share and integrate them with other public  data sources mappable to genome coordinates. DAS is an environment that is open source, decentralized and unregulated. So what is different here from what is being done already? Why is this  significant? I can think of at least three reasons. 1) Flexibility: pretty much any genome annotation can be put up; 2)  Integration capabilities: anything can be combined with anything else as long as they share the same coordinates system and 3) Data outsourcing: data is stored and maintained by DAS source owners elsewhere. Here is my story:</p>
<p>Last year I decided to get a 23andMe kit to have my genome analyzed. After results were delivered, I decided to download the data in raw format, consisting of &gt;0.5M SNPs (single nucleotide polymorphisms) mapped to the NCBI36 genome assembly.</p>
<p>I wanted to experiment with this data from a bioinformatics point of view, so I decided to put my “genome” on the web for public access. Well almost. I did not put up my real genome, I created a randomly shuffled version of it (i.e. it does not resemble any recognizable trace to the real data). I put up this unreal data to make a point of principle.</p>
<p>Anyone in the world can thus access my randomly shuffled genome using an URL like this:</p>
<p><a href="http://mykaryoview.com:9000/das/mykaryoview/features?segment=1:1,2000000">http://mykaryoview.com:9000/das/mykaryoview/features?segment=1:1,2000000</a></p>
<p>where after the token “segment=” in the above URL a chromosome type is specified [1-22, X, Y], followed by a colon, followed by the start and end position, separated by comma. Try the above URL with different chromosome number and coordinates and see what results you get!</p>
<div id="attachment_664" class="wp-caption aligncenter" style="width: 416px"><a href="http://corpasfoo.files.wordpress.com/2010/08/picture-91.png"><img class="size-full wp-image-664" title="23andMe Personal Genome Sample" src="http://corpasfoo.files.wordpress.com/2010/08/picture-91.png?w=406&#038;h=111" alt="" width="406" height="111" /></a><p class="wp-caption-text">This is what you get when requesting a valid URL with the genome coordinates</p></div>
<p>In the above figure you see different columns, denoting the SNP id, start and end positions, the genotype under the &#8220;Notes&#8221; heading and a link to the SNP&#8217;s corresponding entry in <a href="http://www.ncbi.nlm.nih.gov/projects/SNP/">dbSNP</a>.</p>
<p>Now that this genome is in a standard format, it can easily be integrated with any other publicly available data in DAS. As of this writing (26th August 2010) there are 139 data sources available in the <a href="http://www.dasregistry.org">DAS registry</a> mapped to Human Genome coordinates. I may not be interested in them all, but certainly this is one of the greatest repositories of genomic data in just one shop. Leading providers of publicly available genomic DAS sources include <a href="http://www.ensembl.org/">Ensembl</a>, the <a href="http://projects.tcag.ca/variation/">Database of Genomic Variants</a> and <a href="http://www.genome.gov/10005107">ENCODE</a>. Potential permutations of this data provides a range of possibilities for interrogation of biological hypotheses that is probably unparalled.</p>
<p>Now this shuffled genome is available for public use via a DAS web service. It will probably not be the last one to be put up and soon real 23andMe genomes will follow.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/658/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/658/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/658/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/658/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/658/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/658/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/658/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/658/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/658/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/658/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/658/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/658/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/658/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/658/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=658&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/08/26/first-publicly-available-personal-genome-via-das/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>

		<media:content url="http://corpasfoo.files.wordpress.com/2010/08/picture-91.png" medium="image">
			<media:title type="html">23andMe Personal Genome Sample</media:title>
		</media:content>
	</item>
		<item>
		<title>Open Tech 2010</title>
		<link>http://manuelcorpas.com/2010/07/25/open-tech-2010/</link>
		<comments>http://manuelcorpas.com/2010/07/25/open-tech-2010/#comments</comments>
		<pubDate>Sun, 25 Jul 2010 16:30:49 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Biotech]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[data standards]]></category>
		<category><![CDATA[Open Source]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=633</guid>
		<description><![CDATA[I will be speaking at Open Tech 2010 in London (UK) on Friday 11 September 2010. My talk, entitled &#8216;Who Owns my Genome Data&#8217;, will be delivered at the Seminar Room (First Session, 10:30 a.m.). If you are planning to attend Open Tech 2010 this year, let me know and be sure to attend my [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=633&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I will be speaking at <a href="http://www.ukuug.org/events/opentech2010/schedule/">Open Tech 2010</a> in London (UK) on Friday 11 September 2010. My talk, entitled &#8216;Who Owns my Genome Data&#8217;, will be delivered at the Seminar Room (First Session, 10:30 a.m.). If you are planning to attend Open Tech 2010 this year, let me know and be sure to attend my talk!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/633/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=633&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/07/25/open-tech-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>
	</item>
		<item>
		<title>The Power of Incidental Findings</title>
		<link>http://manuelcorpas.com/2010/07/11/the-power-of-incidental-findings/</link>
		<comments>http://manuelcorpas.com/2010/07/11/the-power-of-incidental-findings/#comments</comments>
		<pubDate>Sun, 11 Jul 2010 08:18:23 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Computational Bioethics]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[ethics]]></category>
		<category><![CDATA[Genomic disorders]]></category>
		<category><![CDATA[incidental findings]]></category>
		<category><![CDATA[next generation sequencing]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=630</guid>
		<description><![CDATA[Imagine that you are a geneticist that receives a patient in your clinic with a rare genetic disorder. Your patient is a 3 year old girl with severe learning dificulties. After looking at her sample under the microscope you find nothing of note and so you dismiss the case as inconclusive. Up until recently, that [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=630&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Imagine that you are a geneticist that receives a patient in your clinic with a rare genetic disorder. Your patient is a 3 year old girl with severe learning dificulties. After looking at her sample under the microscope you find nothing of note and so you dismiss the case as inconclusive. Up until recently, that was the normal scenario for most cases in the genetics clinic. Today however, with next generation sequencing techniques, we are able to look at base pair level resolution, about 10,000 times the resolution microscopy can afford. At a later date, the resources become available to you and decide to carry out a next generation sequencing analysis for the patient and both parents.</p>
<p>With the results in hand, you identify a candidate mutation most likely to have caused the observed symptoms. You look then at the parents’ genotype to see whether the mutation is inherited and find that the father is not the real biological one, i.e. you are faced with an <em>incidental finding </em>of non-paternity. This is one example of pieces of information contained in the genetic material, not related to the diagnosis, but necessary to carry out the analysis. Incidental findings like these have the potential of dramatically affecting the lives of patients and their families.</p>
<p>How should we deal with incidental findings capable of dramatically affecting families? The answer is not simple. For instance, when non-paternity is identified, the case could be dismissed as non-analyzable to avoid the complex ethical ramifications of such a revelation on the family.</p>
<p>Let’s consider another incidental finding, such as detecting that one of the parents is a carrier of the disease that is causing the mutation in the patient. Such a finding may imply that future children from the couple may also inherit the mutation. Should parents be informed or should they be allowed to choose whether they find out or not?</p>
<p>There are many other incidental findings where strong arguments can be made either for or against informing the family. One example is the discovery of a malignant variant for the <a href="http://en.wikipedia.org/wiki/Apolipoprotein_E">APOE</a> gene in the patient, implicated in early onset Alzheimer’s. This is something for which the patient was not tested, but still it has the potential of having a life changing effect on the patients or their parents. Another example is the finding of a malignant mutation for <a href="http://en.wikipedia.org/wiki/BRCA2">BRCA2</a>, which increases by 70% the chances of developing breast cancer in female carriers. There are many other incidental findings such as these where practitioners are undecided on how to manage that information sensibly.</p>
<p>What it is clear though is that incidental findings will have to be dealt with on an individual basis. Informing the family will depend on the possible effects of the information found as well as personal circumstances. Here I have just limited my exposition to a few common examples of current challenges, but as technology progresses and more patients and families are sequenced, many new conundrums are likely to appear, requiring new ethical debate.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/630/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/630/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/630/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/630/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/630/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/630/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/630/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/630/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/630/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/630/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/630/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/630/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/630/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/630/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=630&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/07/11/the-power-of-incidental-findings/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>
	</item>
		<item>
		<title>Sending Sensitive Data Encrypted</title>
		<link>http://manuelcorpas.com/2010/07/08/sending-sensitive-data-encrypted/</link>
		<comments>http://manuelcorpas.com/2010/07/08/sending-sensitive-data-encrypted/#comments</comments>
		<pubDate>Thu, 08 Jul 2010 13:36:47 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Tutorials]]></category>
		<category><![CDATA[encryption]]></category>
		<category><![CDATA[passphrase]]></category>
		<category><![CDATA[privacy]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=616</guid>
		<description><![CDATA[The other day I was asked to find a way to send sensitive clinical data to another institute. How to make sure that the data is protected and only acessible to the right people? There are two aspects of protecting data, reflecting the different risks which the data may be exposed to: data in transit [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=616&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The other day I was asked to find a way to send sensitive clinical data to another institute. How to make sure that the data is protected and only acessible to the right people? There are two aspects of protecting data, reflecting the different  risks which the data may be exposed to:</p>
<ul>
<li>data in transit (email &#8220;in flight&#8221;, web or FTP downloads, data sets  on USB disks shipped by FedEx, etc)</li>
<li>data at rest (email arrived in recipient&#8217;s inbox, data copied to  collaborator&#8217;s working disk, etc)</li>
</ul>
<p>Here we will only explore the requirements for encrypting data in  transit. The security of the data at rest is assumed to be taken care of  by the collaborator or their IT staff, since it is outside one&#8217;s control.</p>
<p>There are various possible file transfer methods:</p>
<ul>
<li>email &#8211; suitable for small files (typically up to 5MB although  different sites impose different limits); no automatic encryption in  transit</li>
<li>FTP or non-SSL password-protected web site &#8211; suitable for large  files (in the GB range); no automatic encryption in transit</li>
<li>scp &#8211; suitable for large files; intrinsic encryption in transit;  likely to encounter firewall issues</li>
<li>password-protected SSL web site &#8211; suitable for large files;  intrinsic encryption in transit</li>
<li>USB disk &#8211; suitable for very large data sets (TB range); no  automatic encryption in transit</li>
</ul>
<p>When encryption is mandated (e.g. by a data access agreement) and the  file transfer method does not provide encryption intrinsically, it is  necessary to encrypt the data separately and transfer the encrypted file  by the chosen method.</p>
<p style="text-align:center;"><a href="http://corpasfoo.files.wordpress.com/2010/07/security.jpg"><img class="size-medium wp-image-619 aligncenter" title="security" src="http://corpasfoo.files.wordpress.com/2010/07/security.jpg?w=300&#038;h=200" alt="" width="300" height="200" /></a></p>
<p>For ad-hoc or one-off data encryption, it is appropriate to encrypt a  data set with  a password (&#8220;symmetric encryption&#8221;, because the same  password is used to encrypt and decrypt) which will be sent to the  recipient <strong>by a separate means</strong> to the actual data. For example, if  the data is shipped on a USB disk,  the password could be sent by  email, or given over the phone. Sending  the password with the encrypted  data defeats the object of encrypting  it!</p>
<p>For regular or scheduled data transfers, public-key encryption may be  suitable &#8211; and removes the need to send a password &#8211; but that will not  be explored here due to the extra work in creating and managing keys.</p>
<p>A suitable encryption tool on Linux systems is gpg (the <a href="http://www.gnupg.org/">GNU Privacy  Guard</a>). The simplest usage is to prepare a single file containing the  data in question using tar or zip, and then to encrypt that:</p>
<pre>$ gpg -c bigfile.tar
gpg: gpg-agent is not available in this session
Enter passphrase:
Repeat passphrase:
<pre>$ ls bigfile.tar*
bigfile.tar    bigfile.tar.gpg</pre>
</pre>
<p>At this point, &#8220;bigfile.tar.gpg&#8221; is the encrypted file which is safe  to transfer by email, FTP, or any other non-encrypted method. Note that  the passphrase is not displayed while it is being entered; and that the  encrypted file is typically smaller than the original due to compression  in the encryption process. However it is necessary to have enough disk  space to contain both the original and the encrypted data  simultaneously, which may make this approach unsuitable for very large  (TB) datasets.</p>
<p>The passphrase should be chosen with the same care as a computer  login password. The Linux utility &#8220;pwgen&#8221; produces a selection of random  passwords which may be useful in selecting a suitable passphrase.</p>
<p>The recipient will decrypt the file in a similar way:</p>
<pre>$ gpg bigfile.tar.gpg
gpg: CAST5 encrypted data
gpg: gpg-agent is not available in this session
Enter passphrase:
gpg: encrypted with 1 passphrase
gpg: WARNING: message was not integrity protected</pre>
<p>Note that if the passphrase is lost then it is vanishingly unlikely that  the encrypted data can be recovered. Unless the passphrase is easily  guessable, the encryption is sufficiently strong as to defeat most  attempts to break it.</p>
<p><em>Written by Dr David Holland (<a href="http://www.sanger.ac.uk/">WTSI</a>), adapted by Manuel Corpas. Posted with Dr Holland&#8217;s permission.<br />
</em></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/616/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/616/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/616/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/616/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/616/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/616/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/616/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/616/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=616&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/07/08/sending-sensitive-data-encrypted/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>

		<media:content url="http://corpasfoo.files.wordpress.com/2010/07/security.jpg?w=300" medium="image">
			<media:title type="html">security</media:title>
		</media:content>
	</item>
		<item>
		<title>Biomedical Community-Wide Annotation Using Wikipedia</title>
		<link>http://manuelcorpas.com/2010/06/03/biomedical-community-wide-annotation-using-wikipedia/</link>
		<comments>http://manuelcorpas.com/2010/06/03/biomedical-community-wide-annotation-using-wikipedia/#comments</comments>
		<pubDate>Thu, 03 Jun 2010 20:33:25 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Biotech]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Tutorials]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=582</guid>
		<description><![CDATA[The pace of data generation is leaving far behind our ability to convert this data into usable knowledge. Even well funded biomedical databases find it increasingly difficult to keep up to speed. In order to tackle this problem, some databases have opted for increasing automation in the way data is deposited, reducing the time needed [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=582&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The pace of data generation is leaving far behind our ability to convert this data into usable knowledge. Even well funded biomedical databases find it increasingly difficult to keep up to speed. In order to tackle this problem, some databases have opted for increasing automation in the way data is deposited, reducing the time needed for interpreting results. The problem with this approach is that generated knowledge as a result is less accurate than manually annotated entries and of lower quality. Another potential solution has been to engage leading experts, creating a sort of consortium where they give some of their time to curate data entries that match their specialties. Unfortunately, engaging world experts in curating biomedical resources has not had a lot of success, with a few contributing a lot and many hardly ever dedicating any time to curation no matter how much they were fetched.</p>
<p>A new revolutionary idea has come from <a href="http://www.sanger.ac.uk/research/faculty/abateman/">Alex Bateman</a>&#8216;s group to engage not just the community of experts but the whole of the Internet, using Wikipedia. One of his group&#8217;s databases, <a href="http://rfam.sanger.ac.uk/">Rfam</a>, which characterises RNA families, is now providing all of its annotation via Wikipedia. Wikipedia is already the leader reference resource for all kinds of information. It possesses the know-how and capability to mediate the curation of database entries as well as managing to have extremely resounding success in terms of gathering reasonably high quality knowledge.</p>
<p>After having a persuasive discussion with Alex, I decided to give it a try myself and add my very first entry to Wikipedia, which I thought it could potentially help the database I develop outsource its <em>public/non-sensitive</em> data annotation part.</p>
<p>I copied, edited and formatted parts of a non-sensitive entry (a Syndrome description) to Wikipedia. I learnt –contrary to what I expected- that as long as one has an account and no entry exists on the topic, a page can be added on the fly. So I added a page and started editing, copying and pasting.</p>
<p>It took me a bit of time to get used to some of the conventions and formatting tags used by Wikipedia but very early on I had help from Wikipedia &#8216;agents&#8217;. It really surprised me how quickly these agents picked up my entry and immediately made me know the criteria for making sure this Wikipedia entry achieves a high standard.</p>
<p>I learnt about important concepts in the Wikipedia context such as <a href="http://simple.wikipedia.org/wiki/Wikipedia:Notability">Notability</a> and <a href="http://en.wikipedia.org/wiki/Conflict_of_interest">Conflicts of Interests</a>. Apparently one cannot write about oneself for example, and personal opinions or articles are not accepted. So far this was OK for me although problems came when one of this agents pointed at some copywriting issues: I was trying to copy an entry of a website/database.</p>
<p>Blatant copy of public content from another website is considered a copyright violation unless a correct license is put in place and one &#8216;owns&#8217; the data. In our case, the <a href="http://en.wikipedia.org/wiki/Creative_Commons_licenses">Creative Commons License</a>, which is the one we hold, was not OK because although it lets public use of the information, it does not allow alteration. This means that people would not be able to edit my Wikipedia entry.</p>
<p>I must admit I felt intimidated at this point. Despite that, I was extremely impressed with the efficacy with which agents acted as well as how quickly they responded to my queries. I can understand why they have to be so tough so that they prevent abuse.</p>
<p>Overall I feel quite satisfied with what I have learnt in the process and I am extremely eager to keep exploring the use of Wikipedia for database curation. Of course this is just a try and our adopted solution for keeping up with current annotation may be something different in the end. However, it is worth a try.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/582/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/582/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/582/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/582/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/582/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/582/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/582/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/582/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/582/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/582/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/582/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/582/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/582/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/582/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=582&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/06/03/biomedical-community-wide-annotation-using-wikipedia/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>
	</item>
		<item>
		<title>Is Tide of Privacy War Turning Against Facebook?</title>
		<link>http://manuelcorpas.com/2010/05/14/is-tide-of-privacy-war-turning-against-facebook/</link>
		<comments>http://manuelcorpas.com/2010/05/14/is-tide-of-privacy-war-turning-against-facebook/#comments</comments>
		<pubDate>Fri, 14 May 2010 07:55:04 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=548</guid>
		<description><![CDATA[A series of articles in the NY Times and elsewhere are being written about the increasing disgust of people for Facebook&#8217;s privacy policies. Apparently, a bewildering tangle of options are needed in order to set one&#8217;s privacy configuration in your own profile. What is more, a group of 4 nerds in NYC have launched a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=548&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A series of articles in the NY Times and elsewhere are being written about the increasing disgust of people for Facebook&#8217;s privacy policies. Apparently, a <a href="http://nyti.ms/bzVXc2" target="_blank">bewildering tangle of options</a> are needed in order to set one&#8217;s privacy configuration in your own profile. What is more, a group of 4 nerds in NYC have launched a<a href="http://nyti.ms/cn596V" target="_blank"> cry to arms against Facebook</a>, promising to develop a social network called <a href="http://joindiaspora.com/">Diaspora*</a> that will not need users to surrender their privacy to be sold to third parties.</p>
<p>Whether this is going to be the beginning of a long battle for the holy grail of social network dominance or simply just another trifling spark against the giant it remains to be seen. What it is clear is that a social clamor is mounting up for their most basic instincts in search of privacy protection. The proof is that Diaspora* has been raising funds for this new venture in <a href="http://www.kickstarter.com/projects/196017994/diaspora-the-personally-controlled-do-it-all-distr">Kickstarter</a> and 18 days to go for closing of this round they have already been promised 1273% of the money they asked initially.</p>
<p>Perhaps there are now clear signs that cyberusers are getting tired of being imposed rules by the big monopoly or simply they just would like to see new blood providing more self-control options. It is clear though that the battle for personal privacy in the web continues and that the Tide of War might be turning against Facebook. What it is not so clear though is how all this will affect the end user.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/548/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/548/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/548/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/548/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/548/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/548/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/548/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/548/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/548/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/548/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/548/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/548/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/548/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/548/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=548&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/05/14/is-tide-of-privacy-war-turning-against-facebook/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>
	</item>
		<item>
		<title>Hacker Ethics are not Completely Universal</title>
		<link>http://manuelcorpas.com/2010/04/25/hacker-ethic-is-not-completely-universa/</link>
		<comments>http://manuelcorpas.com/2010/04/25/hacker-ethic-is-not-completely-universa/#comments</comments>
		<pubDate>Sun, 25 Apr 2010 19:36:03 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[access]]></category>
		<category><![CDATA[democracy]]></category>
		<category><![CDATA[Hacker ethics]]></category>
		<category><![CDATA[universal]]></category>
		<category><![CDATA[values]]></category>

		<guid isPermaLink="false">http://manuelcorpas.com/?p=537</guid>
		<description><![CDATA[Hacker values are great ideals to be aspired by current generations of would-be entrepeneurs and even philosophers. Let's not forget, however, that this culture was grown and fostered in a very particular set of circumstances and country. I do not believe that these ideals are necessarily universal. I can think of a parallel analogy with democracy: despite being universal in its aspirations it does not work in all cultures.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=537&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I paraphrase the main values of the hacker ethics, taken from Steven Levy&#8217;s 1984 book, Hackers: <a href="http://en.wikipedia.org/wiki/Hackers_Heroes_of_the_Computer_Revolution">Heroes of the Computer Revolution</a>:</p>
<ul>
<li>Access to computers &#8211; and anything which might teach you something about the way the world works &#8211; should be unlimited and total. Always yield to the Hands-On imperative!</li>
<li>All information should be free.</li>
<li>Mistrust authority &#8211; promote decentralization.</li>
<li>Hackers should be judged by their hacking, not bogus criteria such as degress, age, race, or position.</li>
<li>You can create art and beauty on a computer.</li>
<li>Computers can change your life for the better.</li>
</ul>
<p>While such values are laudable, I find a series of reservations with the hacker ethics that makes me think of it as a more exclusive, restrictive culture than what many hackers gurus might have initially thought of:</p>
<ul>
<li>It assumes a potentially equalitarian access to technology. Not all would-be hackers have the possibility to be exposed to the technological advances where hackers thrive. I cannot conceive a leading hacker dealing with obsolete technology. By definition, access to technology cannot be limitless to everyone.</li>
<li>Despite a free flow of information, the truth is that where you are born will condition your ability to become a hacker. Access to information alone is not enough. Why are there no well known hackers outsite the rich world?</li>
<li>Unless you speak and understand English you will never succeed as a hacker. All programming languages are based on the English lexicon and without it one cannot understand many of the nuances. The development of the hacker culture is also tied to a very specific environment and tradition.</li>
<li>Hacker culture is based on a civilization of abundance. Most people have their limited salaries and obligations to meet. The hacker culture assumes a continuous amount of free time, that many people simply do not have.  This excludes hacker values from universal adoption.</li>
</ul>
<p>Hacker values are great ideals to be aspired by current generations of would-be entrepeneurs and even philosophers. Let&#8217;s not forget, however, that this culture was grown and fostered in a very particular set of circumstances and country. I do not believe that these ideals are necessarily universal. I can think of a parallel analogy with democracy: despite being universal in its aspirations it does not work in all cultures.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/corpasfoo.wordpress.com/537/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/corpasfoo.wordpress.com/537/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/corpasfoo.wordpress.com/537/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/corpasfoo.wordpress.com/537/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/corpasfoo.wordpress.com/537/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/corpasfoo.wordpress.com/537/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/corpasfoo.wordpress.com/537/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/corpasfoo.wordpress.com/537/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/corpasfoo.wordpress.com/537/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/corpasfoo.wordpress.com/537/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/corpasfoo.wordpress.com/537/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/corpasfoo.wordpress.com/537/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/corpasfoo.wordpress.com/537/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/corpasfoo.wordpress.com/537/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=manuelcorpas.com&amp;blog=5424602&amp;post=537&amp;subd=corpasfoo&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://manuelcorpas.com/2010/04/25/hacker-ethic-is-not-completely-universa/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ad7b9d282ec00a53fee5c0d293f3f425?s=96&#38;d=wavatar" medium="image">
			<media:title type="html">manuelcorpas</media:title>
		</media:content>
	</item>
	</channel>
</rss>
