Genome in the Wild

I tested with 23andme in April 2010 and then upgraded to their version 3 chip with almost a million SNPs last Christmas.

Now I am releasing my personal genome in the public domain.

CC0
To the extent possible under law, Zack Ajmal has waived all copyright and related or neighboring rights to Zack Ajmal 23andme v3 Genome. This work is published from: United States.

You can download my genome data in zipped files:

Razib has a list of people who have made their 23andme genomes public.

When Blaine Bettinger released his genome into the public domain, he issued a challenge:

So, I’m challenging everyone who reads this to download my data and analyze it to find the most interesting or surprising results. For example, you could use my most recent 23andMe V3 data.

I’ve already done a fair amount of analysis myself, including the Promethease reports above (and see here), and a recent blog post about my vastly increased Type 2 Diabetes risk. However, perhaps there’s a recent but relatively study that applies, or perhaps there’s a story you can weave with a handful of SNPs. Or, even better, what can you tell me about my ancestry other than mtDNA and Y-DNA haplogroups? Don’t worry about the strength of the study, reproducibility, etc. – I’m aware of the uncertainties associated with this type of research, and my goal here is to make people aware of possibilities.

Please post your findings in the comments below, and in two weeks I’ll pick the most surprising or interesting findings and make them the focus of a new blog post.

Can you surprise me with my own genome?

I have done a fair amount of analysis on my genome. For example, here’s my Promethease report. My ID is DOD128 in Dodecad, PKEG1 in Eurogenes and HRP0001 in Harappa.

My challenge for you would be to find interesting information about my chromosome 9 which is 93% homozygous.

If you analyze my genome, it would be great if you could let me know about what you found as I am always hungry for more information.

Harappa Ancestry Project Update

I have got 25 participants to the Harappa Ancestry Project now. But we still need more especially from the Hindi belt.

I have been detailing the datasets I am using:

I have also started admixture analysis of the reference populations and first batch of project participants.

Harappa Project New Site

As several people had asked, I have set up a separate website for the Harappa Ancestry Project at http://www.harappadna.org/.

I am keeping a link to the new site on the top menu bar here titled Harappa DNA.

I might also crosspost some items from the project here.

I have also set up a Facebook page for the Harappa Ancestry Project. Please like it on Facebook so I can get a nice short name for the Facebook page URL.

I have received several samples and will be reporting some analysis results soon. However, I do need lots of participants, so please spread the word.

Cross-posted at Harappa Ancestry Project.

Harappa Ancestry Project

I have become interested (some would say obsessed) with genetics recently. I wrote about getting my DNA test done and there’s a lot more about my own results that I plan to bore you with.

One fun application of genetic testing is inferring ancestry: Which ancestral group are you descended from? Can we estimate the admixture of the different population groups you are descended from?

Most DNA testing companies provide information about ancestry and genetic genealogy has taken off. With several genome databases (HapMap, HGDP, etc) and software (like plink, admixture, Structure) publicly available, the days of the genome bloggers are here. And I am trying to be the latest one.

In starting this project, I have been inspired by the Dodecad Ancestry Project by Dienekes Pontikos and Eurogenes Ancestry Project by David Wesolowski. The catalyst for this project was my friend Razib who I bug whenever I need to talk genetics.

What is Harappa Ancestry Project?
It is a project to analyze (autosomal) genetic data of participants of South Asian origin for the purpose of providing detailed ancestry information. So the focus of the project is on South Asians: Indians, Pakistanis, Bangladeshis and Sri Lankans.

The project will collect 23andme raw genetic data from participants to better understand the ancestry relationships of different South Asian ethnicities.

I have named it after Harappa, an archaeological site of the Indus Valley Civilization in Punjab, Pakistan.

Participation
People of South Asian origin, or from neighboring countries, are eligible to participate. The list of countries of origin I am accepting are as follows:

  • Afghanistan
  • Bangladesh
  • Bhutan
  • Burma
  • India
  • Iran
  • Maldives
  • Nepal
  • Pakistan
  • Sri Lanka
  • Tibet

Right now, I am only accepting raw data samples from people who have tested with 23andme.

Please do not send samples from close relatives. I define close relatives as 2nd cousins or closer. If you have data from yourself and your parents, it might be better to send the samples from your parents (assuming they are not related to each other) and not send your own sample.

If you are unsure if you are eligible to participate, please send me an email (harappa@zackvision.com) to inquire about it before sending off your raw data.

What to send?
Please send your All DNA raw data text file (zipped is better) downloaded from 23andme to harappa@zackvision.com along with ancestral background information about you and all four of your grandparents. Background information would include where they were born, mother tongue, caste/community to which they belonged, etc. Please provide as much ancestry information as possible and try to be specific. Do especially include information about any ancestry from outside South Asia.

Data Privacy
The raw genetic data and ancestry information that you send me will not be shared with anyone.

Your data will be used only for ancestry analysis. No analysis of physical or health/medical traits will be performed.

The individual ancestry analysis published on this blog will be done using an ID of the form HRPnnnn known to only you and me.

What do you get?
All results of ancestry analysis (individual and group) will be posted on this blog under the Harappa Ancestry Project category. This will include admixture analysis as well as clustering into population groups etc.

I suggest you read about Dienekes’ analysis on South Asians for an idea about what to expect.

You can access all blog posts related to this project from the Harappa Ancestry Project link on the navigation menu on every page of my website. You can also subscribe to the project feed.