April 19, 2012

How Many Relatives Do You Have?

This month, 23andMe published a new article in PLoS One entitled “Cryptic Distant Relatives are Common in both Isolated and Cosmopolitan Samples.” Authored by former 23andMe scientists, Brenna Henn and Lawrence Hon, along with current 23andMe scientists and 23andMe advisor, Itsik Pe’er, of Columbia University, the article describes the nuts and bolts of the method behind our innovative Relative Finder tool. In this post, Brenna discusses what the findings mean for our notion of “relatives” and explores the many special features of the 23andMe customer database.

By Brenna Henn

You probably know how many siblings you have or the number of first cousins. But do you know how many 2nd cousins you have? Does 38 sound reasonable? What about the number of 3rd, 4th, or 5th cousins? The answer may surprise you. Under a simple model where a family has 2-3 children, you would have 190 third cousins, 940 fourth cousins and a whopping 4,700 fifth cousins. Going back to 8th cousins (or 9 generations back to a common ancestor), the model we developed predicts you would have over half a million 8th cousins!

Now, this model is a simplification because some families have a dozen children and some have none; but this model does help illustrate just how many potential distant cousins are out there. Finding a relative in the 23andMe database, however, rests on two other important conditions. First, the relative has to be a 23andMe customer. Second, you and the relative need to share a long identical segment of DNA. The more distant a pair of cousins, the less identical DNA they share. Very small fragments of DNA can be hard to detect with our algorithm. So although you might have thousands of 6th cousins, we think we can only detect about 4% of them with 23andMe’s current technology. Our detection success is much higher for more closely related cousins, though – we can detect about 46% of 4th cousin pairs and 90% of 3rd cousin pairs.

Simulated data showing the relationship between shared identical segments of DNA (IBD-half) and # of shared segments for different degrees of relatedness in a population with European ancestry.

Our ability to detect a person’s distant cousins is also influenced by the ancestry of the individual. If you are Ashkenazi Jewish, you may have noticed that 23andMe’s Relative Finder feature shows you over a thousand cousins. This is because Ashkenazi Jews are more closely related to each other than a random sample of European-Americans. Over the past several hundred years, a cultural tendency to choose marriage partners of the same ethnicity (also known as endogamy) means that Ashkenazi individuals are more likely to share the same ancestors. In fact, we estimate that any two randomly chosen individuals who identify as Ashkenazi are on average the genomic equivalent of 4th-5th cousins, because they share many recent common ancestors.

This phenomenon doesn’t just occur in the Ashkenazim. We looked at 121 populations, many from the 23andMe customer database. Pairs of individuals in Iceland, Finland or South Africa are more closely related than pairs of individuals from Italy or Japan.

Connect with 23andMe members who share DNA and ancestors with you using the Relative Finder tool. Not yet a 23andMe customer? Visit our store or learn more!

In our analyses, we also looked at DNA data for over 5,000 individuals with just European ancestry in the 23andMe database. The DNA indicated that in this sample there are over 5,000 3rd cousin pairs and 30,000 4th cousin pairs. This result is important beyond thinking about genealogy. It means that large disease association studies that sample thousands of individuals will have many pairs of distant cousins in the dataset. By identifying cryptic (or non-obvious) relatives in databases, researchers can see if certain disease mutations occur more often in different extended families.

Twitter Facebook Google+ Email Email to a friend

You may also want to check out:


Comments

  1. MJ Bailey says:

    I’m an only child, my dad one of 3, my mom one of 4. In my main file, there are 12,842 people to whom I’m directly related.

    In a different file, 12, 428–some of these may be duplicates of the above file, but I’m still working on both to extend as far as I can all collateral lines.

    MJ Bailey

  2. sammi says:

    the article above says: “in fact, we estimate that any two randomly chosen individuals who identify as Ashkenazi are on average the genomic equivalent of 4th-5th cousins, because they share many recent common ancestors.” How recent are those recent ancestors? Recent can mean different thinks to diferent people.

  3. Ponto says:

    I come from a Maltese family and one with 8 children. Large families were the norm in Malta up to post recent times.

    I have to say the RF algorithm does not work well for Maltese people as it appears we Maltese people are all related at the 5th to 10th cousin level. A predicted 3rd cousin turned out to be a 6th cousin which was understandable since his Maltese ancestry was back in the 19th century, and his family was Greek born for generations. I have basically learned to tune out the predictions, and just hope my RF cousins have good genealogical records to find the connection. Unfortunately, most don’t have good genealogical records.

    By the way, from genealogy, my parents were 6th cousins once removed. I am sure they would be surprised by that, though not shocked.

  4. Peter Meijlink says:

    Dear Sir/Madam,

    Recently I discovered that my aunt had a match of 13 cM, [shared 0.17] with an American with Dutch ancestry. In her list of matches he was number 33.
    His mother has a match of …cM, [shared0.21], she is number 22 of the list.
    After exchanging genealogical data with the American gentleman, we found out that we shared a common ancestor [Jan jansen Beelen, born 1751 in Hierden in Holland, he died in 1809].
    This ancestor is the grgrgr-grandfather=5 generations of Lydia [85 years] and the grgrgrgrgr-grandfather=7 generations of the American gentleman [43 years].

    I guess and I am afraid that one can NOT (always)use this (wonderful) result as a standard for other cases, am I right?

    Sincerely, Peter Meijlink, Holland

  5. KatieR says:

    I’m one of 5, mother’s one of 7, father’s one of 2. I have 21 1st cousins, and although I haven’t worked out the number of 2nd cousins, I know for sure it’s well over 38! Grandparents were one of 5, 6, 6, 10. One more generation back, they almost all come from families of 10+ kids. This is why I don’t trace descendants. I’d never have the time to get around to tracing ancestors if I did, ;)

  6. Diane Runyan says:

    I’m one of 5 children, my mother was one of four, as was my father. I have 10 first cousins on one side of my family, and don’t know yet about the other. This is a mystery as my mother was adopted at a young age.

    It is encouraging for me to see this prediction of 38 cousins, since my hope is that I find a second, third, or fourth cousin who can collaborate with me on the family history. I also hope that we can talk about some tendencies towards auto-immune diseases which seem to be popping up recently.

    Is anyone noticing any correlation between auto-immune diseases and marriages between distant cousins?

  7. Michael says:

    Would populations that tend to be more closely related have worse health individually, or do those effects only start to show up significantly when first cousins and up are having children?

  8. Barbara Balen says:

    My matches are 998. I would love to be Jewish and celebrate more holidays. But…
    Please discuss the Early American Colonist effect.

  9. I heard a few years ago that in the near future females like myself would be able to trace their fathers line somehow, Has that question been answered yet? I do not have any living male relatives on my fathers side of my family and since I am the last living descendent on that line, there is no hope, I have traced his line back into the early 1700′s and on census I am unable to find any straight line male relatives. Can you help me with this? Eleanore Sullivan Kilpatrick

  10. Jerry Sexton says:

    You think this is a mess for sorting out cousins?

    I am the product of successive grandfathers who lived into their 90′s, and had kids into their late 50′s and early 60′s, and then compounded it all by marrying multiple times.
    My 8th generation back was born in 1720, and my 10th generation was born in 1650.
    Even in recent times, my great-grandfather was 59, and his wife was 57, when they had my grandfather. And my grandpa’s oldest brother was already in his 40′s when gramps was born.
    So … when 23andMe uses their predicter to tell me someone is a 5th or 6th cousin, I just smile … they might be 10th cousins if it’s on one of the maternal branches and their own pedigree lines reproduced in a normal time frame.

    I’ve already had to sort this issue out with a number of cousins.
    It can be highly entertaining.

  11. Walt says:

    A woman has a X from her mother & a X from her father. How does the mother’s X know to go back on her mother’s line instead of the father’s X line?

  12. Bob Jones says:

    Sometimes there are large generation gaps. I know a 35 year old women who’s Grandfather died in the 1940s and Great-Grandfather died in 1880 I’m 33 years old, but her great-grandpa is my third-great-grandfather.

Leave a Comment

*

Return to top
Twitter Facebook Google+ Email Email to a friend