G2G: I took full sequence mtDNA test, and results say I belong to H4a1a1a group. What does this mean?

+8 votes
1.6k views
....and HOW do I get a crash course on understanding what all this means? I feel so ignorant because I clearly have no clue how to use the completed mtDNA results to help me find maternal DNA matches or how they connect. I really need LOTS of tutorial help! Videos? Step by step?

Thank you so very much!
in Genealogy Help by Jerri Samuel G2G2 (2.8k points)

4 Answers

+4 votes
 
Best answer

Hi, Jerri, and welcome to the H4a1a1a club! Yep: I carry the same haplogroup...which means that somewhere in the shadow of time we share a common matrilineal-line ancestor. There have already been some good answers, so I'll try not to repeat them.

TL;DR: mtDNA may seem like it should be simple to use for genealogy, but it isn't. In fact, it's probably the most difficult type of DNA to employ as a form of positive evidence...and sometimes even as negating evidence. Understanding autosomal DNA and yDNA doesn't completely prepare you to work effectively with mtDNA. And due to the very nature of the mtDNA we test, the results we obtain could potentially contain inaccuracies in the final reporting. This won't be any help as a tutorial. Sorry. It's really just an overview of four of the potential stumbling blocks that arise when using mtDNA for genealogy.

Sooner or later I'll learn to consistently do those "tool long; did not read" summaries. Ahem...

Part 1

Family Tree DNA is at work on a new(ish) mitochondrial DNA project that will hopefully provide us in the future with more information and tools, similar to what is available today for takers of the Big Y test. For now, if we look at YFull we see that they estimate a median appearance date of our H4a1a1a haplogroup to be around 2,700 years before present.

An interesting research paper in 2020 found, unexpectedly, the remains of a mummified Egyptian woman from circa 660 BCE to be haplogroup H4a1. From the paper's abstract: "Neither H4 nor H4a1 have been reported in ancient Egyptian samples, prior to this study. The modern distribution of H4a1 is rare and sporadic and has been identified in areas including the Canary Islands, southern Iberia and the Lebanon. H4a1 has also been reported in ancient samples from Bell Beaker and Unetice contexts in Germany, as well as Bronze Age Bulgaria." Of note in the finding is that the 660 BCE date places a predominantly European haplogroup "in an Egyptian individual in Southern Egypt, prior to the Roman and Greek influx (332BCE)." (Drosou, et al. "The First Reported Case of the Rare Mitochondrial Haplotype H4a1 in Ancient Egypt." Scientific Reports 10 (12 October 2020): 17037. https://doi.org/10.1038/s41598-020-74114-9.)

Does this help at all with genealogy? Well, maybe a little. It definitely means that if you have a hypothesis about your matrilineal line and multiple cousins test--some as H4a1a1a and some as, say, H2-something--you can rule that hypothesis as false. But using mtDNA as a from of positive evidence is difficult, time-consuming, tedious, and often done incorrectly. And sometimes false-negatives can be overlooked.

So consider this just a summary of four of the "gotchas" that come along with mtDNA for genealogy. And, I suppose, as a way to say that no, it is not easier to grasp than you think it should be, and you are most definitely correct to be confused. Truthfully, anyone who presents mtDNA as being simple and straightforward doesn't completely understand it.

Mitochondrial DNA deals only with an extraordinarily tiny DNA molecule, passed down only from the motherline, does not recombine like autosomal DNA or the X chromosome...yet it is the most difficult and imprecise form of DNA to work with for genealogy.

Gotcha #1

First up is the seeming contradiction that mtDNA mutates with glacial slowness, but then again somatic mtDNA, the mitochondrial DNA in our body cells, shows rapid mutation rates. How can this be? Jessica mentioned "germline" DNA, and that's what creates the confusion here. All of a mother's oocytes--the egg cells that will become mature ova--are formed in utero, before she is even born. In a mature egg cell there can be as many as 100,000 mitochondria, but as a female ages no new mitochondria are introduced into those egg cells: they work with what they have, so the numeric odds of mutations go way down compared to what happens in our body cells. This is the germline mtDNA, that which is inherited by the next generation.

You have, give or take, about 4 quadrillion mitochondria in your body. A challenging number to visualize. If my math is correct, that's over 125 times larger than the current size of the entire U.S. national debt. As you read this the little organelles (their DNA isn't part of the rest of our DNA that's inside the cell nucleus; the mitochondria exist inside the cell wall but outside the nucleus) are working away helping to create the energy for our cells that we need to live. And mitochondria don't even exist in our most prevalent type of cell, the red blood cell.

A given cell's mitochondria have a replication half-life of as brief as 8-11 days for things like skin cells, and 20-30 days for long-lived cells like neurons (Holt and Davies, 2020). So those quadrillion mitochondria are replicating at a rate far faster than the body cells themselves. Theoretically, you will turn over all 4 quadrillion at a median rate of about 9 times every year. If you live to be 80, that works out to you housing, over the course of your lifetime, somewhere around 2.9x1018, or 2 quintillion 900 quadrillion, different mitochondria. The mind boggles.

No copy machine is that good. Copy "mistakes" happen and we know that more than 61% of us--probably all of us--carry mitochondria whose DNA is not identical due to these minor copying errors (Ramos, et al., 2013). Yet we can only test our body cells, as done currently via saliva or epithelial cells scrapped off the cheek. We don't actually test the germline cells, at least not for genealogy. These copy errors are so prevalent that Family Tree DNA has a policy that only if a minor allele--one less prevalent in a sample at a given position in the DNA--is in a concentration of 20% or greater will it even be reported.

Gotcha #2

That very slow mutation rate for germline mtDNA can lead to a more optimistic view of possible relatedness in a genealogical timeframe than may be merited. For example, FTDNA's longstanding statement here has been: "Matching at HVR1, HVR2, and the coding region brings your matches into more recent times. It means that you have a 50% chance of sharing a common maternal ancestor within the last 5-16 generations (or about 125-400 years)."

I don't believe that's changed in more than a decade. But it's important to note that "matching" in that equation means zero reported differences, though for the full mtDNA test FTDNA will report 0 through 3 nucleotide differences (full disclosure: FTDNA ignores completely two locations on the mtDNA molecule, 309 and 315, because those see insertions/deletions way too frequently to mean much). Also, I think the 50% confidence interval throws some folks, and even at that level it would be considered overly generous when compared to recent research into germline mtDNA mutation.

Recent estimated average mutation rates for a germline mitogenome--the whole of the mtDNA molecule--are as low as only one instance in every 70 generations (Anderson and Balding, 2018). Per that publication, using the Översti method for a non-endogamous population, we see that a cumulative probability of an exact full-sequence mtDNA match at very roughly 90% intersects with approximately 190 birth events. If we take the overall average length of a matrilineal generation to be just 20 years, that would be 3,800 YBP. At a 50% probability, it looks to be about 31 or 32 generations, or about 620 years. Andrew Millard, Philip Gammon, and possibly others have also calculated that the average number should be about 31 generations.

by Edison Williams G2G6 Pilot (512k points)
selected by Lucas Van de Berg

Part 2

Gotcha #3

Obviously, if we can have thousands of mitochondria inside a single human cell, they have to be quite literally minuscule. And they are.

The DNA of our friendly little organelle, the mitochondrion, contains only 16,569 (give or take a deletion or insertion) base pairs, the nucleotides, or pairs of "letters" of DNA. As DNA goes, the whole thing is about 400 times too small to even register as a single segment using our current direct-to-consumer autosomal DNA testing technology. Included in that tiny molecule are a regulatory region and 37 genes that code for 13 polypeptides, 22 tRNAs, and two rRNAs. These account for over 80% of all the DNA base pairs in mtDNA, and mutations there often mean trouble to the host human cells.

Most markers useful for genealogy occur in the other 20% of the mtDNA, where mutations don't risk the viability of the organism. By comparison, the Y chromosome is about 57.2 million base pairs long and contains around 107 protein coding genes. The end result is that there isn't much room available for mtDNA to mutate.

That's a factor that makes mtDNA haplogroups look much more confusing than yDNA haplogroups. With yDNA, the single nucleotide polymorphisms, or SNPs, are pretty straightforward and hierarchical. Meaning that if you have, for example, the SNP BY3332 tested as positive, you'll also be positive for its parent, ZZ12_1, and its parent DF27...each is unique to its branch in the haplotree.

But it doesn't work that way for mtDNA. Our H4a1a1a haplogroup is defined by A73G! This means that we have guanine in position 73 instead of adenine, and the exclamation point designates a marker which is considered non-phylogenetic for the haplogroup; specifically, they are "back mutations" where a certain value expected within a main haplogroup is not present. So we're already pretty confusing, aren't we?

Being H4a1a1a means we're also H4a1a1, and that branch is defined by A10044G. Likewise, we're also H4a1a, defined by G8269A; and we're H4a1, defined by C14365T. The H4 branch is defined by A4024G and A14582G.

But wait. A73G! currently appears 12 different times in the mtDNA haplotree! It's also a defining mutation for H13a2b3, H17c, H1a, H1e2c, H32, and others. How is that possible? Unlike yDNA where the SNPs are (typically) unique to a given haplogroup and branch of the tree, with mtDNA--because there are far, far fewer possible mutations--the haplogroups are defined by the total aggregation of their mutations, not individual ones. For example, the A10044G that defines our H4a1a1, is also a defining variant of the haplogroup L3h1b...not even in the same basal "H" clade as you and me.

Gotcha #4

The biggest problem with mtDNA in general is that its germline mutation rate is so slow that its use as a positive form of evidence in genealogy is really, really tricky.

One quick (and admittedly fairly hyperbolic) example there is that the entire mtDNA haplotree contains (currently; this may change in a year or so) a total of 5,468 branches. So that's the total number of haplogroups that have been identified. We know that, for instance, our H4a1a1a haplogroup means that we would also show up as H4a1a1, H4a1a, H4a1, and so on. But let's ignore that for the moment and assume there are 5,468 distinct haplogroups in the world. The current global population is 8.03 billion people. If everything were neatly averaged out, that means the world has about 1.47 million people in every mtDNA haplogroup. That's roughly the population of San Antonio, Texas, or Ganzhou, China.

A lot of presumptive positive matches might be made where the DNA evidence, on its own, can't really support that. In the Group Projects I admin at FTDNA, my all-time leader in reported mtDNA full-sequence DNA matches (again, a distance of 0 through 3) has over 1,200 matches and gets at least a few new ones every month.

But with mtDNA, the reverse problem can happen, also: false negatives. Matches that really do exist but that you may never know about. This goes back to the matter of our not having a single mitogenome throughout our body cells, but multiple mtDNA "signatures."

Perhaps the best and most clearly explained example of this comes from noted genetic genealogist, Dr. Blaine Bettinger, in an April 2018 article where he explains why he and his biological mother don't show as being a match at FTDNA. That means the test results came back and he and she were at least a "genetic distance" of four or greater.

The one inaccuracy I'll note in the article is that Blaine writes, "We have something like 3 trillion cells in our body, almost all with a copy of our DNA." The number of cells in the body--depending on the size of the person and not counting bacteria, our microbiome--will range from around 30 trillion to 40 trillion based on work published in Molecular Biology of the Cell (Roy and Conroy, 2018); the authors arrived at an average of 37 trillion. And over 80% of those are red blood cells; platelets have mitochondria, but mature red blood cells have neither mitochondria nor nuclei.

The good thing about that false-negative was, being mother and son, Blaine had all the raw data so he could figure out what was going on (but he also recruited a cousin to help get to the bottom of it). It may be a bit in-depth, but it's a useful read for those diving into mtDNA, and it also shows the various notations reported when heteroplasmies are detected, plus the use of YSEQ in Germany as second-opinion testing in unusual cases.

The false-negative situation is very probably rare, but Blaine's example shows clearly that it exists. The difficulty this presents in genealogical research is that you may have a solid, paper-trail hypothesis about an in-common matrilineal ancestor but you and a cousin both get a full-sequence mtDNA test and...no match. Even as a form of negating evidence, if the documentary evidence is good, it may still require multiple test-takers in order to use mtDNA evidence effectively.


+9 votes
Haplogroups relate to Y-DNA - the male chromosome or the Mitochondrial DNA of a female. In Genealogy it's main use is in tracking your father's father's father and so on or your mother's mother's mother and so on. It won't help much with other lines. The H4 haplogroup and it's decendants relate to the female side. Your mother and her mother will have the same group. If you look on something like https://www.familytreedna.com/groups/mt-dna-h4/about/results you will see the general area that hapolgroup is mostly found in. Mutations happen over time and such changes can help you identify individuals you are more closely related to. You might like to use a site like GedMatch which compares matches irrespective of which companies they've used but you will have to upload your results before you can do this. Otherwise you will be relying on your testing company to show you possible matches you might be able to contact. Definitely watch youtube videos about all this as it is quite a learning curve. However, in my experience DNA will only help with matches that are 4th cousin or closer. Good luck.
by Jean Hollis G2G6 Mach 2 (27.0k points)

Thank you very much! I am still trying to navigate my way around on here and on FamilyTree DNA. I will be taking some time off to make myself more familiar with these sites. I have used GedMatch and other companies. In fact, I have tested with almost all the companies or uploaded my DNA. I probably need to slow down and learn more and understand the process better. I was waiting for all the answers to just appear on their own. I appreciate the help so much! ...and yes, I will watch some videos and read up more.

+15 votes
Hi Jerri! What this means is that a long time ago, a few thousand years ago, perhaps somewhere in what is now southern Iberia, a baby girl was born. We don't know her name. We do know that she had some mutations in her mtDNA that we now classify as H4a1 -- we all have some slight mutations, but in her case these mutations occured in what is called her germline cells. This means she was able to pass on her slightly mutated mitochondrial DNA to all her children, and her daughters would, in turn, pass it on to their children.

This unnamed woman, and her daughters, and their daughter's daughters, and so forth and so now, continued passing down their H4a1 mtDNA in an unbroken maternal line to you and I today. Because you and I both share this mtDNA, this means that at some point, we are descended from a pair of sisters who were themselves the daughter's daughter's daughter... etc, of this first woman to carry this H4a1 mtDNA.
by Jessica Key G2G6 Pilot (335k points)

Thank you so much for this beautiful explanation! <3  I love it, and it does help me quite a lot; however, maybe I need to do some homework to feel comfortable with everything.  I guess I should have learned more about DNA in general before jumping into the mtDNA testing. Sincerely though...Thank you again!

+8 votes

Hello Jerri,

See http://scaledinnovation.com/gg/snpTracker.html?snp=H4a1a1a&mt

and

http://scaledinnovation.com/gg/snpTracker.html?snp=H4a1a1a&mt&tab=snps

and

http://scaledinnovation.com/gg/treeExplorer.html?snp=H4a1a1a&mt

Please add your mtDNA to mitoYDNA.org to automatically link from your WikiTree profile to the above and so your mtDNA can be used to confirm the accuracy of your matrilineal line, and you can learn more about the history your direct maternal line.  Here is a video which shows you how to add your mtDNA from Family Tree DNA to mitoYDNA.org:  https://www.youtube.com/watch?v=X4ZnlVQRX9g

Also, here are step by step (written) instructions: https://www.wikitree.com/wiki/Space:MtDNA_instructions_for_mitoYDNA

Here are some of the benefits of having your mitoYDNA ID in WikiTree:

Linking to mtDNA HVR differences from the rCRS

https://m.youtube.com/watch?v=cDZoZBiDp5E&feature=youtu.be

Compare two or more mitotypes in WikiTree via mitoYDNA 

https://m.youtube.com/watch?v=apiZk2-80ZY&feature=youtu.be

by Peter Roberts G2G6 Pilot (767k points)

Thank you, Peter, for the links. As you can see, I am very much in need of assistance. I will definitely view all of them in the next week or two to better educate myself on all of this. I think everything I have learned so far is so fascinating, and I want to learn so much more. I did add my results to this site, but I did not go through mitoYDNA.org though, so I guess I need to still do this.

Again, I appreciate the expert help!

Jerri

Related questions

+9 votes
1 answer
asked Dec 3, 2014 in The Tree House by Veronica Williams G2G6 Pilot (231k points)
+3 votes
0 answers
+3 votes
2 answers
+8 votes
2 answers
+4 votes
1 answer
+7 votes
1 answer
asked Jun 15, 2018 in The Tree House by Peter Roberts G2G6 Pilot (767k points)
+9 votes
18 answers
asked Mar 13, 2019 in Genealogy Help by AL Wellman G2G6 Mach 2 (22.2k points)
+5 votes
2 answers
...