I'm back. Aren't ya thrilled?
Funny you should mention ERSA, Kerry...and thanks for reminding me of it. I'd seen that paper you linked, and another from 2014 (in PLOS Genetics) that was even more interesting to me. The lead author of that one was Hong Li, and any of you who have looked at the ISOGG article on excess IBD sharing will recognize the name because he's one of the very few researchers who has delved into that issue. In that 2014 paper, it indicates ERSA was updated to v2.0 (now v2.1, but I gather that was issued a few years ago) in order to account for and mask known pile-up regions. In the abstract, it notes: "We identified several genomic regions with excess pairwise IBD in both the pedigree and control datasets using three established IBD methods: GERMLINE, fastIBD, and ISCA. These spurious IBD segments produced a 10-fold increase in the rate of detected false-positive relationships among controls compared to high-density microarray datasets."
With a little digging I learned that Chad Huff, principal author of the first paper, was almost a neighbor (well, same major metropolitan area). The ERSA modeling software--no idea of OS or platform requirements--is available for download but registering to receive the download link requires a questionnaire that specifies institutional affiliation and contact information for the principal investigator. But I did try registering a little over a year ago; got no response. May try again now; would love to experiment. Google-Fu can find it, but I'm not linking to it openly since it's clear they don't mean it to be for general public use.
And really quickly, a "coefficient of relationship," a CoR, really is a thing. I know this isn't what you asked, but I personally feel there is a lot of value in knowing the theoretical average sharing amount regardless of any experiential data we can get. If nothing else it provides a consistent, justifiable baseline against which we can benchmark results and additional data.
It's basically what RJ described, though deals exclusively in percentages. Here's the formula (it's what I used to build this table):
"R" is the coefficient, resulting in the theoretical amount of shared DNA expressed as a decimal. "X" and "Y" represent the two individuals involved, and "n" is the number of direct links--meiosis events--between the individuals, counted only in one direction, X to Y. The summation operation is necessary in order to calculate the correlation of two living individuals.
Simple example: your 2g-grandmother to you. Starting with her, "X," there are four meiosis events to get to you. So 1/2 ^ 4 = 0.0625; shared percentage is 6.25%; shared centiMorgans (using ISOGG Method II, and what we see at GEDmatch, FTDNA, etc.) would then be 425cM.
When you're comparing two individuals to a common ancestor, a collateral relationship, you add up the number of discrete meiosis events along both legs of the inheritance chain to each common ancestor by starting with X, counting back to the CA, then counting forward again to Y. For example, for 2nd cousins sharing a g-grandparental couple, for X you've got three meiosis events back and three forward, for n = 6. Raise 1/2 ^ 6 = 0.015625. Y is the same because the MRCA couple represents two ancestors in common, so you sum the two results of n for 0.03125, 3.125% shared DNA, or 212.5cM.
A half 2nd cousin would be exactly half that, or back to the 1.5625% figure because only one g-grandparent is shared: you can count the meiosis trail only one time; in other words, you count the meiosis trail passing through no individual which is not a common ancestor more than once.
"Coefficient of inbreeding" is also a thing, but it's more complex and relies on a known result of the coefficient of relationship. A few months ago I said I might write something up about CoI for genealogy, but moved it off the radar. May still be worthwhile, but the problem is that in order to calculate the CoI, the actual ancestry both to and prior to an MRCA has to be known. For genealogy, we're typically looking to investigate hypotheses about that ancestry...we don't already know it, or we wouldn't be looking for a CoI result as guidance. Catch 22.
Dunno. I may still do it. But the CoR can be used to evaluate theoretical sharing for any possible relationship if all parentage is known along both legs of the inheritance chain of two people. If anyone gets bored and wants a pedigree collapse problem to use the CoR on, here's a nifty one from a friend, Yvette Hoitink: https://www.dutchgenealogy.nl/worst-case-of-pedigree-collapse-ever/. "She [Engelberta Harmina Roerdink] descends from her great-great-grandparents Jan Roerdink and Clasina Rengerdink four different ways, as all four of her grandparents are grandchildren of this couple. Following the family tradition, she herself married her first cousin, another great-great-grandson of Jan Roerdink and Clasina Rengerdink."