GEDMATCH and WikiTree

+29 votes
988 views

Some suggestions. 

I feel WIkiTree and GEDMATCH is a perfect match but we need to make it easier for the users to do the DNA genealogy research 

  1. on WikiTree profiles with more DNA connections with gedmatch ids we could auto generate a matrix with the cM relationship between DNA Connections on the profile using Gedmatch 

    e.g. profile Ersson_Torsfält_Thor-1


     
  2. have Project Database errors for profile with gedmatch and wrong privacy.

    Today you find people in gedmatch with a WIkiTree profile but you cant access it because of privacy settings. With a db_error some people will be aware of that
     
  3. have Project Database errors for DNA matches for profiles with gedmatch id and to low or no shared cM. On profile  Ersson_Torsfält_Thor-1 we see that A933552 and M0130018 share no segments over the threshold selected

    ==>

    could be false paper research done
     
  4. have Project Database errors for gedmatch IDs having a WikiTree profile but are not connected on WIkiTree
    1. Maybe when the WikiTree location data is better we could suggest in what area/location there is a match ny creating maps on the fly
    2. Group people who share the same segments

Example how another tool generate hotspots for people who has done the DNA tests and share DNA segments and have a family tree uploaded...



 

WikiTree profile: Lars Thor
in Genealogy Help by Living Sälgö G2G6 Pilot (297k points)
retagged by Ellen Smith
I am always interested in your very good ideas.  Unfortunately these ideas are often above by head.  (...makes me feel like Charlie in "Flowers for Algernon".  :)

...please continue with these good ideas.  You have good vision and maybe this is one that will be developed.

Its already there but you need to take the GedmatchID over to GEDMATCH from WIkiTree and do the matrix/reports yourself....

I guess WIkiTree would maybe be the prefered place to do DNA genealogy if this was done...

Aleš has proved that some tasks are easier done with computers and this is another one...

Like Vincent, I am following your brainstorms, Magnus (-:
Great ideas, especially the db errors suggested, thanks!
Magnus, I love your ideas, matrix - FABULOUS, db errors - AWESOME.
Following. Very interesting and interested!

5 Answers

+10 votes
Great information Mangus.  I think as a member of the DNA project it may be beneficial to join the DB_ERRORS project.  I plan to use db errors reports to identify and contact members who have taken a DNA tedt but have privacy levels set too high preventing the identification of potential matches on Wikitree.   I would like to join specifically for that reason but l'm willing to help where needs exist
by David Douglass G2G6 Pilot (127k points)

Sorry: Its just a suggestion what can be done...

You can start in GEDMATCH if you have Tier 1 then you can connect with WikiTree see video https://www.youtube.com/watch?v=ZhSAph_Zggc 

Thanks Mangus I am subscribed to Teir 1.  An error report that could alert users to profiles that have DNA test information entered but whose privacy levels are set too high to permit matching would be useful I think

One way to identify these profiles is to examine the list of badged profiles and look at the color of the privacy indicator.  There are around 30k profiles that are badged so I was looking for an easier way.
+8 votes
I really like the idea of having the gedmatch compare prepared on the profile, or at least available in a DNA tab/section/widget page.  My one concern would be implementing it so it doesn't overwhelm either WikiTree or gedmatch servers, especially when first turning it on.  Even with "only" 30000 or so DNA tests logged on WikiTree, and only some of those on gedmatch, there would be a lot of profiles that would need a DNA comparison chart.

I don't like the database errors for DNA lack of match, unless it's done only for very close relations.  In your example, the two descendants who do not match are 4th cousins once removed.  It's quite possible for them to share no significant DNA segment and still have the paper genealogy be correct; a database error here is pointless and likely to cause more mistakes than it does solve actual problems.

I'm not sure how I feel about the idea of trying to automatch people who are not connected.  I sort of feel like it's outside the bounds of what WikiTree should be focused on, there's a lot of chances for mistakes, and also it could get very computationally intensive quite quickly.  That's probably better left in the hands of gedmatch and the DNA testing companies.
by Stephen Haley G2G6 Mach 2 (25.4k points)

>>overwhelm WikiTree or gedmatch servers

I guess the load is on the GEDMATCH server and it can be run when people in the States are asleep. This I guess could be an argument for more people to pay for Tier 1 membership....

>> I'm not sure how I feel about the idea of trying to automatch people who are not connected.

Why not? Computers are superior on pattern matching....

Best would be if WikiTree used templates for sources with unique IDs. For Swedish sources if we use Arkiv Digital we have unique IDs for all different church books and also the pages in the church book ==>

  1. A computer could rather easy compare two family trees and say if those family trees shared the same sources = church books ==> it would be a good candidate for two profiles also sharing DNA ​

    e.g. v13295.b124.s118 is 
    v13295 = Book Household record Sunne (S) AI:14 (1772-1775)
    s118 is page 118 in the above book 
    on this page in the household book we find Jansdotter-205 and Ersson_Torsfält_Thor-1

I think today if a new person is added to the the DNA section you don't get an alert which is maybe something that would be also interesting ==>

Suggestion 2 Weekly DNA email: You get an email every week with new people connected as DNA matches for profiles in your DNA tree

This email will tell

  1. Profile Persson-1427 that is possible to confirm with your DNA relationship has a new DNA connection 
    1. New Barbro Maijgren Find Relationship : Family Tree DNA Family Finder, GEDMatch F311279, FTDNA kit #311279 [test details]
    2. Magnus Sälgö Find Relationship : Family Tree DNA Family Finder, GEDMatch T767406, FTDNA kit #320352 [test details]


  2. You are DNA connected as follow
  3. Other people on the profile match each others DNA as below

     
  4. You have the following connection in WIkiTree

    Big pic
  5. To learn more use the GEDMATCH Tier 1 functionality
The email idea would be brilliant!  As it is I only find new people when I go searching, I have actually stopped doing this, so an email would keep me motivated within the WT community.
I really like the email idea as well!
+5 votes

Hey Magnus,

We are on the same page. The DNA Project and the Database errors Project are working to coordinate some of these ideas. If you visit the DNA Project Page Tasks list, you will see we are working on it.

Join us!

Mags

by Mags Gaulden G2G6 Pilot (642k points)
Cool I think WIkiTree together with WikiTree+ and Gedmatch could make life easier for DNA Genealogy researcher....

Try to get Aleš in the boat ;-) I am more on Wikidata like the approach with structured data and connect....
Aleš is being pulled aboard kicking and screaming...not really but you started the analogy! Mags
+2 votes

Allow Phased kits to be added to WikiTree.  For example, I can generate phased DNA results for my dad using my results and my mom's results using the GEDMatch Phasing tool.  Now I have a kit for my father with the ID of PM092132P1. But I cannot enter this as my father's kit because it fails the validity check of 1 alpha followed by 6 digits.

by Rick Watts G2G4 (4.7k points)
Please describe the instances where it is more useful to use your phased maternal GEDmatch ID rather than using your mother's GEDmatch ID?

WikiTree would need to associate your phased maternal GEDmatch ID on your profile with only your mother's ancestry, and your phased paternal with only your father's ancestry.

Your IDs for your phased results can already go into the note field of your auDNA results.
I entered my phased kit for my parents as "Other Autosomal" on both their profiles. It is necessary to add the trailing digit that gets dropped, which I note on the profiles.

Hi Peter,

Perhaps I did not describe it well.  My mom and I are still living, so we did the 23andme test.  I have both of us entered in GEDMatch and those kit numbers associated with us.  GEDMatch allowed me to create a phased kit for my father using my mother and my kit ID.  It is PM092132P1  which does not match the requirements for a GEDMatch kit ID of 1 alpha followed by 6 numerics.  So when I tried to add my father's phased kit to his profile as an other auDNA test and placed PM092132P1 in the GEDMatch kit Id, it failed validation.  I could just put it in the comment, but that does not serve any purpose.  Having the kit id displayed as a GEDMatch kit id would allow others to match against it.  Having it in the comments will require people to dig through his profile to even find the id.

Thanks,

Rick 

I'm not sure why it didn't work, I don't get a validation error, it's listed there along with my other tests. But, the last digit is dropped, so that is noted in the comments - but at least the test appears, so it is seen - if it doesn't work, then people can look at the comments and see that the last digit has to be added.
+2 votes
I love most of these suggestions. I wish you would restore the video that you had posted in another thread, detailing more of the tools you use in these endeavors, which has since been deleted from youtube.
by Theresa Myers G2G6 Mach 1 (15.4k points)

Related questions

+9 votes
4 answers
+5 votes
5 answers
357 views asked May 25, 2022 in Genealogy Help by Carolyn Martin G2G6 Pilot (283k points)
+4 votes
3 answers
+3 votes
1 answer
+4 votes
2 answers
+9 votes
3 answers
301 views asked Nov 1, 2020 in WikiTree Help by Matt McNabb G2G6 Mach 3 (37.1k points)

WikiTree  ~  About  ~  Help Help  ~  Search Person Search  ~  Surname:

disclaimer - terms - copyright

...