Many profiles with uncleaned GEDCOMs are not tagged for this issue

+10 votes
288 views
While doing my Data Doctor work I've been cleaning profiles with uncleaned GEDCOM imports. I've worked through all of Wisconsin and most of Minnesota. But, as I do this I often come across many relatives of these profiles, where the profile has the GEDCOM junk problem, but is not tagged with a suggestion code (853, 852, or 851). I've left this one profile as-is so you can see an example. I suspect that, if these others could be identified, we would have a much larger set of Suggestions. Is this something that can or should be addressed?
WikiTree profile: Kjeld Skjæveland
in WikiTree Tech by Deb Gunther G2G6 Mach 2 (23.3k points)
Here's another example:

[[Bradbury-877|Charles Herbert Bradbury (1884-1968)]]

https://www.wikitree.com/wiki/Bradbury-877

3 Answers

+7 votes

Interesting. The profile has to have three "GEDCOM junk" headings. The linked profile has two not three but.... the list for the suggestion includes === Record File Number === and the profile has === Record ID Number ===

Ales? Are these equivalent?

by Kay Knight G2G6 Pilot (599k points)

Suggestion 853  has a lits of the items that Ales is looking for.  Check the Technical Stuff section where it has the Record File Number, not Record ID Number, as Kay said.  

The Bradbury one has Data Changed and User ID.  The Frietad has User ID and UPD.  

The Technical Stuff section states that it has to have 3 or more of those items in profiles to generate a suggestion.     

+11 votes

On suggestion help page it states that these phrases are checked:

=== Data Changed ===
=== User ID ===
=== LDS Endowment ===
=== LDS Baptism ===
=== Record File Number ===
=== Submitter ===
=== Object ===
=== COLOR ===
=== UPD ===
=== PPEXCLUDE ===

and if there are 3 or more it is displayed as an error. The reason for 3 occurrences is that there would be 1.5 million suggestions otherwise. In time, we will lower that limit.

You can however use normal WT+ search for GEDCOMjunk, and it returns  Found: 1209800 profiles for "GEDCOMjunk". 

You can combine that with other search keys.

https://wikitree.sdms.si/default.htm?report=srch1&Query=GEDCOMjunk+Wisconsin+&MaxProfiles=10000

by Aleš Trtnik G2G6 Pilot (808k points)

Thank you for reviewing this question. What are your thoughts on adding one more criterion:

=== Record ID Number ===

It seems to fit the intent of the search engine. I understand that it would increase the number of results, and that may not be desirable.

Ales,

I am not familiar with the GEDCOM spec, but is Record File Number equivalent to Record ID Number?

I understand that if this were added to the list that the number of profiles with the GEDCOM junk suggestion would increase. On the other hand, this would bring them to attention for cleanup.

Another way that folks can search is using the name of the GEDCOM, such as gedfile=Tomasi_Anderson_Family.ged

I checked the sources and there are a few more.

=== TAG ===
=== TAG1 ===
=== TAG2 ===
=== TAG3 ===
=== TAG4 ===
=== TAG5 ===
=== TAG6 ===
=== TAG7 ===
=== TAG8 ===
=== TAG9 ===
The heading === Record ID Number === is not included, since that is a useful tag, that identifies the record ID on the gedcom source.
Thank you, Ales for the magic word.  I was able to find 45 Acadians that needed to be cleaned up (didn't have the three problem headers but did have two).
+7 votes

I have added unsourced template for Norway as well as maintenance category "Norway, Needs Gedcom Cleanup". That way it will be seen by those working on Norwegian profiles.

If you are not up to editing these yourself, I would imagine that other countries and also US States has similar maintenance categories? You can find them with the category picker and hopefully these profiles will be taken care of by someone working on the maintenance of a country, state or geographical project.

by Maggie Andersson G2G6 Pilot (151k points)

Related questions

+7 votes
4 answers
325 views asked Sep 18, 2019 in The Tree House by Chris Orme G2G6 Mach 2 (27.7k points)
+7 votes
2 answers
+6 votes
0 answers
83 views asked Feb 28, 2017 in Genealogy Help by William Arbuthnot of Kittybrewster G2G6 Pilot (183k points)
+6 votes
3 answers
181 views asked Oct 12, 2018 in WikiTree Tech by Susan Keil G2G6 Mach 6 (67.4k points)
+22 votes
4 answers
+7 votes
3 answers
180 views asked Mar 24, 2017 in Policy and Style by Beth Jaquis G2G Crew (340 points)
+18 votes
3 answers
+5 votes
1 answer
149 views asked Aug 8, 2017 in WikiTree Tech by Paula Parker G2G Crew (400 points)

WikiTree  ~  About  ~  Help Help  ~  Search Person Search  ~  Surname:

disclaimer - terms - copyright

...