upload image

Database errors project (April 2nd 2017)

Privacy Level: Open (White)
Date: 2 Apr 2017 to 9 Apr 2017
Location: Worldwidemap
Surname/tag: data_doctors
Profile manager: Aleš Trtnik private message [send private message]
This page has been accessed 641 times.

Categories: DD Suggestions.

This page is part of the Data Doctors Project.
Latest report: February 10th 2019 and the Spreadsheet.
Custom reports by: Suggestion lists, Unsourced lists, Unconnected lists.
See WikiTree+ for custom reports and statistics.
Data Doctors Challenge: Dates_VIII .

Analysis was done on data from April 2nd 2017.

Related post.

Here are pages of errors lists with basic person data and links to WikiTree.

Contents

News

Cosmetic changes

Added direct jump to History of the profile in all reports. It is Link H under wikitreeID.

With error 572, I list all linked memorials, so it is obvious who the links are for. Example: http://wikitree.sdms.si/function/WTStatus/Status.htm?ErrID=572&UserID1=15535724&UserID2=0

Added new errors 851 GEDCOM uncleaned Interpret date and 852 GEDCOM uncleaned Parse Lastname

This are dates and lastnames, That gedcom import didn't use in profile data. They should be done manually, but never were. Actually some were, but this line remained in the Biography.

2359150 Errors Total 0000-0000 0001-1499 1500-1699 1700-1799 1800-1899 1900-1999 2000-9999 Open New
851 GEDCOM uncleaned Interpret date 74426 34181 336 3175 8238 23462 5021 13 42154 74426
852 GEDCOM uncleaned Parse Lastname 10212 4324 218 907 1439 2835 488 1 6590 10212

Republished 6x3 errors USA too early in location

Now that FamilySearch locations are implemented, I can republish this errors. If you think I could add automatically suggested corrections, let me know. I can work on that.

2359150 Errors Total 0000-0000 0001-1499 1500-1699 1700-1799 1800-1899 1900-1999 2000-9999 Open New
603 USA too early in birth location 221562 4 42698 178860 221562 644
633 USA too early in death location 74152 1088 38 47351 25604 65 6 74135 168
663 USA too early in marriage location 25701 322 1 8740 16624 14 25695 83

Changed 575 - FindAGrave - Different birth date and 578 - FindAGrave - Different death date

Adding 1 month buffer between dates. So only dates that are more then 1 month different gets reported. That decreased the number of errors for one third.

2286753 Errors Total 0000-0000 0001-1499 1500-1699 1700-1799 1800-1899 1900-1999 2000-9999 Open New
575 FindAGrave - Different birth date 30215 55 1684 5559 20063 2853 1 27974 512
578 FindAGrave - Different death date 14133 81 29 1129 3218 8051 1624 1 13267 284

FindAGrave Changes

  • Renamed error 572 to FindAGrave - Linked grave not matching profile
  • Removed private profiles from 572 error. Dates are not precise.
  • According to error report there is 3112 new errors this week. Last week it was 2795 new errors. I think this is due to a lot of changes in FAG links this week.
  • 575.786 profiles have link to FindAGrave. 54.155 of them have no link to actual memorial. The others link to 511.354 memorials on FindAGrave. This week it was 7.761 new memorials linked. Last week it was 5561.
  • Total number of FindAGrave errors is 220606. In last month there were cca 2000 new errors on links to FindAGrave. This week we are around 0. That shows, 5000 were addressed this week and 3000 were new. In time also linking practice will change, so number of new errors each week will decrease.
  • Here are report of all statuses entered last week. According to them, we have 50:50 ratio of errors vs false errors.

Conclusion: The most problematic errors are:

  • 572 - FindAGrave - Linked grave not matching profile. From comments in status and G2G discussion, I think you didn't understand what this error even means. I changed the name of this error. Maybe this will help. I also removed 1600 errors reported on private profiles, since they were not linked due to approximate date. Another possibility to address this problem would be by using a changed FindAGrave template, where you could specify, that one link is for the profile and others are for relatives. That way the correct link to FindAGrave would be matched.
  • 575 - FindAGrave - Different birth date and 578 - FindAGrave - Different death date.

This were problematic due to the number of errors. I see the solution in adding some buffer between dates. Maybe 1 month and report only differences, that are bigger than 1 month. That would decrease the number of errors for one third.

Errors 608 and 638 have automatically suggested changes

Activated change button for US states as they joined USA. It is new so be alert for bugs.

Wrote Help for Status page

Setting status of the error is described on Error status Help page

Previous news

Added suggestions for status

Added suggestions for statuses based on existing entries. Suggestions will be updated based on the number of entries. 10 most common suggestions are shown for each error/status pair.

Added the rest of FindAGrave Errors

Errors for Birth and Death dates and locations are similar to Wikidata errors.

Additionally 572 FindAGrave - Link without matching Grave lists all profiles, that have FAG link but is not matched to profile as same due to too many differences. It is probable that FAG memorial is for another person (usually relative). Try finding correct memorial and if not found, mark this as False error.

There is also 585 FindAGrave - Multiple profiles link to same grave ID, where multiple profiles are linked to same memorial. This profiles are probably duplicates and needs to be merged. If they are not, mark it as false error or correct link to FAG if not correct.

Error Status

I added Error Status that replaces Temp Hide and False Error.

It enables tracking of error status for the duration of error existence. You can set 4 possible statuses.

  • Not corrected Which you don't need to set, since it is default.
  • Corrected (hide until next recheck) If you corrected the error, it can be hidden until next errors recalculation. It error is indeed corrected, it will not reappear and if it wasn't it will reappear after next dump.
  • False error (hide forever) If it is False error, select this option and error will not be displayed. If you set this by mistake, You can add another status Not Corrected and error will reappear.
  • Manager notified (hide for 30 days) If you posted a message to the manager or proposed the merge, you can hide the error for 30 days. Then it will reappear if it was not corrected.

You can also enter your WikitreeID, so you can track your changes. Probably also Top DD list will be created.

You can also enter Comment for the status for other DD to see.

Status list is displayed with each error in all error lists.

Try it out and if something is unclear or doesn't work as expected, let me know.

Errors

Analysis was done on data from April 2nd 2017.

2303695 Errors Total 0000-0000 0001-1499 1500-1699 1700-1799 1800-1899 1900-1999 2000-9999 Open New
101 Birth in future 4 4 1
102 Death in future 73 1 67 5 2 2
103 Death before birth 8419 14 389 1066 4685 2211 54 5951 8
104 Too old 7587 250 1303 2466 3428 136 4 6995 25
105 Duplicate sibling 251 1 26 28 76 120 134 32
106 Duplicates between global tree and unconnected 844 88 270 361 125 662 87
109 Profile should be open (birth date) 12358 17 35 603 11703 117
110 Profile should be open (death date) 650 123 12 32 260 221 2 30
111 Died too young to be parent 456 34 55 48 185 129 5 352
112 Person is Father and mother 621 84 5 5 77 347 103 305 10
113 Duplicate in relatives 1054 254 125 32 106 454 83 506 24
114 Still living status and entered death date 1997 158 7 3 5 475 1344 5 268 35
115 Still living status and entered death location 309 11 4 1 17 153 123 133 11
201 Father is self 87 57 22 8 3
202 Parents are same 52 4 1 24 23 5
203 Father is Female 4557 630 18 152 659 2572 526 3337 17
204 Father has no Gender 493 391 5 80 17 10 10
205 Father is too young or not born 35554 1796 3585 7398 17050 5660 65 29986 157
206 Father is too old 4310 371 1044 1721 1169 5 4029 8
207 Father is also a child 157 69 14 2 6 53 13 49 1
208 Father is also a spouse 29 6 3 1 18 1 7 3
209 Father is also a sibling 1353 261 93 77 103 656 163 779 20
210 Father was dead before birth 35215 1498 2433 6383 10343 13867 691 32464 103
211 Duplicate sibling by father 389 1 81 94 148 65 266 20
212 Profile should be open (Child birth date) 2847 2847 39
301 Mother is self 1 1
303 Mother is Male 4693 529 45 182 745 2730 462 3355 21
304 Mother has no Gender 510 439 57 14 4 4
305 Mother too young or not born 45746 2267 4820 10588 21782 6222 67 39202 206
306 Mother is too old 3544 250 755 1369 1167 3 3268 3
307 Mother is also a child 5 1 3 1 2
308 Mother is also a spouse 269 66 45 7 29 112 10 127 11
309 Mother is also a sibling 127 23 1 2 7 72 22 49 3
310 Mother was dead before birth 39656 1848 1738 7217 12516 15635 701 1 36675 174
311 Duplicate sibling by mother 174 4 48 33 50 39 134 16
312 Profile should be open (Child birth date) 2288 2288 41
402 Unknown gender of spouse 656 468 1 165 22 15 6
403 Single sex marriage 935 218 13 24 32 521 127 333 21
404 Marriage before birth 7148 177 682 1675 3745 868 1 6022 25
405 Married too old 1593 35 187 517 854 1424 9
406 Marriage after death 9603 309 308 1320 2459 4881 326 8523 37
407 Lived too long after marriage 235 7 11 38 73 89 17 160
408 Multiple marriages on same day 3489 329 2 361 947 1752 98 2940 81
409 Marriage to duplicate person 11167 2360 91 936 2448 4971 361 8676 262
410 Marriage in future 4 4
412 Marriage End before marriage 118 4 17 90 7 82 3
413 Marriage too long 1 1
414 Marriage End before birth 109 1 6 6 65 31 58 4
415 Marriage End too old 47 4 20 23 42 2
416 Marriage End after death 3972 69 125 587 924 1992 275 3670 46
417 Lived too long after marriage End 5 1 4 3 3
418 Partner is also a sibling 434 30 33 47 110 205 9 376 3
501 Wrong gender (male) 4256 1054 1 68 518 2093 518 4 2167 68
502 Missing gender (male) 17838 10960 1 2614 4248 15 49 253
503 Probably wrong gender (male) 4959 1120 42 433 599 2046 712 7 3243 61
504 Missing gender (probably male) 8866 6008 1 5 707 2112 33 81 144
505 Wrong gender (female) 4538 964 3 80 330 2644 515 2 2337 65
506 Missing gender (female) 14381 9318 1 5 2400 2647 10 23 166
507 Probably wrong gender (female) 4177 713 19 156 565 2021 702 1 2710 48
508 Missing gender (probably female) 7815 5469 5 3 618 1708 12 111 99
509 Missing gender 36790 27211 18 250 764 4119 4384 44 8059 535
510 Unique name without gender 9727 4027 9 60 211 2672 2700 48 3927 73
511 Unique names (spelling) 263604 47544 7399 12274 20232 100101 74400 1654 160120 1590
551 Wikidata - Missing gender 35 1 34
552 Wikidata - Different gender 5 1 1 3 2
553 Wikidata - Empty birth date 150 150 131 1
554 Wikidata - Imprecise birth date 638 132 189 182 107 28 574 10
555 Wikidata - Different birth date 3776 1758 905 459 495 159 3707 19
556 Wikidata - Empty death date 205 50 63 30 28 11 23 170 1
557 Wikidata - Imprecise death date 885 25 266 230 213 113 38 818 5
558 Wikidata - Different death date 3989 58 2354 703 359 424 91 3943 20
559 Wikidata - Missing birth location 1460 36 249 421 320 358 76 1370 9
561 Wikidata - Missing death location 1925 59 432 424 391 495 124 1837 11
563 Wikidata - Possible duplicate by father 83 9 30 15 14 9 6 75 5
564 Wikidata - Possible father 162 7 80 21 14 27 13 148 1
565 Wikidata - Possible duplicate by mother 101 6 52 25 15 3 101 3
566 Wikidata - Possible mother 134 8 85 23 4 9 5 124 1
568 Wikidata - Unconnected branches to global tree 4707 57 52 182 420 1800 2191 5 3507 46
569 Wikidata - Unconnected orphans to global tree 5507 24 54 120 515 2393 2400 1 4100 17
571 FindAGrave - Link without Grave ID 54155 652 24 694 5622 32828 14332 3 40370 573
572 FindAGrave - Link without matching Grave 26383 2120 46 964 4613 15066 3573 1 23131 305
573 FindAGrave - Empty birth date 793 793 621 12
574 FindAGrave - Imprecise birth date 10230 1 219 1196 7331 1483 9591 148
575 FindAGrave - Different birth date 44828 61 2159 7948 29272 5385 3 41439 721
576 FindAGrave - Empty death date 5325 298 6 89 619 3691 622 4794 84
577 FindAGrave - Imprecise death date 10621 112 4 229 1525 6883 1868 9934 190
578 FindAGrave - Different death date 28436 110 43 1642 5561 17175 3904 1 26431 506
579 FindAGrave - Missing birth location 14986 343 4 221 1953 8957 3508 13770 201
581 FindAGrave - Missing death location 19404 314 10 268 2155 12199 4458 17773 281
585 FindAGrave - Multiple profiles link to same grave ID 1714 25 121 499 852 217 1564 90
586 FindAGrave - Link to merged Grave ID 617 6 119 359 133 517
587 FindAGrave - Link to nonexisting Grave ID 977 21 1 70 196 500 188 1 804 1
601 Wrong word in birth location 6703 414 47 836 4215 1191 5018 111
604 Birth location too short 7023 855 39 805 1008 3637 679 3931 43
605 Number in birth location 487 312 1 152 22 24 9
607 Misspelled word in birth location 7521 134 136 1079 2145 3537 490 6270 96
608 Misspelled country in birth location 15552 294 139 608 2542 10323 1646 12448 138
609 Wrong character in birth location 3925 186 2 482 1566 1620 69 3343 2
610 Birth location in uppercase 24700 197 551 1222 3872 15774 3084 21853 65
611 Birth location in lowercase 37872 942 206 916 6855 22785 6166 2 27566 68
631 Wrong word in death location 17767 347 15 1991 12640 2774 15003 153
632 Y death location 773 14 6 738 15 85 3
634 Death location too short 7930 639 129 673 1217 4368 904 5074 43
635 Number in death location 313 66 3 4 198 42 19 5
636 Bogus death location 307 4 1 1 146 154 1 282 1
637 Misspelled word in death location 1273 27 213 139 708 186 612 45
638 Misspelled country in death location 7345 284 67 293 517 4743 1441 5465 123
639 Wrong character in death location 2635 228 4 588 1031 706 78 2400
640 Death location in uppercase 15529 366 193 539 2233 9648 2550 13470 50
641 Death location in lowercase 23169 699 96 303 3445 13609 5015 2 16535 54
661 Wrong word in marriage location 1304 37 1 9 250 765 242 1033 6
664 Marriage location too short 2336 172 3 289 506 1243 123 1713 13
665 Number in marriage location 96 49 2 35 10 8 2
667 Misspelled word in marriage location 182 3 5 8 157 9 20 15
668 Misspelled country in marriage location 508 28 3 7 28 390 52 92 30
669 Wrong character in marriage location 808 41 270 369 126 2 773
670 Marriage location in uppercase 4845 112 135 220 834 3064 480 4382 17
671 Marriage location in lowercase 7131 231 29 247 1581 4375 668 5164 21
711 Separators in Prefix 900 76 9 12 15 402 384 2 330 6
712 Number in Prefix 867 40 9 3 648 167 627 2
713 Suffix in Prefix 2284 258 37 35 69 791 1090 4 913 35
714 Wrong word in Prefix 535 100 1 3 5 157 246 23 78 4
721 Separators in First Name 53121 11886 243 1953 4337 28619 6083 33644 130
722 Number in First Name 73 68 5
723 Prefix in First Name 22043 5524 2040 2326 3467 7043 1641 2 17100 42
724 Wrong word in First Name 7690 1928 76 377 756 2989 1563 1 5439 24
725 Wrong character in First Name 402 150 5 83 101 48 15 335
731 Separators in Preferred Name 16820 3292 76 77 120 1891 11279 85 820 10
732 Number in Preferred Name 28 22 6
733 Prefix in Preferred Name 7154 1549 209 192 237 782 4119 66 1374 32
734 Wrong word in Preferred Name 1809 531 3 15 15 120 1107 18 148 7
735 Wrong character in Preferred Name 690 183 5 91 102 54 251 4 346
741 Separators in Middle Name 1348 92 1 57 94 831 273 992 10
742 Number in Middle Name 8 2 6
743 Prefix in Middle Name 3039 145 36 52 277 1947 582 2635 26
744 Wrong word in Middle Name 882 112 3 22 132 470 142 1 750 12
745 Wrong character in Middle Name 7 2 2 1 1 1 6
751 Separators in Nicknames 2816 126 327 302 345 1323 392 1 2501 49
752 Number in Nicknames 62 2 1 57 2 14
753 Prefix in Nicknames 3064 235 682 470 518 756 403 2852 26
754 Wrong word in Nicknames 1042 81 57 69 101 537 197 900 13
761 Separators in Suffix 7124 1147 164 731 1377 2316 1383 6 4737 15
762 Number in Suffix 1539 287 145 429 476 201 1 1127 17
763 Prefix in Suffix 8691 927 147 362 892 2016 4326 21 3087 37
764 Wrong word in Suffix 1219 79 66 127 496 444 7 569 5
765 Wrong character in Suffix 30 3 1 25 1 2
771 Separators in Last Name at Birth 5101 1021 74 713 1189 1417 680 7 3216 8
772 Number in Last Name at Birth 11 11
773 Prefix in Last Name at Birth 10 3 1 6 1
774 Wrong word in Last Name at Birth 10 4 5 1 10
775 Wrong character in Last Name at Birth 1967 410 16 326 516 369 328 2 1399 3
781 Separators in Current Last Name 3050 667 56 223 358 938 808 1692 16
783 Prefix in Current Last Name 2 2
784 Wrong word in Current Last Name 205 27 30 77 38 17 16 194
785 Wrong character in Current Last Name 1561 376 19 130 386 330 318 2 1001 4
791 Separators in Last Name Other 3829 137 191 452 597 1218 1233 1 2355 36
792 Number in Last Name Other 12 4 1 7 1
794 Wrong word in Last Name Other 471 68 20 73 65 119 124 2 294 1
795 Wrong character in Last Name Other 13 12 1
801 Big profile 298 3 52 136 107 222 1
802 Empty profile 85595 36980 1031 1621 5579 29019 11291 74 44884 254
803 Almost empty profile 26820 5423 747 1214 3679 10821 4898 38 19570 378
811 Uncleaned profile after merge 194731 13144 6887 25549 40320 80025 28667 139 142179 1065
821 Headings starts with blank 248 13 1 5 4 128 97 30 30
822 Heading doesn't end with = 11477 116 40 1157 2017 6175 1967 5 9506 185
823 Heading doesn't start with = 9637 519 96 438 1100 6003 1477 4 7603 52
824 Heading different number of = 2478 44 11 233 466 1302 421 1 1908 51
825 Use separator line ---- 1523 9 9 65 319 1047 74 1229 20
831 Multiple duplicated lines 36196 608 230 1452 3047 25734 5125 26752 68
835 Local file reference 59208 4430 86 1310 6091 33422 13827 42 39591 469
841 Template doesn't start with double { 86 5 2 2 49 28 18 6
842 Template doesn't end with double } 39 1 4 4 12 18 16 11
843 Missing template (spelling) 2103 62 23 127 458 929 504 1338 98
844 Out of use template 78 7 71
901 Unconnected empty public profiles 28883 28883 11
902 Unconnected empty open profiles 18817 18817 18817 88
912 Swedish patronym SSON for female 577 2 86 489 471 7

Changes since previous update

Detailed statistics are available on Wikitree+ in Statistics section.







Collaboration

On 6 Apr 2017 at 06:53 GMT Eva Ekeblad wrote:

Still slowly working on error 912; mainly with the one PM who had over 300 of these to start with. Can usually only do one correction a day; there's a very active spam filter somewhere out there.

On 5 Apr 2017 at 09:29 GMT Esmé (Pieterse) van der Westhuizen wrote:

Working on Error 311 - 1700 to 1999

On 5 Apr 2017 at 06:37 GMT Esmé (Pieterse) van der Westhuizen wrote:

Error 211 - 1700 to 1999 - DONE - Proposed all new clear duplicates

On 5 Apr 2017 at 01:54 GMT Emma (McBeth) MacBeath M.Ed MSM wrote:

Slogging (is that a word?) away at error 721 1700-1799

On 4 Apr 2017 at 07:46 GMT Esmé (Pieterse) van der Westhuizen wrote:

Error 106 - 1700 to 1999 - DONE - Proposed all new clear duplicates

On 4 Apr 2017 at 07:26 GMT Paula (Round) Dea wrote:

667 & 665 new done

On 4 Apr 2017 at 05:35 GMT Esmé (Pieterse) van der Westhuizen wrote:

Error 105 - 1700 to 1999 - DONE - Proposed all new duplicates