Thank you again for your patience with me. I agree there are few words - I am NOT good with statistics; very sorry - it was a "half joke" in what I said.
Yes, ALL profiles where the birth year is 1816 or earlier MUST BE OPEN (not "Public - Green circle" but Open "black unlocked"). So those thousand should be identified and hopefully eliminated - many may be birth year Typos....
For the parents, the birth year should be BEFORE 1775 to allow a 40 year span for a mother to have had a child born no later than 1815. Few women had children after 40 years old in those times (maybe 1% - we can ignore them).
Do not worry about marriage data - it's not so important.
There will always be some profiles that "slip through the cracks" in any statistical program - with 5 million profiles and more adding to them daily. BUT if we can catch even 500 of the 1000 with this tool, it is worth it in my opinion.
Thank you again, sincerely, for your hard work on this. I have now eliminated all the found errors in my first 6 generations (although your programs keeps finding new categories - but that is GOOD) and I am working on the rest to 10 generations.
QUESTION: What about the Geographic words in Parentheses that I asked about ?? example (USA) or (Canada) or even (Germany) ?? Can they be excluded from the "Error" category so people can use them to indicate places that will BECOME THAT NATION LATER IN TIME ??? This would be very helpful so anyone looking at old profiles gets the sense of where they took place - few people nowadays know "Silesia" or "Bohemia" but many more know (Germany) or (Czech) etc. etc.
Again, thank you for this dialogue.
Chet Snow - A WikiTree Leader