I have been tracking several statistics that approximately represent the quality of the WikiTree database. The following is a summary as of Nov 2019:
Overall status: 21.8 M total profiles; 17.9 M or 82% are connected; 6.0 M or 27% have DNA links (from WikiTree info).
Profiles with known internal consistency issues: 113,000 or 0.5% of all profiles (based on Suggestions report data).
Sourcing: about 12% with 3 or more sources, 35% with 1-2 sources, 15% poorly sourced, 25% unsourced, and 13% Unavailable (Unlisted/Red/Orange privacy) (based on random sampling).
Duplicates: about 1-9% (based on WikiTree Match suggestions and random sampling).
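For anyone who wants to check the arithmetic behind these percentages, here is a minimal Python sketch that recomputes them from the rounded figures quoted above. The variable names are my own for illustration, and because the inputs are rounded the results can differ from the quoted percentages by a fraction of a point.

```python
# Rounded figures as quoted in the summary above (Nov 2019).
TOTAL_PROFILES = 21.8e6
CONNECTED = 17.9e6
DNA_LINKED = 6.0e6
CONSISTENCY_ISSUES = 113_000

# Sourcing breakdown in percent, from the random-sampling estimate.
SOURCING = {
    "3 or more sources": 12,
    "1-2 sources": 35,
    "poorly sourced": 15,
    "unsourced": 25,
    "unavailable": 13,
}


def share(part, whole):
    """Return part / whole as a percentage."""
    return 100.0 * part / whole


connected_pct = share(CONNECTED, TOTAL_PROFILES)
dna_pct = share(DNA_LINKED, TOTAL_PROFILES)
issues_pct = share(CONSISTENCY_ISSUES, TOTAL_PROFILES)

# "1 or more sources" is the sum of the first two sourcing categories.
sourced_pct = SOURCING["3 or more sources"] + SOURCING["1-2 sources"]

print(f"connected:          {connected_pct:.1f}%")
print(f"DNA links:          {dna_pct:.1f}%")
print(f"consistency issues: {issues_pct:.2f}%")
print(f"1 or more sources:  {sourced_pct}%")
print(f"sourcing total:     {sum(SOURCING.values())}%")
```

From the rounded inputs the DNA share comes out a little above 27%, so the quoted figure presumably reflects the unrounded counts.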
Compared with June 2019, when I last reported on these statistics, there are 1.2 M more profiles. The number of profiles with known consistency errors has dropped from 117,000 in June to 113,000 now. The fraction of profiles with 1 or more sources is about 47%, about the same as in June within the accuracy of this estimate. The estimate of duplicates has not been updated since Jan 2019.
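To give a feel for what "within the accuracy of this estimate" can mean for a figure based on random sampling, here is a rough Python sketch of the usual normal-approximation margin of error for a sampled proportion. The sample size used here is purely hypothetical (the sample size is not stated in this post), and the 95% level and the function name `margin_of_error` are likewise assumptions for illustration, not a description of how the estimate was actually produced.

```python
import math


def margin_of_error(p, n, z=1.96):
    """Normal-approximation margin of error for a sampled proportion.

    p: observed proportion (0..1), n: sample size,
    z: critical value (1.96 corresponds to roughly 95% confidence).
    """
    if n <= 0:
        raise ValueError("sample size must be positive")
    if not 0.0 <= p <= 1.0:
        raise ValueError("proportion must be between 0 and 1")
    return z * math.sqrt(p * (1.0 - p) / n)


# HYPOTHETICAL sample size, chosen only for illustration.
SAMPLE_SIZE = 400

# Fraction of profiles with 1 or more sources, as quoted above.
p_sourced = 0.47

moe = margin_of_error(p_sourced, SAMPLE_SIZE)
print(f"{100 * p_sourced:.0f}% +/- {100 * moe:.1f} points (n={SAMPLE_SIZE})")
```

Under that assumed sample size the margin is roughly 5 percentage points, so a change of a point or two between reports would not be distinguishable from sampling noise. Quadrupling the sample size halves the margin.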
A Free Space page with graphs, historical data and technical details is available here:
https://www.wikitree.com/wiki/Space:Wikitree_Statistics