I have been tracking several statistics that approximately represent the quality of the Wikitree database. The following is a summary of the current figures:
Overall status: 17.5 M total profiles; 13.9 M or 80% are connected; 4.0 M or 23% have DNA links (from Wikitree info).
Profiles with known internal consistency issues: 154,000 or 0.9% of all profiles (based on Suggestions report data).
Sourcing: about 11% with 3 or more original sources, 27% with 1-2 sources, 15% poorly sourced, 33% unsourced, and 14% Unavailable (Unlisted/Red/Orange privacy) (based on random sampling).
Identified Duplicates: about 11,700 (based on Suggestions report data).
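For anyone who wants to check the arithmetic, here is a minimal Python sketch that recomputes the percentages from the rounded figures quoted above. The variable names and the `pct` helper are my own for illustration and are not part of any Wikitree tool or report. Because the inputs are themselves rounded, the recomputed values can differ from the quoted ones by a point or so.

```python
# Rounded figures as quoted in the summary above.
total_profiles = 17.5e6
connected = 13.9e6
dna_linked = 4.0e6
consistency_issues = 154_000


def pct(part, whole):
    """Return part as a percentage of whole."""
    return 100 * part / whole


connected_pct = pct(connected, total_profiles)
dna_pct = pct(dna_linked, total_profiles)
issues_pct = pct(consistency_issues, total_profiles)

# Sourcing shares (percent of profiles) from the random sample.
sourcing = {
    "3 or more original sources": 11,
    "1-2 sources": 27,
    "poorly sourced": 15,
    "unsourced": 33,
    "unavailable (privacy)": 14,
}
# The five categories should account for every sampled profile.
sourcing_total = sum(sourcing.values())

print(f"Connected: {connected_pct:.1f}%")
print(f"DNA links: {dna_pct:.1f}%")
print(f"Consistency issues: {issues_pct:.2f}%")
print(f"Sourcing categories sum to {sourcing_total}%")
```

The sourcing check is just a sanity test that the five categories cover the whole sample with no overlap.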
In general, these statistics have either improved modestly since I last reported them last November or are largely unchanged, even though about 1.5 M profiles have been added since then. Of particular note, the number of profiles with known consistency errors has dropped from 180,000 last October to 154,000 now, a decline of about 14%.
A Free Space page with graphs, historical data, and the technical details has been created; see https://www.wikitree.com/wiki/Space:Wikitree_Statistics