Number of profiles at N degrees from a profile P0

+9 votes
343 views

Quick question to our software dream team. smiley

Would it be possible to compute the size of a "circle" (or, more exactly "sphere"), meaning the number of profiles at distance N (measured in degrees) of a given "center" profile P0?

I'm sure it's possible, at least in batch, if not on demand, since we have already those distances computed every night. But is it easy and not too costly in resources?

I have an ongoing work for which the distribution of such numbers for a given "center" would be of heavy interest. I will present this work in G2G when it's a bit more advanced.

Thanks for your attention.

[edited : added link to Jean-Joseph Vatant profile, of which "circles" are under investigation at http://goaf.fr/jjv/cercles.html (in French so far)]

WikiTree profile: Jean-Joseph Vatant
in WikiTree Tech by Bernard Vatant G2G6 Mach 5 (58.4k points)
retagged by Bernard Vatant
I'm not sure it's too early. I'm also just beginning, so to speak, the systematic population of first circles, and it's interesting to see the state of affairs at the beginning. I've seen that Olof seems to stand at distances from the bulk of WikiTree similar to the ones of Jean-Joseph. My hunch is that the global distribution would be quite similar for our two ancestors beyond circle 20 or so. Only the local distributions would reflect the state of our respective work on our "local families", and I would bet they are not very different.
It sounds very likely that Olof an Jean-Joseph are at similar distances from the mainstream, yes.

If this is your interest it's OK with me if you ask Ales for the full set (as below) for Andersson-5056 even if t's a bit beyond my current focus - I think what I can get on Tuesday mornings (or so) from WikiTree+ will suffice for quite some time.

Hello, Bernard.  I know you have said you will present your work in G2G at a later time.  But, because you have already discussed it here, I wonder if you would be interested in setting up a free-space page about the project.  The purpose I'm envisioning would be to create a more permanent WikiTree location for discussion of the project, rather than a G2G discussion which will quickly fade from view.  It could be linked to this discussion, etc.  I'm thinking it could include an "FAQ" (frequently asked questions), so that if people have questions about your work, you only have to answer them once.  I'd be happy to help set up and maintain such a page, as long as you will give me a little help, such as explaining what "fly paws" (pattes de mouche) have to do with anything.  smiley

Julie, I've been reluctant so far to create any free-space page, but it would be a good idea indeed insofar as some people here seem interested. This page should be in English, and linked to my personal page which will stay in French.

I indeed somehow remember to have written "pattes de mouche" somewhere in this forum, but where and when?

https://en.wiktionary.org/wiki/pattes_de_mouche laugh

It wasn't in the forum, it was in your Circles page.  Paragraph 9, about the birth of Jean-Joseph.  I could not find a good translation, but did wonder if it was similar to the English expression "chicken scratch," meaning hard-to-read handwriting.  You have confirmed that.  smiley

Edited to correct typo.

Oh well. No bad memories. Supposed to be a precursor of some old age disease, but can't remember which one at the moment.

Imagine how I feel!  smiley  Having to try and understand French, mathematics, and WikiTree apps all at the same time!

A freespace page sounds like a good idea.

And in Swedish it's "kråkfötter" - crow's feet.
Wow, that "Cercles" page is impressive. Among many things, the description of WikiTree is excellent (for the French readers, who most likely never heard of it) and your writing style very entertaining. And I enjoyed the maths talk.

Thx Isabelle. So I have at least one French reader wink

3 Answers

+3 votes
 
Best answer
You can see all the profiles including sizes of each generation on WikiTree+ Here are the first 20 ones.

https://wikitree.sdms.si/default.htm?report=srch2&WikiTreeID=Vatant-5&Generations=20

Is that what you were asking for?
by Aleš Trtnik G2G6 Pilot (568k points)
selected by Bernard Vatant
To be honest, it takes 7 seconds to compute and store all distances for each profile to the disk. Just checking 208 million relations takes 3 seconds.

I have to brag a little, since I wrote such a fast program.

yes I must admit it's impressive. But I'm not surprised you could do it. And I will not hesitate to ask for 7 seconds once in a while. Let's say, every time a new million milestone is passed by the Single Tree. 21 million soon, BTW. This is every four months or so. Are you OK with that deal?

sure. Just send me the an email with link.
Just let me confirm my understanding: the WikiTree connection finder counts siblings as one step apart, whereas in terms of WikiTree+ generations siblings are in the second generation (two steps from the target person = going through a parent). Do I have it right?

That's the way I get it also, Eva. Of course it would be better if the two apps were consistent.frown

Yes. And the other difference is that CF is using also private profiles/connections.
Ales, since you are here, did you notice our other exchange with Eva above? She would be (and I would also) interested to compare the distribution of Jean-Joseph's circles with the ones of her ancestor [[Andersson-5056|Olof  Andersson (1793-1860)]]. Could you run your magic query on this profile too?
I suspect my Olof Andersson would give a very similar result to Jean-Joseph Vatant on the "big Aleš" test, since they are both from the outskirts of the Global Tree.

I wonder what it would look like for someone more central.
Results for /tree.json?userid1=12412011&debug=1

"0:1:1:0",

"1:21:22:12",

"2:44:66:449",

"3:86:152:1114",

"4:147:299:2198",

"5:258:557:3771",

"6:447:1004:6714",

"7:585:1589:12555",

"8:847:2436:18686",

"9:1302:3738:28176",

"10:1568:5306:42735",

"11:2174:7480:59588",

"12:2338:9818:81320",

"13:2988:12806:101929",

"14:3465:16271:129269",

"15:3500:19771:161448",

"16:3996:23767:195599",

"17:5303:29070:234105",

"18:7082:36152:289305",

"19:10924:47076:365136",

"20:18794:65870:491006",

"21:36923:102793:724149",

"22:78016:180809:1213771",

"23:171587:352396:2302928",

"24:348179:700575:4787465",

"25:595126:1295701:9766527",

"26:905981:2201682:17855272",

"27:1287646:3489328:29529626",

"28:1659003:5148331:45271595",

"29:1917002:7065333:64320039",

"30:2003687:9069020:84929564",

"31:1913671:10982691:104997592",

"32:1703069:12685760:122906902",

"33:1467013:14152773:138021281",

"34:1240659:15393432:150831833",

"35:1040351:16433783:161720025",

"36:863119:17296902:170926902",

"37:707666:18004568:178600550",

"38:577623:18582191:184864623",

"39:460841:19043032:189924047",

"40:363905:19406937:193923291",

"41:282549:19689486:197030059",

"42:219287:19908773:199392132",

"43:171888:20080661:201202169",

"44:134965:20215626:202616536",

"45:106291:20321917:203720804",

"46:84412:20406329:204590467",

"47:69814:20476143:205277457",

"48:57759:20533902:205848188",

"49:47889:20581791:206317882",

"50:39974:20621765:206699761",

"51:31867:20653632:207023010",

"52:26552:20680184:207267926",

"53:21071:20701255:207474759",

"54:17474:20718729:207634699",

"55:14554:20733283:207767634",

"56:12039:20745322:207879086",

"57:10153:20755475:207970323",

"58:9064:20764539:208048307",

"59:7758:20772297:208116738",

"60:7613:20779910:208176231",

"61:6930:20786840:208242313",

"62:5085:20791925:208299922",

"63:4004:20795929:208337194",

"64:3029:20798958:208364157",

"65:2197:20801155:208385734",

"66:1947:20803102:208401378",

"67:1777:20804879:208418120",

"68:1913:20806792:208433853",

"69:1593:20808385:208449452",

"70:1391:20809776:208460120",

"71:1465:20811241:208470873",

"72:1193:20812434:208482140",

"73:1098:20813532:208490606",

"74:862:20814394:208499175",

"75:769:20815163:208504087",

"76:664:20815827:208508773",

"77:447:20816274:208511953",

"78:344:20816618:208513847",

"79:297:20816915:208515193",

"80:304:20817219:208516504",

"81:156:20817375:208517779",

"82:118:20817493:208518266",

"83:97:20817590:208518785",

"84:64:20817654:208519132",

"85:55:20817709:208519370",

"86:31:20817740:208519529",

"87:37:20817777:208519698",

"88:57:20817834:208519958",

"89:41:20817875:208520237",

"90:52:20817927:208520438",

"91:63:20817990:208520674",

"92:76:20818066:208521091",

"93:109:20818175:208521560",

"94:101:20818276:208522300",

"95:59:20818335:208522620",

"96:46:20818381:208522819",

"97:38:20818419:208523085",

"98:17:20818436:208523319",

"99:20:20818456:208523431",

"100:21:20818477:208523525",

"101:12:20818489:208523673",

"102:5:20818494:208523710",

"103:3:20818497:208523718",

"104:1:20818498:208523722",

"105:1:20818499:208523724",

"106:1:20818500:208523726",

"107:1:20818501:208523728",

"108:1:20818502:208523730",

"109:1:20818503:208523732",

yes more numbers to crunch ...

First analysis :

- At large scale, distributions are pretty much similar, with a median / peak value at d=30 +/- 1

- The Olof distribution is a bit sharper than Jean-Joseph, passing 10% at d=23 vs d=27 and 90% at d=38 vs d=40

- The first circles have a more regular growth for Olof.

Those are basically details. The global geometry is really similar, without surprise.

+4 votes
This question sank very rapidly in the flow. I think your tags did not draw the attention of the people who could answer. I'm not sure I know better tags - I do believe "aleš" and "jamie" exist as tags. Just not sure if it is too "attention-seeking" to use them here.
by Eva Ekeblad G2G6 Pilot (433k points)
Thanks Eva. Difficult indeed to know who to attract.and how ...
+5 votes

You can get the number of ancestors/ relatives via the API.

https://www.wikitree.com/wiki/Help:API_Documentation#getAncestors

If you know which you want to include in your circle, you could repeat the call for N-1, thus fill your circle.

But I reckon this will be intensive if you want to do this continuously. Then it might be better to use the data dumps and do the calculation on your own dataset.

I would suggest to add the tag 'wt_apps' to your post, or maybe even join the group.

by Michel Vorenhout G2G6 Pilot (223k points)
Thanks Michel. I'm not a techie myself (not a coder, that is), so I would not be very useful in the group I'm afraid beyond asking questions. Since you read French also (I guess) see the link in the above answer to Isabelle to figure what I am about.

If you have specific profiles you are interested in, you could download the Gedcom with them as the starting point and import it into Gramps. There you have the option to plot the circle (and more). There it is called a Fan Chart. I promise: low tech wink

But your question is assuming that you want this on the fly which will require some coding.

Thx Michel. I don't want to mingle with GEDCOM, one of the ugliest data format I've ever seen in about 15 years I'd been involved in data migration. sad Actually since I'm retired I'd sworn to myself to crunch no data no more. I'll have a look to Gramps that I don't know. 

I don't want to have those figures on the fly. An idea of the distribution at some point in time would be enough. Refresh those figures once a month, or even once a year, would be OK. More to come ...

Oh, here we welcome retired people as they have loads of time wink. I can't agree more on Gedcom... but there is no universal alternative. In my country we have one for documents, but that needs updating as well. API is the way to go really.

I forgot to check the current apps page, but there is a fan chart one: https://www.wikitree.com/wiki/Help:Apps

by Greg. https://apps.wikitree.com/apps/clarke11007/fan.php

Maybe that is, for now, what you would want?

Actually the "circles" defined by the distance in degrees include ancestors, descendants and collateral extensions, following the four directions of connection: parent, child, sibling and spouse. I don't think the fan charts expand in those four directions.
No, it doesn't. It would make it a bit hard to visualize.. but also interesting. I am wondering if the Topola Viewer could be expanded to do this in a visual way.

Maybe one of the active developers might be interested?

Try the Family Circles visualization within WikiTree+:

https://wikitree.sdms.si/default.htm?report=disp&WikiTreeID=Crawford-7109&Generations=5&OnlyBlood=0

Not exactly what you're saying, but similar

Jonathan, that sounds interesting, but the page does not seem to load in my browser (Chrome)
@Michel, Julie and Jonathan : to be clear I don't care about any kind of fancy visualization (none of them scales beyond 100 nodes). Just a flat distribution table (distance, number of profiles) would make me more than happy.
Ok, try the Family List page in the Genealogy Research section on the profile (Family Tree & Tools, click on Genealogy Research, scroll down to Family List)

You can select how many generations, include ancestors/descendants/both, include siblings.

Not perfect, but closer.

https://www.wikitree.com/index.php?title=Special:FamilyList&p=8822218

Indeed, this is useful, but expansion of circles goes also through spouses and their families. That's how connection distance (degrees) in WT is computed.

For example, the family list for Jean-Joseph on 3 generations including ancestors, descendants and their siblings as of today, counts 192 profiles. In my local spreadsheet adding expansions through spouses, I count 354. And this is only the 3rd circle ...

@Jonathan,

that is new to me, and sounds very interesting (but is also not working for me on FF latest)

Related questions

+12 votes
5 answers
+17 votes
5 answers
452 views asked Jun 29 in The Tree House by Aleš Trtnik G2G6 Pilot (568k points)
+12 votes
3 answers
+14 votes
4 answers
+21 votes
8 answers
+8 votes
2 answers
180 views asked Dec 29, 2020 in The Tree House by Kelsey Jackson Williams G2G6 Mach 1 (15.2k points)

WikiTree  ~  About  ~  Help Help  ~  Search Person Search  ~  Surname:

disclaimer - terms - copyright

...