Wikidata talk:WikiProject Taxonomy/Archive/2015/02

This page is an archive. Please do not modify it. Use the current page, even to continue an old discussion.

Count of taxon name (P225) and parent taxon (P171)

 

We have now more than 500,000 × parent taxon (P171) but still a lot of work to do. --Succu (talk) 16:00, 20 March 2014 (UTC)

Yes, this is less than a third? - Brya (talk) 19:22, 20 March 2014 (UTC)

It is noteworthy that the curve of P225 appears to be flattening, stabilizing a little below two million. This would be within the expected range for an endpoint. - Brya (talk) 05:51, 21 March 2014 (UTC)

250,000 more items are now connected via parent taxon (P171). --Succu (talk) 14:03, 18 May 2014 (UTC)

A 50% increase over the last report, and approaching the halfway mark! P225 appears indeed to have stabilized. - Brya (talk) 05:47, 20 May 2014 (UTC)

I updated the diagram. --Succu (talk) 08:27, 29 June 2014 (UTC)

Interesting, P225 is rising again, while the climb of P171 is flattening. Still not reached the halfway point for P171. - Brya (talk) 10:14, 29 June 2014 (UTC)

A little milestone: parent taxon (P171) is now used more than 1,000,000 times. --Succu (talk) 07:19, 17 August 2014 (UTC)

Ah, past the halfway mark now. P225 is flattening again, and P171 is catching up. - Brya (talk) 10:34, 17 August 2014 (UTC)
I've built some statistics about ranks and their problems: User:Infovarius/taxonomy. There you can see which taxa have no parent taxon (P171) (almost half of species, evidently) and their distribution. --Infovarius (talk) 10:51, 18 August 2014 (UTC)
Thats not surprising. Yesterday I made a rough check on Osteichthyes (Q27207). More than 400 genera are still missing. That means we have not identfied them or - more probably - the items have to be created. --Succu (talk)
Looks like a great tool. The results are somewhat depressing, but not really surprising. - Brya (talk) 16:39, 18 August 2014 (UTC)
@Infovarius: it would be great if you could update this table on a regular base. --Succu (talk) 15:25, 20 August 2014 (UTC)
It is done manually, so I can update by query but not too frequent. --Infovarius (talk) 15:34, 20 August 2014 (UTC)
@Infovarius: automating this seems not a big deal to me. Do you wanna try it? --Succu (talk) 18:45, 20 August 2014 (UTC)
@Succu:, I've done full update. If you want more regular, you can do automating (it's not on my list for now). --Infovarius (talk) 19:18, 11 September 2014 (UTC)
@Infovarius: These are your private tables. So it's ok with me if you update them at your will. --Succu (talk) 22:00, 11 September 2014 (UTC)

Another little milestone: parent taxon (P171) is now used more than 1,500,000 times. --Succu (talk) 08:07, 3 October 2014 (UTC)

A 50% increase over the last report! Now at more than 75% coverage. Looking good! - Brya (talk) 10:41, 3 October 2014 (UTC)

The gap between taxon name (P225) and parent taxon (P171) dropped below 250,000. --Succu (talk) 07:54, 22 November 2014 (UTC)

So less than 15% remaining? That is good progress. - Brya (talk) 18:27, 22 November 2014 (UTC)

The count of taxon name (P225) raised above 1,900,000. The gap between taxon name (P225) and parent taxon (P171) dropped below 100,000. --Succu (talk) 10:12, 7 December 2014 (UTC)

Coverage at 95%? Nearly there! - Brya (talk) 11:59, 7 December 2014 (UTC)

Update. Not much progress. Around 80,000 items left. --Succu (talk) 18:54, 8 February 2015 (UTC)

Maybe closing the gap would happen sooner if you stopped adding new P225? I keep seeing more and more viruses added. - Brya (talk) 06:00, 9 February 2015 (UTC)
That's not the problem. All my additions of virus species had all four properties and references. The main problem is that Wikidata Query (WDQ) gives unreliable (outdated) results for weeks. There are many genus items to create for nlwiki-species. Or around 4000 zhwiki plants that are missing their parent. I started to work with these. But I cannot control the work with WDQ. --Succu (talk) 10:35, 9 February 2015 (UTC)
Isn't there some constraint that will turn up these? BTW those 4000 zhwiki plant sound scary: who knows how many of these are placed in non-existent genera ... - Brya (talk) 12:02, 9 February 2015 (UTC)
A strange orchid genus I found is Triopidia. Not known by Tropicos and IPNI. --Succu (talk) 16:26, 9 February 2015 (UTC)
No doubt a misspelling of Tropidia. - Brya (talk) 17:34, 9 February 2015 (UTC)

Statistics

Since the beginning of this year I'm running some statistics based on the weekly json dumps. Maybe you find them useful. --Succu (talk) 07:32, 10 February 2015 (UTC)

Interesting. Also somewhat depressing in that some properties have not been used much, even something as important as "taxon synonym". - Brya (talk)

said to be the same as (P460)

There is a problem with the use of said to be the same as (P460). On the Talk page of the property there is Template:Constraint:Symmetric. Either this constraint should be removed or we should use a different property. I tried to find an existing property, like "use instead", "see instead", "belongs to", "is probably a duplicate of", "is correctly", but the closest I can find is "part of" which is not very close. - Brya (talk) 05:34, 28 January 2015 (UTC)

I resolved most items with instance of (P31)=Wikimedia duplicated page (Q17362920) and said to be the same as (P460) using said to be the same as (P460) as a qualifier for instance of (P31)=Wikimedia duplicated page (Q17362920). What other cases are modeled with said to be the same as (P460)? --Succu (talk) 19:57, 29 January 2015 (UTC)
For example the case directly above Rosa eglanteria L. (1753) (Q161146). I am now thinking it would go a long way to have a property "is synonym of" (the reciprocal of "taxon synonym"). - Brya (talk) 11:35, 30 January 2015 (UTC)

I made a list with items having taxon name (P225) and said to be the same as (P460) from the last json dump. Maybe that helps to identify and fix further cases --Succu (talk) 12:53, 3 February 2015 (UTC)

List

I went through them as best I could, and moved all the "said to be the same" from properties to qualifiers. I don't know if I caught them all (it is a pretty complex web of links). Still, I can't help feeling that it would be better to have a separate property like "is a synonym of", "now correctly", etc. - Brya (talk) 06:44, 12 February 2015 (UTC)

Duplicate IPNI and other

Hello, some taxon-related properties has violations now. Earlier its were clear. Your project assistance will be good to restore previous quality level. Hourly-updated violations report: Wikidata:Database reports/Constraint violations/Mandatory constraints/Violations. — Ivan A. Krestinin (talk) 08:42, 16 February 2015 (UTC)

Be patient, Ivan. I'll fix them later this week. --Succu (talk) 08:51, 16 February 2015 (UTC)
Whatever it is, it seems not to be limited to IPNI, but also includes Tropicos and The Plant List. - Brya (talk) 11:58, 16 February 2015 (UTC)
Return to the project page "WikiProject Taxonomy/Archive/2015/02".