
Previous discussion was archived at User talk:Strainu/Archive 1 on 2019-06-08.

VladaC-osm (talkcontribs)

Dear Strainu,

You added a number of Natura 2000 protected areas recently that are duplicates of existing entries. Example:

Q28915015

Q128415092

I merged some of them, but I did not succeed in all cases because you also created Wikipedia pages that, in at least one case, duplicate an existing page (rather than creating a Romanian-language version of the existing Wikipedia page). See:

https://cs.wikipedia.org/wiki/Rašeliniště_Jizerky

https://ro.wikipedia.org/wiki/Rašeliniště_Jizerky

I am an OSM mapper with very little experience in Wikidata and Wikipedia, so I don't know if it is acceptable here to import data with so many duplicates. Also, I am unable to resolve all the cases. Could you explain and/or take care of the problems?

Thanks and best regards,

VladaC

Strainu (talkcontribs)

@VladaC-osm the main question is: how sure are you that these are duplicates? There are different levels of protection (national, Birds Directive, Habitats Directive) which don't always overlap perfectly, and the quality of the data varies significantly. For Úpor - Černínovsko it seems the two items are indeed the same. On the other hand, Rašeliniště Jizery (Q12048962) already has different from (P1889) statements pointing to the two items that have interwiki links to cs (Rašeliniště Jizerky (Q8547442)) and ro (Rašeliniště Jizerky (Q28915341)), indicating that having several items is intended. Indeed, the Site Standard Data Form shows that the Natura 2000 site overlaps two other national-level elements also named Rašeliniště Jizerky. Also, the Common Database on Designated Areas ID (P4762) value for Rašeliniště Jizerky (Q8547442) is specific to national protection, not European.

All I can say for sure is that when I created the items there was no item with Natura 2000 site ID (P3425) set to that value. If you can provide a way to detect such items as duplicates, I can work on merging them. For example, if there is a public database that links ÚSOP code (P677) to Common Database on Designated Areas ID (P4762) or Natura 2000 site ID (P3425), I can identify the duplicates automatically and then resolve them.
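As a rough illustration of the kind of automatic check meant here, the sketch below groups items by their Natura 2000 site ID and reports every ID carried by more than one item. It assumes the (item, site ID) pairs have already been exported from Wikidata by some other means; the function name and the sample identifiers are hypothetical placeholders, not real items or site codes.

```python
from collections import defaultdict


def find_duplicate_candidates(items):
    """Group item IDs by Natura 2000 site ID; keep IDs shared by 2+ items.

    `items` is an iterable of (item_id, site_id) pairs, assumed to have
    been exported from Wikidata beforehand (hypothetical input format).
    """
    by_site = defaultdict(list)
    for item_id, site_id in items:
        by_site[site_id].append(item_id)
    # Only site IDs attached to more than one item are merge candidates.
    return {
        site_id: sorted(item_ids)
        for site_id, item_ids in by_site.items()
        if len(item_ids) > 1
    }


# Placeholder data: these identifiers are invented for illustration only.
sample = [
    ("Q-example-1", "SITE-A"),
    ("Q-example-2", "SITE-A"),
    ("Q-example-3", "SITE-B"),
]
print(find_duplicate_candidates(sample))
```

A shared ID only marks a candidate: as noted above, overlapping national and European designations can legitimately have separate items, so each group would still need a manual check before merging.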

VladaC-osm (talkcontribs)

Hi again, @Strainu,

Rašeliniště Jizery is indeed a different peat bog from Rašeliniště Jizerky, because the Jizera and the Jizerka are different rivers. I guess that P1889 is there to avoid confusion between the very similar names. So let's leave anything named Rašeliniště Jizery out of the discussion.

Nevertheless, there were actually three items named "Rašeliniště Jizerky" after you added Q128419964. I merged your item with Q28915341, which I believe was the correct step.

However, I realize that it would not be correct to merge the resulting item further with Q8547442, which is an overlapping NPR (Národní přírodní rezervace = National Nature Reserve).

From what I have learned, it seems that the Natura 2000 areas in the Czech Republic need to be paired with areas called "evropsky významná lokalita" (site of European importance) or "ptačí oblast" (bird area).

I believe the link between the ÚSOP codes and the Natura 2000 IDs can be found at https://drusop.nature.cz/. You can tick "Evropsky významné lokality" and "Ptačí oblasti" and click "Vyhledat" (search). You then get a list where "Kód" is the ÚSOP code and "Kód Natura" is the Natura 2000 ID.
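If that list can be exported, the pairing could be sketched roughly as follows. This assumes three lookups prepared in advance: the "Kód" to "Kód Natura" table from the search results, the items that carry a ÚSOP code, and the items that carry a Natura 2000 ID. The function name and all sample values are hypothetical placeholders.

```python
def pair_by_usop(usop_to_natura, usop_items, natura_items):
    """Find distinct items that describe the same site.

    usop_to_natura: ÚSOP code -> Natura 2000 ID (from the exported list)
    usop_items:     ÚSOP code -> item that carries that ÚSOP code
    natura_items:   Natura 2000 ID -> item that carries that Natura ID
    Returns sorted (usop_item, natura_item, natura_id) triples.
    """
    pairs = []
    for usop_code, usop_item in usop_items.items():
        natura_id = usop_to_natura.get(usop_code)
        natura_item = natura_items.get(natura_id)
        # Skip codes with no match, and items that already hold both IDs.
        if natura_item is not None and natura_item != usop_item:
            pairs.append((usop_item, natura_item, natura_id))
    return sorted(pairs)


# Placeholder data: invented codes and item IDs, for illustration only.
usop_to_natura = {"1001": "SITE-A", "1002": "SITE-B"}
usop_items = {"1001": "Q-old-1", "1002": "Q-same"}
natura_items = {"SITE-A": "Q-new-1", "SITE-B": "Q-same"}
print(pair_by_usop(usop_to_natura, usop_items, natura_items))
```

Each returned triple is a pair of separate items that the list links to the same site, which makes it a candidate for merging rather than a confirmed duplicate.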

As I mentioned, I have little to no experience with Wikidata. Also, I know very little about protected areas. So that's all I have been able to find on the subject. Maybe it would be a good idea to consult those who imported the items you seem to be duplicating?

Regards, VladaC

Strainu (talkcontribs)

Unfortunately I don't see a "Kód Natura" in the results, maybe because I'm not in CZ. I'll ask around, as you suggested.

Reply to "Duplicate items"

Call for participation in a task-based online experiment

Kholoudsaa (talkcontribs)

Dear Strainu

I hope you are doing well,

I am Kholoud, a researcher at King's College London. As part of my PhD research, I have developed a personalised recommender model that suggests Wikidata items to editors based on their past edits. I am inviting you to a task-based study that will ask you to judge the relevance of the items our model suggests based on your previous edits. Participation is completely voluntary, and your cooperation will enable us to evaluate the accuracy of the recommender system in suggesting relevant items to you. The results will be anonymised before analysis and will be published at a research venue.

The study should take no more than 15 minutes.

If you agree to participate in this study, please either contact me at kholoud.alghamdi@kcl.ac.uk or use this form https://docs.google.com/forms/d/e/1FAIpQLSees9WzFXR0Vl3mHLkZCaByeFHRrBy51kBca53euq9nt3XWog/viewform?usp=sf_link

Then, I will contact you with the link to start the study.

For more information about the study, please read this post: https://www.wikidata.org/wiki/User:Kholoudsaa If you have further questions or need more information, don't hesitate to contact me at the email address above.

Thank you for considering taking part in this research.

Regards
