Babel user information
en-N This user has a native understanding of English.
de-2 Dieser Benutzer beherrscht Deutsch auf fortgeschrittenem Niveau.
zh-2 这位用户的中文达到中级水平
zh-Hant-2 這位使用者有中等水平的繁體中文知識。
zh-Hans-2 这位用户的简体中文达到中级水平
ms-1 Pengguna ini memiliki kemahiran asas dalam bahasa Melayu.
fr-0 Cet utilisateur n’a aucune connaissance en français (ou le comprend avec de grandes difficultés).
Users by language

Hello world!


Queries edit

more

Use CirrusSearch for searches that would otherwise timeout with SPARQL, see.

IRMNG REST service

GBIF API documentation

Disambiguation needed edit

  • Tianella - ciliate or diplopod Q39487350
  • Hadziella - mollusc or ciliate Q18581905
  • Linostomella - fungus or ciliate Q6554754
  • Micromitra - brachiopod or ciliate Q16917654
  • Trachelochaeta - fish or ciliate Q25361557 - the GBIF record has also wrong author for fish https://www.gbif.org/species/4907217
  • Urostylinae - description says "insect" but this looks like a bad bot Q25361721
  • Trichospira - plant or ciliate Q7841011
  • Ophionella - plant or ciliate Q6051554
  • Cataphractes - ciliate Q121484275 or insects Q5051437
  • Trichospira - ciliate or plant Q7841011
  • Panophrys - ciliate or amphibian Q107264714
  • Ophionella - ciliate or plant Q6051554
  • Butschliella Q23070191 algae vs. Buetschliella ciliates
  • Epalxis - ciliates or snails Q101242432
  • Faureia - ciliates or insects Q10494364
  • Lacrymaria Ehrenberg 1830 Q25661057 vs. Lacrymaria Bory St. Vincent 1824 Q6468898; originally spelled Lacrimatoria by Bory, emended to Lacrymaria by Ehrenberg. See Catalogue
  • Cercaria
  • Cistula
  • Hormidium - genus of orchids or genus of green algae, e.g. Q96054281

Disambiguated edit

To do edit

A hierarchy of tasks:

  1. Generate labels and descriptions from statements within an item
  2. Link items within Wikidata using statements/labels of the items
  3. Link items in Wikidata to external identifiers in databases that are programmatically accessible
  4. Add statements to Wikidata based on sources that are not programmatically accessible

Extract structured data semi-manually from published works edit

Data cleaning and linking edit

  • Link taxa to identifiers in GBIF, IRMNG, NCBI, etc., matching higher taxa to avoid homonyms
  • Link taxa to publications of their first description
  • Find and disambiguate homonyms
  • Add basic descriptions for taxa based on vernacular names of higher taxa (e.g. "species of green algae")
  • Add vernacular names @zh for taxa from zhwiki sitelinks
  • Link errata to the articles they correct by matching titles

Authors and basionyms for taxa of mosses

  • Add taxon author citations to taxa of mosses Q25347 sourced from World Flora Online data export
  • Parse taxon author citation and match to botanist author abbreviations, to add taxon author and ex taxon author qualifiers (if not already present)
  • Parse taxon author citation to find items that are recombinations but without basionym statements; add basionyms statements sourced from World Flora Online
  • Explore parsing abbreviated citations from World Flora Online to match taxa to first valid descriptions or other nomenclatural acts

Data modeling questions edit

  • How to qualify reference where role is type designation?
  • How to represent taxon authors as qualifiers of taxon name statement if no item for author exists? (e.g. names we can't disambiguate)
  • How to represent order of taxon authors as qualifiers of taxon name statement?
  • Better to have nomenclatural acts, taxonomic treatments, etc. as objects of "described by" statements? Then these statements themselves can be qualified and ranked. For example, if the first valid description was found from a third party citation or if its status is disputed.

KIV edit