Wikidata:Property proposal/Geolex ID

Geolex ID

Originally proposed at Wikidata:Property proposal/Natural science

   Done: Geolex ID (P6202) (Talk and documentation)
Description: identifier assigned by the United States Geological Survey (USGS) to each stratigraphic unit in the United States of America (USA)
Data type: External identifier
Template parameter: not included in any infobox, but would be a good way of linking to the primary data
Domain: Item
Example 1: Squantum Member -> Squantum_3930
Example 2: Aztec Sandstone (Q4832955) -> Aztec_4626
Example 3: Thumb member (Q57314070) -> Thumb_6240
Example 4: Allalin Glacier (Q674665) -> Mancos_9165
Source: https://ngmdb.usgs.gov/Geolex/search
Planned use: adding lithostratigraphic units from the GEOLEX database and connecting them to database items such as publications, maps, etc.
Number of IDs in source: over 16,000
Expected completeness: eventually complete (Q21873974)
Formatter URL: https://ngmdb.usgs.gov/Geolex/Units/$1.html
Robot and gadget jobs: Bots could gather IDs for existing stratigraphic unit items
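
As an illustration of how the formatter URL expands, here is a minimal Python sketch. It substitutes an ID for `$1` and splits an ID into its name and number parts. The function names `geolex_url` and `split_geolex_id` are hypothetical, and the ID pattern (letters, an underscore, then digits) is an assumption inferred only from the four examples above; real Geolex IDs may contain other characters.

```python
import re

# Formatter URL as given in the proposal; "$1" is the placeholder for the ID.
FORMATTER_URL = "https://ngmdb.usgs.gov/Geolex/Units/$1.html"

# Assumed ID shape, inferred only from the examples (e.g. "Aztec_4626"):
# a run of letters, an underscore, then a run of digits.
ID_PATTERN = re.compile(r"^(?P<name>[A-Za-z]+)_(?P<number>\d+)$")


def geolex_url(geolex_id: str) -> str:
    """Expand the formatter URL by replacing "$1" with the given ID."""
    return FORMATTER_URL.replace("$1", geolex_id)


def split_geolex_id(geolex_id: str) -> tuple[str, int]:
    """Split an ID into (name, number); raise ValueError if it does not
    match the assumed pattern."""
    match = ID_PATTERN.match(geolex_id)
    if match is None:
        raise ValueError(f"unexpected Geolex ID shape: {geolex_id!r}")
    return match.group("name"), int(match.group("number"))


if __name__ == "__main__":
    for example in ("Squantum_3930", "Aztec_4626", "Thumb_6240"):
        print(split_geolex_id(example), geolex_url(example))
```

Because each ID combines a unit name with a number, the URLs cannot be generated by counting alone; a bot would need the full ID string for every unit before building links this way.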

Motivation

I would like to give structure to the multitude of formations, shales, sandstones, conglomerates, etc. I'd like to link those lithostratigraphic units to their source publications, to maps, and to the units above and below them in the stratigraphic column. Trilotat (talk) 03:24, 25 November 2018 (UTC)

@Trilotat: did you try to contact the database manager(s)? (This is completely optional, but recent experience has taught me that it can be very useful.) Cheers, VIGNERON (talk) 20:01, 29 November 2018 (UTC)
@VIGNERON: I didn't, but I should. If they would part with the data, that would speed up the work, wouldn't it? -Trilotat (talk) 20:32, 29 November 2018 (UTC)
@VIGNERON: I have recently learned a little about web scraping and have successfully scraped some simple USGS data. I would like to try scraping this GEOLEX data, but the web addresses are a bit more complex, i.e. they use both letters and numbers in the URL. How to scrape that isn't so clear to me. I've created a new and related proposal here if you're willing to support (or not support) it. Trilotat (talk) 21:17, 10 April 2019 (UTC)

Discussion

@ديفيد عادل وهبة خليل 2, PKM, Susannaanas, Trilotat, VIGNERON: Done: Geolex ID (P6202) Pintoch (talk) 23:35, 2 December 2018 (UTC)

@ديفيد عادل وهبة خليل 2, PKM, Susannaanas, Trilotat, VIGNERON, Pintoch: Thank you, merci, danke, شكرا جزيلا, kiitos, etc. I guess I have some work to do now with some 16K+ units. Trilotat (talk) 23:56, 2 December 2018 (UTC)