Talk:Q28054658

Latest comment: 4 months ago by CV213 in topic ISNI format

Autodescription — Mix'n'match (Q28054658)

Can someone help me with a scraping variable


I'm trying to define the URL pattern for scraping pages such as https://ngmdb.usgs.gov/Geolex/Units/AbbevilleYork_16165.html. I'd like a wildcard or regex (I'm unschooled in regex) that matches pages where "AbbevilleYork_16165" is replaced by a string that starts with a capital letter, continues with an indeterminate run of upper- and lower-case letters, then an underscore, then a number of up to 6 digits. Going to https://ngmdb.usgs.gov/Geolex/search and clicking "search" gives a sample of the links I hope to scrape.

  • I was able to figure this out when the variable was only a number between 1 and 103000. The mix of upper- and lower-case letters is what's throwing me.

Thanks for any pointers. Trilotat (talk) 20:55, 7 April 2019 (UTC)
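
A minimal sketch of the kind of pattern described in the question, written in Python only to show the regex at work. The names `UNIT_URL` and `parse_unit_url` are hypothetical, and the regex dialect accepted by the scraper configuration may differ, so treat the character classes as the portable part: `[A-Z][A-Za-z]*` for the name, `_`, then `[0-9]{1,6}` for the number.

```python
import re

# Assumption: unit pages always live under /Geolex/Units/ and end in ".html",
# as in the single example URL given in the question.
UNIT_URL = re.compile(
    r"https://ngmdb\.usgs\.gov/Geolex/Units/"
    r"(?P<name>[A-Z][A-Za-z]*)"   # capital letter, then any mix of letters
    r"_"
    r"(?P<number>[0-9]{1,6})"     # one to six digits
    r"\.html"
)


def parse_unit_url(url):
    """Return (name, number) if the whole URL fits the pattern, else None."""
    m = UNIT_URL.fullmatch(url)
    if m is None:
        return None
    return m.group("name"), m.group("number")


print(parse_unit_url("https://ngmdb.usgs.gov/Geolex/Units/AbbevilleYork_16165.html"))
```

If unit names can also contain digits, hyphens, or other characters, the name class would need widening; that cannot be confirmed from the one example URL.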

ISNI format


An ISNI was inserted in spaced format, for example: https://www.wikidata.org/w/index.php?title=Q124408545&oldid=2067027790. Spaces in existing values were removed in 2024-02 (Topic:Xyr5b6zcavxhideq), but there is no mechanism to prevent new spaced imports in the tool db. CV213 (talk) 15:55, 22 February 2024 (UTC)
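
As an illustration of the kind of guard described above, here is a minimal sketch that strips whitespace from an ISNI before it is stored. The function name `normalize_isni` is hypothetical and this is not the tool's actual code. It assumes only that an ISNI is 16 characters, with 15 digits followed by a digit or `X`, and it does not verify the check character.

```python
import re

# Assumption: the compact form is 15 digits plus a final digit or "X".
ISNI_COMPACT = re.compile(r"[0-9]{15}[0-9X]")


def normalize_isni(raw):
    """Remove all whitespace and return the compact 16-character form.

    Raises ValueError if the result does not have the expected shape.
    The check character itself is not validated here.
    """
    compact = re.sub(r"\s+", "", raw).upper()
    if ISNI_COMPACT.fullmatch(compact) is None:
        raise ValueError(f"not a 16-character ISNI: {raw!r}")
    return compact


# Sample value used for its format only.
print(normalize_isni("0000 0001 2103 2683"))
```

Running the same normalization at import time and before each edit would keep spaced and unspaced inputs from producing different stored values.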
