User:Jheald/todo/UK
To do next
- Go through list of remaining unmatched GSSs, & compare with Commons
- Extract unit IDs from VoB
- Extract list of wards from categories & tag with P31
- Match live wards, constituencies, euro-constituencies etc to GSS, TOID
- Northern Ireland House of Commons constituencies
- Refine settlement P131s to districts
- Look at village/town/hamlet etc status of settlements
Civil parishes
- GSS count: tinyurl.com/y8s2nf2y 10464 / 10459 (some may need to be retired)
- list: tinyurl.com/y7ocp4jk
- Items marked as CPs, with no GSS code. (To investigate). tinyurl.com/yawlakzx (520)
- Villages with "civil parish" in desc: tinyurl.com/yaec6f7f -- to fix
- Non-unique codes / links:
- tinyurl.com/mw3e4pb -- GSS values claimed by more than one item. (Currently: 65).
- tinyurl.com/mfdlwjl -- Commons categories for CPs claimed by more than one item. (Currently: 102).
- tinyurl.com/kkuz36e -- CPs that are in areas that are also claimed as CPs. (Currently: 57).
- tinyurl.com/y8f2s9mr -- A query that tries to combine the above. (Currently: 128)
- Better labels, alts, descriptions for CPs -- current descs tinyurl.com/l5f5asr , alts tinyurl.com/kt2unof
- West Sussex:
tinyurl.com/y76bxatg
- cf Bot request: Remove commas from places -- but need good descriptions first
tinyurl.com/jwzh7ms
CPs with a GSS needing a better district (16).
- failed to remove "(parish)" and "(civil parish)" in alts
- check settlement type vs commons, os ?
- Download names+coords for CPs -- try to update GSS, TOIDs, KEPNs, etc
- Update P131s again if necessary
- Cheshire: map old->new CPs.
- CPs with no Commonscats: tinyurl.com/y9ksoah3 (263)
- Commons sitelinks that could be added for CPs: tinyurl.com/y9hyzo37 (48)
- Swedish wikipedia -- UK items with links to sv-wiki, but no P31/P279: tinyurl.com/yc3mj8no ; count: tinyurl.com/yan7bflo (8949);
- Most common properties: tinyurl.com/yd7u5b2b
- pl: tinyurl.com/yd6d7eus (851) / props: tinyurl.com/y82mggey
- .. with names containing "parish": tinyurl.com/y7qsolgl / count: tinyurl.com/y9qr6dff (571) / props: tinyurl.com/y7lojhym
- .. with names containing "distrikt": props: tinyurl.com/y8j2ykef (738)
- Parishes without links to sv-wiki: tinyurl.com/y7f6sngg (934); with geonames tinyurl.com/ybg87eda
- (1) Attempt to match on geonames:
tinyurl.com/yaqxu2jo
- Done, but need to go back and add alts
- (2) Use geonames to infer county; attempt to match on name+county. -- mark duplicate pages first; may still find non-parish matches to pre-empt merges
- (3) Try to match parishes, districts by name without county. -- as above, mark duplicate pages first.
- First attempt to match (but only 15 hits!)
tinyurl.com/ydeq6ghj
- Matched to parishes on geonames:
tinyurl.com/yaa5dgn2
- Believed to be no dupes:
tinyurl.com/y8tznxhg
- List of CPs and geonames:
tinyurl.com/yctwe6vj
- Failure to search through redirects?
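Steps (2) and (3) above, together with the note about stray "(parish)" / "(civil parish)" suffixes in alts, could be prototyped offline roughly as follows. This is only a sketch: the row layout `(identifier, name, county)`, the function names and the sample identifiers are all invented for illustration, and it assumes both lists have already been downloaded. Keys that are ambiguous on either side are skipped rather than guessed, in line with the "mark duplicate pages first" caveat.

```python
import re
from collections import defaultdict

# Trailing "(parish)" / "(civil parish)" markers to ignore when comparing names.
_SUFFIX = re.compile(r"\s*\((?:civil )?parish\)\s*$", re.IGNORECASE)


def normalise(name):
    """Lower-case a label and drop a trailing "(parish)" / "(civil parish)"."""
    return _SUFFIX.sub("", name).strip().lower()


def match_by_name_and_county(wikidata_rows, geonames_rows):
    """Pair rows whose (name, county) key is unique on both sides.

    Each row is a hypothetical (identifier, name, county) tuple.
    Ambiguous keys are left out so they can be checked by hand.
    """
    def index(rows):
        table = defaultdict(list)
        for ident, name, county in rows:
            table[(normalise(name), county.lower())].append(ident)
        return table

    wd, gn = index(wikidata_rows), index(geonames_rows)
    return {
        wd[key][0]: gn[key][0]
        for key in wd
        if len(wd[key]) == 1 and len(gn.get(key, [])) == 1
    }


# Placeholder data, not real identifiers.
wikidata = [
    ("Q_A", "Ashford (civil parish)", "Kent"),
    ("Q_B", "Newton", "Suffolk"),
    ("Q_C", "Newton", "Suffolk"),  # duplicate key: skipped
]
geonames = [("111", "Ashford", "Kent"), ("222", "Newton", "Suffolk")]
matches = match_by_name_and_county(wikidata, geonames)
```

Dropping the county from the key would give the looser name-only match of step (3), at the cost of many more skipped ambiguous names.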
Settlements
- Better labels, alts, descriptions for villages
- P131s for towns & villages -- query: tinyurl.com/l4nzutz / tinyurl.com/lflpkho / tinyurl.com/k9majhe (3830)
- Use Commons hierarchy to identify CPs for them -- quarry: 17609 + links to CPs on Commons: tinyurl.com/kaqcxcv
- Do some matching based on CPs/counties to find some more Commonscats
- Try to check village/town/hamlet status of CPs that are also settlements
- resolve multiple values -- appear to be none: tinyurl.com/kl2gcdr
- Identify more items as UK settlements (26859)
- Tracking: common properties: tinyurl.com/lew5n6a & classes: tinyurl.com/keudpnz (subclasses only: tinyurl.com/mg2rf4p)
- Very many TOID, VoB, GeoNames, OpenDomesday, KEPN not yet matched
- Query for UK settlements with coords, other than civil parishes -- tinyurl.com/kxsg87w (17617)
- UK settlements with no en label: tinyurl.com/kjvf9gy (0)
- ADMs of G II LBs that are not settlements or recognised ADMs: tinyurl.com/knrrud3 -- mostly wards, some geo-features (82)
- birthplaces, similarly: tinyurl.com/mf2472r (21)
- UK items not marked as settlements
- Breakdown of properties on items with no P31, with P17 = Q145, other than listed buildings: tinyurl.com/lkguctf (30699)
- 13288 have coordinates: tinyurl.com/k6kybky
- Breakdown of properties on items without P31, in the UK by P131, other than listed buildings: -- tinyurl.com/nxlh954 (5458)
- 2471 have coordinates: worth running against TOID, Geonames.
- 591 are on NI heritage list -- should have heritage status
- Items not marked as being in the UK:
- Items in the UK by coords, that have not got P17 = Q145 -- tinyurl.com/l8o4rah (6090)
- (?not possible to get those without P31: tinyurl.com/lz9yl9w )
- most are minimal stub items from other wikis -- try to match as toids / historical places, & merge
- classes: tinyurl.com/ku6q59f (no settlements)
- Items in the UK by P131, that have not got P17 = Q145, & no P31 -- tinyurl.com/kmhdrby properties: tinyurl.com/lw8up7k
- (huge numbers of Welsh places. Be careful to exclude "British Raj").
- Same query for England: tinyurl.com/la273w4 (43)
- Allowing P31: quite a lot of wards, and places: tinyurl.com/ku5cmvn / tinyurl.com/n2lfvul (abt 170)
Historical buildings
- Much better adms needed -- re-extract for parishes
- Current adm classes:
tinyurl.com/y8az996j
- perhaps extract from www.britishlistedbuildings.co.uk ?
- Also add street addresses; try to identify nature of buildings
- Grade II listed buildings, without either ADMs or ADM2s that are parishes or Welsh communities -- tinyurl.com/y8ldfvmy
- each ADM -- tinyurl.com/yd4ekcyf
- ADMs of G II LBs that are not parishes: -- tinyurl.com/kfu3bvy
- Bad merges
tinyurl.com/mktsjqv
Electoral divisions etc.
- Breakdown of types of electoral division:
tinyurl.com/m7738n9
- wards tinyurl.com/mov8ohh -- lots of identification needed, cf en:Category:Wards_of_England
- existing: tinyurl.com/muk2fmx / wards with no GSS: tinyurl.com/kpxfg3x
adapt existing quarry query to also retrieve Q-number (cf quarry 485)
- level 1 -- quarry 17891
- level 2 -- quarry 17890
- level 3 -- quarry 17893
- Westminster constituencies -- see Wikidata_talk:WikiProject_British_Politicians#Constituencies for queries.
- TOID -- just missing Northern Irish: tinyurl.com/ybnh8ubb (18)
- GSS -- ditto: tinyurl.com/yar9khj9 (18) -- (multiples due to withdrawn identifier values tinyurl.com/yayqhq8n : old ones need to be deprecated)
- Scottish / Welsh / MLA / Euro constituencies still to do
UK General
- Civ parishes with geonames & poor ADMs: tinyurl.com/lltul7e
- updated qy: toids -- tinyurl.com/mpm2zvn
- Estimating coords: tinyurl.com/mmetm9p / London boroughs: tinyurl.com/ke4pqac
- Region check variant:
tinyurl.com/kx49wfv
- Places inside a CP: tinyurl.com/meh36kt
- Instances of "Geographical object" in the UK -- need refinement:
tinyurl.com/l3dmnwt
Identifiers
GSS code (2011) (P836)
- All UK? - tinyurl.com/mkk4kqv - YES.
- item/gss/cerem: tinyurl.com/m464a4t
- Deprecated GSS values that haven't been tinyurl.com/llsk7pl (699).
- Duplicated values: tinyurl.com/mw3e4pb (77)
- duplicated values with no duplication on sv-wiki: tinyurl.com/ln3q2co (1).
- renamed parishes (from GSS):
tinyurl.com/n2nbehl
- Cheshire West decision document [1]
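The duplicated-value check can also be run offline against a downloaded list of item/GSS pairs, which is handy for re-counting after a batch of fixes. The following is a minimal sketch, not the query behind the links above; the function name is an assumption and the sample identifiers and codes are placeholders.

```python
from collections import defaultdict


def duplicated_values(pairs):
    """Return {value: sorted items} for values claimed by more than one item.

    `pairs` is an iterable of (item, value) tuples, e.g. item ID and GSS code.
    """
    claims = defaultdict(set)
    for item, value in pairs:
        claims[value].add(item)
    return {v: sorted(items) for v, items in claims.items() if len(items) > 1}


# Placeholder rows, not real Wikidata items or GSS codes.
rows = [
    ("Q_A", "E04000001"),
    ("Q_B", "E04000001"),  # same code on a second item
    ("Q_C", "E04000002"),
    ("Q_C", "E04000002"),  # repeated row for one item is not a duplicate
]
dupes = duplicated_values(rows)
```

The same function works unchanged for TOID or Commons category pairs, since it only looks at which items share a value.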
GeoNames ID (P1566)
- to do: check classes vs. geonames feature codes
- Multiple values - tinyurl.com/mo5m4dl (80): needs investigation
- UK items with Geonames -- tinyurl.com/l5ljbdl
TOID (P3120)
- Documentation could use constraint reports, incl special one for multiple values.
- Multiple values: tinyurl.com/kea2jlp -- mostly close together ?
- Non-unique values: tinyurl.com/muudoss (95)
- All UK? - tinyurl.com/lvl82tv - YES.
- See Abersychan (Q3304459) for some clean-up issues (eg "use" qualifier; coords from TOID).
- Lookup TOID from GSS:
tinyurl.com/y9w3fsg8
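For the open question of whether items with multiple TOID values are "mostly close together", one possible check is the largest pairwise great-circle distance between the points attached to one item. The sketch below assumes each TOID has already been resolved to a WGS84 latitude/longitude pair in degrees; the function names and the mean Earth radius constant are choices made here, not taken from these notes.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in degrees."""
    phi1, phi2 = radians(lat1), radians(lat2)
    dphi = phi2 - phi1
    dlam = radians(lon2 - lon1)
    a = sin(dphi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))


def max_spread_km(points):
    """Largest pairwise distance among the (lat, lon) points of one item."""
    return max(
        (haversine_km(*p, *q) for i, p in enumerate(points) for q in points[i + 1:]),
        default=0.0,
    )
```

Items whose spread exceeds some chosen threshold (say a few km) would be the ones worth looking at by hand.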
Vision of Britain place ID (P3616)
- All UK -- tinyurl.com/kb6ujvq -- 144 (but includes Ireland and Estonia)
- Map -- tinyurl.com/kmel5xt
- VoB counts for different types of human settlement
tinyurl.com/l496nsk
OpenDomesday settlement ID (P3118)
- All UK? -- tinyurl.com/mkas8j4 -- YES
KEPN ID (P3639)
- All UK? -- tinyurl.com/n2fosmt -- YES
Survey of English Place-Names ID (P3627)
British History Online VCH ID (P3628)
- Extraction from sv-wiki <-- No, only 32
- https://en.wikipedia.org/w/index.php?title=Special:LinkSearch&limit=5000&offset=0&target=http%3A%2F%2Fwww.british-history.ac.uk%2Fvch%2F <-- external links from en-wiki
- Consider how to extract parishes from pages like:
http://www.british-history.ac.uk/vch/beds/vol2/pp266-276#h3-0003
- Need listing of C19 CPs first ?
Commons category (P373)
- Sub-sub-categories of a Commons category; and the categories they are in:
https://quarry.wmflabs.org/query/17609
- & sub-sub-sub-categories:
https://quarry.wmflabs.org/query/17610
- Items in class with no P373:
tinyurl.com/modhm49
- Ceremonial counties -- excludes City of Bristol (Q21693433) (link taken by item for settlement, rather than adm district)
- Non-met counties -- tinyurl.com/ltj6lrk -- may exclude County Council areas, eg Derbyshire (Q11775003), North Yorkshire (Q21241814), Lincolnshire (Q21269047)
- UAs -- districts where the city is also the UA: tinyurl.com/l35k95g
- CPs with a non-unique P373:
tinyurl.com/mleexsc
- multiple values:
tinyurl.com/lo65uz7
?use for settlements, and to extract CPs
- Check all P373s now point to valid commonscats in the CP hierarchy (query: tinyurl.com/k24bj6h )
- they don't -- lots of cats on Commons not identified as CPs.
- however: also some cats on Commons not on the list <-- missing CPs ?
- dangerous to bulk tag CP cats on Commons until these are checked.
Heritage status
- Better localisation to parishes
- Systematic chasing of Commons links
- Downloads page:
https://www.historicengland.org.uk/listing/the-list/data-downloads/
- sim for Wales (CADW?), Scotland (?CANMORE)