Wikidata:Requests for permissions/Bot/companyBot
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Not done, no progress since April. Feel free to re-open this if work resumes. Thanks. Mike Peel (talk) 18:43, 24 September 2022 (UTC)[reply]
companyBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: WikiFan2100 (talk • contribs • logs)
Task/s:Updates ~1,000 wikidata items (all companies) with central index key, bloomberg ID, crunchbase ID, and official website properties. I checked and it should make 1,651 edits.
Code:https://github.com/emg89/wiki-update
Function details: this is a simple script that first checks if a company already has the property, and if not then it adds the property and value (as listed in the CSV file in the github repo), --WikiFan2100 (talk) 23:44, 9 January 2022 (UTC)[reply]
- Seems similar to my request that I never followed up on (see Wikidata:Requests_for_permissions/Bot/BorkedBot_3). Definitely something I'd like to see done. I didn't do it because the joins turn out to be a little tricky to do automatically. In particular using just stock tickers can be unreliable because we don't always have good start times. Looking at your git repo you wrote that you looked all this up yourself? If so I don't think a bot approval is needed since it's really all manually done (i.e. not using code). I would like to see references added to indicate where you are getting this data from if possible. (Since you aren't using pywikibot you might want to have your bot emit quickstatement commands instead of directly editing via the API because then it is simple to add things like references.). BrokenSegue (talk) 02:32, 10 January 2022 (UTC)[reply]
- Yes, this is from a database I have manually created for work, so it's very clean and accurate. I've always had the thought that I would like to contribute it to Wikipedia. Regarding Quickstatements, I wasn't familiar with that tool so I'll definitely take a look at that (thanks). Regarding references, I suppose all of the identifiers I am looking to add are kind of references themselves, as they can be appended to a url that will take you to the page for that company. For example, if you go to the wikidata page for Stripe and click on the CIK it will automatically take you to the SEC's page for Stripe thanks to the formatter url for the CIK property (same for crunchbase and bloomberg IDs). Also, definitely interesting what you were trying to do with BorkedBot_3, it would be great to have more of this information in wikipedia. I spend a lot of time on the SEC's website for work so I'll try to think if there is any way I can help out there. WikiFan2100 (talk) 01:02, 11 January 2022 (UTC)[reply]
- Cool. You can see the source code for my bot that I just never really ran very long here. Though don't run it as it as-is because it has known correctness issues. My main concern about references is "how do you know this CIK maps to the item" (as opposed to, say, a different item with the same name). If the answer is "I did it by hand" then no reference is needed. If you looked it up using another identifier (like the ticker) or used some joining technique then a reference would be helpful. Generally this all LGTM. We should approve this task. BrokenSegue (talk) 03:32, 11 January 2022 (UTC)[reply]
- Ideally we would address Property_talk:P3242#scope_and/or_datatype before adding more of them, but then 1000 isn't that much. --- Jura 11:49, 18 January 2022 (UTC)[reply]
- Support Germartin1 (talk) 04:47, 28 April 2022 (UTC)[reply]