Wikidata talk:WikiProject Informatics

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days. For the archive overview, see Archive/. The latest archive is located at Archive/2024.

Generic properties edit

I am think that we should make use of these two properties, in preference to creating endless specific properties. So, we would claim that iPhone 5S (Q14850354) has part(s) (P527) Apple A7 (Q14887104) instead of creating a SoC property. SoC being similar to CPU. This may mean that the CPU property should be merged and deleted.

Any thoughts on this? Danrok (talk) 02:30, 27 January 2014 (UTC)Reply

At some point we will be able to do complex queries and then it will be obvious that the item we link to is a CPU, because it is in the subclass-tree of CPU. But currently I favour "working-properties" that allow us to sort things out using simple queries and the constraint violations. Once Wikidata gets more intelligent we can quickly move our data to a new property, so no work we do now is lost. --Tobias1984 (talk) 08:27, 27 January 2014 (UTC)Reply

Which time qualifier software version identifier (P348)? edit

Should we use publication date (P577) or point in time (P585) as the qualifier for software version identifier (P348)? Both of them are used, but I am not sure if we have statistics on the usage. -Tobias1984 (talk) 10:35, 6 August 2014 (UTC)Reply

Bot Request edit

I requested to include processor information from a table I made, see the discussion here: Wikidata:Bot requests#Intel processors.--MisterSanderson (talk) 16:24, 6 October 2014 (UTC)Reply

Linking from Wikipedia edit

I don't think we have links to this project yet from the corresponding Wikiprojects in the different languages of Wikipedia. A link with some sort of a logo might be more welcoming to new users of Wikidata. I made a quick attempt, but will gladly accept any other logo for the link. It is not super important and we can always change it later. I also made it into a userbox:

 This user is a member of
WikiProject Informatics.

. -Tobias1984 (talk) 21:28, 29 October 2014 (UTC)Reply

This logo is beautiful, even being more related to electronics than informatic.--MisterSanderson (talk) 01:06, 30 October 2014 (UTC)Reply
I made another one. The floppy is pretty iconic for the concepts of Informatics. But it might not work well for color-blind people. -Tobias1984 (talk) 19:51, 30 October 2014 (UTC)Reply
 
2nd Logo
-Tobias1984 (talk) 19:51, 30 October 2014 (UTC)Reply
I have no experience with color-blind people.--MisterSanderson (talk) 14:23, 1 November 2014 (UTC)Reply
Maybe replace gray to white? DrSauron (talk) 20:53, 3 November 2014 (UTC)Reply
I think that replacing the dark-grey of the piece that covers the hole to silver would make it better. The majority of my flop disks have this piece metalic, silver colour.--MisterSanderson (talk) 00:39, 4 November 2014 (UTC)Reply
@MisterSanderson, DrSauron: I made the changes you suggested. How does it look? -Tobias1984 (talk) 09:29, 4 November 2014 (UTC)Reply
Perfect.--MisterSanderson (talk) 15:08, 4 November 2014 (UTC)Reply
Now we just need to plant the links on the appropriate WikiProjects. I already did en:Wikipedia:WikiProject Computer science. Tobias1984 (talk) 15:34, 4 November 2014 (UTC)Reply
Actually it is easier to keep track of the links here (If every project allows the logo with the link): commons:File:WikiProject Informatics Floppy.svg#File usage on other wikis -Tobias1984 (talk) 15:43, 4 November 2014 (UTC)Reply

Algorithms: Instance of or Subclass of? edit

Some algorithms, like quicksort (Q486598) and merge sort (Q189057), have subclass of (P279) statements instead of instance of (P31) ones. Is that correct? I would think instance of (P31) would be more appropriate. --DSGalaktos (talk) 12:14, 1 November 2014 (UTC)Reply

@DSGalaktos: I try to think about it like this: instance of (P31) is really just for individuals like people, buildings, or places. Software doesn't really have instances because many computers can run them. I think one can actually say that subclass of (P279) is the default ontology. -Tobias1984 (talk) 13:05, 1 November 2014 (UTC)Reply
@Tobias1984: I consider a particular algorithm still an instance, although it’s of course not a physical instance of anything. I think this is also more useful, because there’s definitely some difference between, say, merge sort (Q189057) and comparison sort (Q2632949): The latter is a group of algorithms, while you can’t have subclasses of Mergesort. (Quicksort is perhaps a special case because you could see different pivot choices as different versions – i. e. instances – of it.)
We also consider programs as instances, even though there are multiple installations of them: For example, GNU Emacs (Q1252773) is an instance of Emacs (Q189722). --DSGalaktos (talk) 15:03, 1 November 2014 (UTC)Reply
@DSGalaktos: At the moment some people only add instance of (P31) statements which is a problem. Everybody editing should always think critically when to use p31 or p279. We could also see what Wikidata:WikiProject Ontology has to say about this case. -Tobias1984 (talk) 19:02, 1 November 2014 (UTC)Reply
@DSGalaktos:, @Tobias1984:: I think there are a few obvious ones:
  • quicksort is an instance of a sorting algorithm
  • a stable sorting algorithm a subclass of sorting algorithms
  • GNU Emacs is an instance of editing software
Less obvious are:
  • quicksort and randomized quicksort (neither subclass nor instance of seems appropriate here, as none of the two is a generic class of things. should they be left unrelated (directly)? the closest analogy I would be able to think of is that thet randomized quicksort is a subtype of quicksort. alternatively one can view quicksort simultaneously as a class of sorting algorithms and a specific instance of sorting algorithms.)
  • GNU Emacs and Emacs: as above, here instance of is probably appropriate if one considers Emacs as a generic class of editing software instead of a specific instance.
The root of the problem is that there is a whole hierarchy of "classyness": sorting algorithms > variations of quicksort > specific variation of quicksort > specific implemenation of a variation of quicksort in some programming language > specific implementation of quicksort in some particular machine code > specific copy of that machine code on some storage media. The place where one draws the binary disitinction between "class" and "instance" is then somewhat arbitrary. Since Wikidata will likely only ever have items the three top-most items in this hierarchy, one will pragmatically consider "sorting algorithms" to be a class, and "quicksort" to be an instance (with the additional complication that "quicksort" can refer to both "variations of quicksort" and some "specific variation of quicksort").
Ruud 15:25, 9 December 2014 (UTC)Reply
Ruud, DSGalaktos, consider another mundane example: Microsoft Excel. At first glance, many would say "Microsoft Excel instance of (P31) software" (or instance of program, spreadsheet editor, what have you). However, ontologists typically say "Microsoft Excel subclass of (P279) software". Take a glance at this recent paper of the Software Ontology (SWO), which discusses this case in further detail. You can also explore the SWO entry on Microsoft Excel on NCBO BioPortal. Note how the panel at right says "subClassOf Microsoft developed software", and how Microsoft Excel 2007 is a subclass of Microsoft Excel.
This approach has a sensible philosophical foundation -- see pages 47 and onwards in this Introduction to the Information Artifact Ontology (IAO). It basically elaborates on the point Tobias made above.
Using subclass of for Excel also has practical benefits. Subclass of is the canonical property used by the Semantic Web to specify that all the instances of one class are instances of another. All instances of Microsoft Excel 2007 are instances of Microsoft Excel, etc. Using instance of cannot achieve that powerful entailment. The linked SWO paper describes how useful inferences can be drawn by using subclass of, e.g. inferring that a piece of software is open source in the 'Software licences' section.
As Markus Krötzsch put it in a recent discussion about Porsche 356's: "if it is possible to use subclass of, then it is usually also quite likely that more specific items occur later (Porsche 356 v1 or whatever), and we really will need subclass of to build a hierarchy then".
This includes algorithms, I would say. Consider how SWO classifies the Smith-Waterman algorithm -- via "Smith-Waterman subclass of algorithm". This is directly applicable to quicksort, etc.
Markus and many other Semantic Web researchers have been developing algorithms to do rich inference and querying, but they tend to hinge upon building hierarchies with subclass of (i.e., rdfs:subClassOf). The fact that some entity is a "leaf" in a conceptual hierarchy does not mean that it is necessarily an instance. In fact, most hierarchical leaf nodes in ontologies are treated as classes and linked to their parents via subclass of, not instance of.
So it seems there are solid philosophical and practical reasons to state make statements like "quicksort subclass of sorting algorithm" and "Microsoft Excel 2007 subclass of Microsoft Excel", etc. I think using subclass of (P279) is the way to go here. Emw (talk) 04:51, 10 December 2014 (UTC)Reply

IRC channel edit

@Genium, Danrok: The property IRC channel URL (P1613) is done, but it still seems to only take IRC channels that start out with "http://". See for example the statement for Inkscape (Q8041). Should we have a separate property for the developer channel? And what do you think about linking the archives of the mailing lists (e.g. https://lkml.org/). --Tobias1984 (talk) 09:55, 17 November 2014 (UTC)Reply

Hi Tobias1984, and Thank you so much for supporting this property! Not sure if we need a separate property, we can add new URLs as needed. PostgreSQL has many channels related to the project. IMHO, we should have two sub-properties, one for the channel, and another for the network. Or maybe just one value formatted like this: #inkscape on irc.freenode.net.
For mailing lists, why not have a single one generic property, on which we can also add new URLs as needed (archives of dev, help…) ? genium ⟨✉⟩ 09:46, 19 November 2014 (UTC)Reply
@Genium: How about using qualifiers like this: https://www.wikidata.org/w/index.php?title=Q4115189&diff=175285264&oldid=175284603 ? Sandbox-string would have to be replaced by a new property called something like "channel name". By the way: Adding addresses that start out with IRC:// is now possible! --Tobias1984 (talk) 10:06, 19 November 2014 (UTC)Reply
Sounds very good! genium ⟨✉⟩ 10:30, 19 November 2014 (UTC)Reply

Launch of WikiProject Wikidata for research edit

Hi, this is to let you know that we've launched WikiProject Wikidata for research in order to stimulate a closer interaction between Wikidata and research, both on a technical and a community level. As a first activity, we are drafting a research proposal on the matter (cf. blog post). It would be great if you would see room for interaction! Thanks, --Daniel Mietchen (talk) 01:35, 9 December 2014 (UTC)Reply

file formats, extensions, standards, applications edit

We have many items that are both file name extensions and file formats. Sometimes, it’s even more complicated: For example, JAM is a

  • file format,
  • file name extension, and
  • application that reads those files,

and the Wikipedia article linked to JAM notation (Q6107674) describes all three.
Similarly, JSON (Q2063) is a

  • file format,
  • file name extension, and
  • open standard.

I think that we should have separate statements for each of these uses, for every file format/extension out there. Do you agree? —DSGalaktos (talk) 16:01, 13 February 2015 (UTC)Reply

@DSGalaktos: I think looking at the type statistics would be a good place to start this cleanup (Wikidata:Database_reports/Constraint_violations/P1195#Types_statistics). If we wan't to change how this property is used for programming languages the structure would be: e.g. for vala "uses file format --> vala-source-file" and the vala-source-file-item could have the statement "file-extension --> vala".
Your example is even more difficult. Especially the line between open-(file-format)-standard and file-format seems very blurry. I think the most important thing is that the properties are used consistently. --Tobias1984 (talk) 16:32, 13 February 2015 (UTC)Reply
@DSGalaktos: I don't think there is a difference between a file format and an open standard*. I think we should have separate items for the formats and applications (if both are notable; having an item for the application might make the item for the file format structurally necessary, but the other way around may not be true). File name extensions (like media type (P1163) and FourCC (Q950009) codes) should be properties of a file format. —Ruud 12:57, 16 February 2015 (UTC)Reply
* An exception might be if the text of the standard is used as a reference. Then we might need a separate item for that work.

OS, platform, properties, what for ? edit

@Agabi10:   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead. We have a bunch of properties usable to mention the software or hardware platform a software is made to run on. Is it relevant to actually have a platform and an OS property ? There is a lot of edge cases : java software, emulators who are neither one or the other.

I think we should merge the properties. It would simplify the usage and make the propery usable for a video game on a console or consoles as well as a java software who can run on any hardware and OS (usually) the JVM is implemented on. I don't really see any drawbacks to do this. author  TomT0m / talk page 15:01, 20 October 2015 (UTC)Reply

  • Personally I prefer having both and using platform as something more hardware related and OS to define the operating systems, removing properties will only create a problem where people will start considering everything as a platform. For example, some of the current constraint violations for the platform are instances of software engine, which should be used in software engine (P408) instead. If we merge this two properties we should merge also software engine to them. And the list is probably even longer. What we need to do (or at least is what I think) is to limit platform to a more hardware related property and use a bot to clean the constraint violations moving the most used violations to the property they should have. -- Agabi10 (talk) 15:11, 20 October 2015 (UTC)Reply
  • Relaxing the constraint could also work to remove the constraint violations, but that the objective is not making the constraint violation reports empty (even if it would be great if we get rid of all the violations of all the properties), the objective should be making the data having sense and at the moment this is not what we have. With this property it is clear that we don't know in what we should use it, and instead of broadening its scope we should move the values which fits better with other properties to the other properties. The operating systems should be used with operating system (P306), the software engines with software engine (P408), the programming languages with programmed in (P277) and so on. We have those properties and we should make a correct use of them. Is the only possible way I can think to remove the inconsistencies of these properties and start knowing what we are obtaining when we are querying this property. -- Agabi10 (talk) 23:47, 20 October 2015 (UTC)Reply
    • @Agabi10: This is a clear mapping, but we're talking about the meaning of "platform" or "OS". This actually IS NOT clear ! Because a sofware is usually not made for a specific hardware platform. If you code a software for Java, the hardware platform does not make sense. If you code a software for raw C, the OS does not make sense. A platform like Java however make sense. It does not mean "Language" because ther is several languages on top of the JVM. The only way to handle all this cases is to acknoledge the fact that a platform is a combination of hardware and software sometimes like PS4 platform or Windows/x86 for example (those kind of platforms may even be emulated ...) , other times a portable software platform like a JVM platform. Portable softwares usually tries not to use hardware specific features. We need to redefine our properties to be able to express all that for a good model. author  TomT0m / talk page 08:33, 21 October 2015 (UTC)Reply

Removing old version claims edit

@!болит: I am posting this here so more people see it and are aware of the issue. Wikidata also saves old versions of software, because we are interested in the history of develpment. For example: "When was versio 1.2 of Gedit released" would be a possible question, that we would like to answer. If the old claims of old versions are removed then this does not work. We need as much people as possible to check the software items and explain to people how and why this is done on Wikidata. --Tobias1984 (talk) 09:16, 21 October 2015 (UTC)Reply

This is important all over the place :) Think of the properties like "name at birth" who might even not exist. And a danger of client editing if not done right, I'll put that on the relevant input page. author  TomT0m / talk page 09:30, 21 October 2015 (UTC)Reply
@TomT0m: Hopefully at some point we will have a super easy interface with lots of easy to understand warning and built-in video tutorials. But at the moment the good faith removal of information, is a problem. It would be nice if IPs and people with less than 1000 edits would need a second check on the removal of claims. --Tobias1984 (talk) 09:33, 21 October 2015 (UTC)Reply

Pronom identifier (PUID) property proposal edit

Hello everyone, I fill a proposal to add a property "PRONOM identifier". As some of you may know, PRONOM is a technical registry (maintained by The National Archives in the UK) that gives information on format and on software that can deal with those formats. Having this information on wikidata, will help to check or cross-control information on formats. You may vote here. Thanks. Toto256 (talk) 17:18, 3 February 2016 (UTC)Reply

The developer (P178) property edit

In this page, I can actually read : “if a developer is an organization, the developer (P178) qualifier must be set to the number of employees contributing modifications to the code base”.

I think this qualifier should only be used for the name of the group, the organization, or the developer and not for a number like 3 (Q201) ; so we should use a property which was made for entering numbers. Is number of participants (P1132) good for this use, is there yet another property, or should we create the qualifier “number of developers” (maybe too specific…) — Metamorforme42 (talk) 20:08, 15 July 2016 (UTC)Reply

number of participants (P1132) is better indeed, I'll update the page accordingly as well as the few items that use it. Dachary (talk) 17:06, 16 July 2016 (UTC)Reply

Software and website edit

Hi, following an idea I had when discussing with @Dachary: on problems with one of my batch edit, I found a pattern of mistakes : software that are also websites. It occured to me that I could pretty simply generates a query to find them all with my work around disjoint union of (P2738)  .

I then created software artefact (Q27043072)      and the following query (modulo the comment I added because of a conceptual bug in the generating code that will be corrected) popped up as a link in the bottom of the talkpage of the item.

select ?item where {
	# ?item wdt:P31/wdt:P279* wd:Q27043072 .
	{
		?item wdt:P31/wdt:P279* wd:Q205663 .
		?item wdt:P31/wdt:P279* wd:Q166142 .
	} .
}
Try it!

To make this work, I had to correct a (stupid) bug in Module:Requests add a claim - that needs to be discussed : any website (Q35127)      is a computing process (Q205663)     . The result is a list of 89 software that may actually be websites or the converse, similar mistakes than the two Dachary found out.

author  TomT0m / talk page 13:38, 28 September 2016 (UTC)Reply

It's a useful query to have, thank you :-) Dachary (talk) 14:07, 28 September 2016 (UTC)Reply

FIle formats sub-project created edit

Hi all, I have started a sub-project page at Wikidata:WikiProject Informatics/File formats to help assist with definition of file formats in Wikidata. Please provide input wherever you feel you can assist.   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead., @YULdigitalpreservation: Pixeldomain (talk) 03:55, 14 October 2016 (UTC)Reply

For those interested, there is some good discussion currently ongoing at Wikidata talk:WikiProject Informatics/File formats about how best to represent file format information in Wikidata. Pixeldomain (talk) 07:44, 27 October 2016 (UTC)Reply

New subpage for configured environments edit

I started a new sub-project page at Wikidata:WikiProject Informatics/Configured environments as a place for us to define how we want to be able to describe configured environments in Wikidata. Feedback and input welcome!   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.. YULdigitalpreservation (talk) 18:11, 29 November 2016 (UTC)Reply

Computer network protocols sub-project created edit

Hi all, I have started a sub-project page at Wikidata:WikiProject Informatics/Computer network protocols to help assist with definition of computer network protocols in Wikidata. Please provide input wherever you feel you can assist.   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead. Pixeldomain (talk) 08:12, 13 January 2017 (UTC)Reply

Character encoding sub-project created edit

Hi all, I have started a sub-project page at Wikidata:WikiProject Informatics/Character encoding to help assist with definition of character encoding systems in Wikidata. Please provide input wherever you feel you can assist.   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead. Pixeldomain (talk) 04:25, 4 April 2017 (UTC)Reply

Programming languages sub-project created edit

Hi all, I have created a sub-page on programming languages: Wikidata:WikiProject_Informatics/Programming_Language. The purpose is to map existing infobox properties in different languages to Wikidata properties and to identify new properties so as to achieve a detailed description of hundreds of programming languages out there. Please give feedback.   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.. Jsamwrites (talk) 16:02, 4 April 2017 (UTC)Reply

alternatives property edit

hello   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.,

I was looking for a way to indicate that Yarn (Q28975591) is an alternative to npm (Q7067518), and vice-versa, a bit in the way how AlternativeTo (Q3613175) keeps a graph of software alternatives (ex: Yarn alternatives). While I guess that some cases have way too many alternatives and should then rather be solved by finding all the instances of a given type of software (ex: instances of flight simulator), some softwares share a strong, direct, often competitive, relationship that I think worth signalling by some property. Do you see a property that could be used in this sense, or should I propose a new one? Or do you think it's a plain bad idea? -- Maxlath (talk) 09:49, 22 June 2017 (UTC)Reply

My personal opinion is to make use of instance of (P31) (in a similar manner to your above query). Some items are instances of multiple items (or classes) and this additional information can be utilized to find alternative software solutions. Jsamwrites (talk) 17:05, 22 June 2017 (UTC)Reply

RFC about sub-category differentiation edit

#Request for comments about sub-category differentiation Thanks. --Valerio Bozzolan (talk) 09:01, 30 June 2017 (UTC)Reply

List of graphics processing units (a lot of new properties) edit

Hi, I am new to wikidata and I would like to get all the information/tables from pages like wikipedia:List of AMD graphics processing units and wikipedia:List of Nvidia graphics processing units into a similar format.

If I get this correctly, I would need to create an item for each table entry and I would need a lot of new properties. Is this a valid idea for wikidata? What is the process to request those? I read on Help:Properties that it would be best to request this kind of data on a WikiProject (=here).

I started randomly on this list by adding Q31270335.

--EoD (talk) 07:02, 1 July 2017 (UTC)   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.Reply

Hi, that's an interesting problem - naively I would also think many of these properties would need to be created. We can brainstorm here first to see how to reuse existing ones, and then we can file property proposals for the remaining ones. I think when the values are categories, we can always use has characteristic (P1552) with the appropriate items (to be created, probably). − Pintoch (talk) 21:59, 1 July 2017 (UTC)Reply

Ontology creation similar to MeSH edit

Can we use this project to create an informatics and computer science ontology similar to the scale of MeSH https://meshb.nlm.nih.gov, based on Wikipedia and Wikidata resources. A simple proposal is to identify certain core categories e.g. (taken from Core areas of computer science):

  • Algorithms and Data Structures
  • Architecture
  • Artificial Intelligence and Robotics
  • Database and Information Retrieval
  • Human-Computer Communication
  • Numerical and Symbolic Computation
  • Operating Systems
  • Programming Languages
  • Software Methodology and Engineering


And use subcategories inside each of these to point to fine grained concepts. How to do this in Wikidata can be as follows:

  • Create a property InformaticsOntologyParent which points to all the parents of a given concept.  – The preceding unsigned comment was added by Napsternxg (talk • contribs).
Take a look at the existing Wikiprojects. John Samuel 17:59, 19 September 2017 (UTC)Reply
@Napsternxg: I think a good place to start is looking at some graphs like this: https://angryloki.github.io/wikidata-graph-builder/?property=P279&item=Q8366&iterations=10&limit=10&mode=both - Depending on the start-item and the properties used to traverse the graph it might reveal something you are interested in. MeSH seems to me to be much more coherent and the vocabulary much more controlled than what Wikidata currently has. Nonetheless Wikidata has many hidden insights and starting points for building a more coherent ontology. Either on Wikidata itself or by taking the data and building something own. --Tobias1984 (talk) 19:14, 19 September 2017 (UTC)Reply
@Tobias1984: Thanks for the link. This approach looks quite promising to me. I will try it out.

Git as Protocol? edit

  Notified participants of WikiProject Informatics/FLOSS

We currently have a lot of Software-Items with source code repository URL (P1324)=... and qualifier protocol (P2700)=Git (Q186055). They are marked as potential Problems, since Git (Q186055) is no instance of communication protocol (Q132364). How do we want to deal with that?

I'm not super happy with all these options, but think we should think about this Question now, before they are used to widespread. -- MichaelSchoenitzer (talk) 14:22, 31 October 2017 (UTC)Reply

In a very generic way, every VCS is a communication protocol. You have to respect some rules to share and save your data with others. Thinking on it, everything is a communication protocol. I don't know if I'm jocking. --Valerio Bozzolan (talk) 14:28, 31 October 2017 (UTC)Reply

Due to lack of responses, I do a ping:   WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead. -- MichaelSchoenitzer (talk) 13:15, 4 November 2017 (UTC)Reply

I am not of the opinion to use 'protocol' as a qualifier of software repository for git (or any other VCS). I would suggest to create a property (version control system) that can be used as a Wikidata qualifier like version type (P548). John Samuel 15:46, 4 November 2017 (UTC)Reply
I think every VCS can be considered a comunication protocol, in the case of Git, uses its own protocol over HTTP or SSH. But I think protocol (P2700) it's very imprecise, and maybe we need a new property called 'version control system'. --Giovanni Alfredo Garciliano Díaz diskutujo 19:18, 4 November 2017 (UTC)Reply
Although git sometimes is used over SSH or HTTP, it has its own protocol, which is not based on SSH, HTTP or any other application-level protocol. The same for SVN. DrSauron (talk) 07:45, 5 November 2017 (UTC)Reply
Mmm why not have a property « version control software » ? author  TomT0m / talk page 17:13, 4 January 2018 (UTC)Reply
  Comment Agreed with DrSauron that a "version control software" property cannot adequately describe how the repository is accessed, as the access is protocol specific (SSH, HTTPS, etc). source code repository URL (P1324) is subproperty of (P1647) URL (P2699) and is therefore a property for describing how a version control system is accessed (protocol and host identifier). Numerous new protocol items would need to be created such as "git via SSH" to clarify the "ssh://" in the subproperty of (P1647) URI string. This Git Internals Transfer Protocols documentation page may also provide some assistance in describing the protocol for this "git via SSH" example. Pixeldomain (talk) 00:34, 8 January 2018 (UTC)Reply

Terminal-Applications edit

We have GUI toolkit or framework (P1414) to show with gui-toolkit a software is using. But how to show that a software is a commandline-application? We could use GUI toolkit or framework (P1414)novalue – but there are also gui-toolkits for the commandline like ncurses (Q310974). -- MichaelSchoenitzer (talk) 16:58, 15 January 2018 (UTC)Reply

Description of Maxthon (Q491674) edit

Hi there! In Maxthon (Q491674) I reverted yesterday en+pl descriptions added by @Jasc PL: freeware web browser for MS Windows and Mac OS X developed in China - spyware included! as I thought this version is not neutral. Hovewer, Jasc PL stated that spyware component of Maxthon is a key feature of this web browser and this information is sourced ([1], link added on the item talk page). I'm not so familiar with software/informatics WD rules and that's why I'm asking here, how this description should be written – and if information about spyware shouldn't be placed in description, are there any other means (by properties and qualifiers) to indicate this information in the item? My discussion with Jasc PL is here and here, but it's in Polish. Wostr (talk) 14:53, 6 February 2018 (UTC)Reply

Maybe it should be added as something like described by source (P1343) with [2] as an item and of (P642) with spyware (Q14644)? Wostr (talk) 19:42, 13 February 2018 (UTC)Reply

Procedure to join a community. edit

How to join a wiki community and donate data. Souranil Singha (talk) 08:43, 24 February 2018 (UTC)Reply

modeling commands and keyboard shortcuts edit

What is the best way to model commands and keyboard shortcuts? My idea is to introduce:

  • a "command action" property with values like "focus, select/selection, move/motion, delete/deletion, add/addition, display...";
  • something like a "command object" property with values like "graphical interface elements, part of softwares (e.g. tab, history, addon, bookmark, etc for browsers), file, folder, environment variable, setting...".

Therefore, "cls" command would have "command action: clear" and "command object: screen". Problems could arise for more complex commands that perform multiple or complicated things. --Malore (talk) 22:22, 1 April 2018 (UTC)Reply

@Malore: Have you considered existing property has effect (P1542) being used with command/software items, pointing to a new item "clear screen"? Dhx1 (talk) 04:42, 18 April 2018 (UTC)Reply

Wikidata Infobox - Computing categories edit

  WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.

URGENT!
Hi Guys! I would like to inform you that we have the last chance to take a part in discuss about Commons infoboxes in context of computing at Wikidata Infobox - Computing categories. If we are interested in having influence of this project final score - please contribute your needs, proposals, ideas - as soon as possible at the Commons discuss page! Thanks. --Jasc PL (talk) 19:21, 17 April 2018 (UTC)Reply

What are the best modelled items for your area of interest? edit

Hi all

Over the past few months myself and others have been thinking about the best way to help people model subjects consistently on Wikidata and provide new contributors with a simple way to understand how to model content on different subjects. Our first solution is to provide some best practice examples of items for different subjects which we are calling Model items. E.g the item for William Shakespeare (Q692) is a good example to follow for creating items about playwright (Q214917). These model items are linked to from the item for the subject to make them easier to find and we have tried to make simple to understand instructions.

We would like subject matter experts to contribute their best examples of well modelled items. We are asking all the Wikiprojects to share with us the kinds of subjects you most commonly add information about and the best examples you have of this kind of item. We would like to have at least 5 model items for each subject to show the diversity of the subject e.g just having William Shakespeare (Q692) as a model item for playwright (Q214917), while helpful may not provide a good example for people trying to model modern poets from Asia.

You can add model items yourself by using the instructions at Wikidata:Model items. It may be helpful to have a discussion here to collate information first.

Thanks

John Cummings (talk) 15:08, 17 December 2018 (UTC)Reply

2019-08 Q5297 tree edit

FYI: https://tools.wmflabs.org/wikidata-todo/tree.html?q=Q5297&rp=279&lang=en Visite fortuitement prolongée (talk) 22:00, 28 August 2019 (UTC)Reply

TOP500: several supercomputers with the same name edit

Hi,

I'm running a bot to import data from TOP500 database. However, I've noticed several machines shares the same name (for example POWER CHALLENGE (Q69825182), but diffiers on the TOP500 identifier and location. I've merged few machines before noticing that, so, I want opinions if is better to merge them or not.

Thanks. --Amitie 10g (talk) 02:26, 12 October 2019 (UTC)Reply

  WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.

Common Information Model, WMI, WBEM etc... edit

Sorry for this possibly naive suggestion but I am new to this group.

Common Information Model is an open standard that defines how managed elements in an IT environment are represented as a common set of objects and relationships between them.

Two very common implementations are WMI and OpenLMI. They define hundreds of classes: CIM_Process, CIM_DataFile, CIM_Directory etc... and their properties. Some are already mentioned here wikipedia:CIM Schema. See for example:

I was wondering if there would be an interest to create these classes into Wikidata ? It would for example complement the Software Ontology (SWO), and provide a solid ontology.

ThanksRc1959 (talk) 05:23, 12 October 2019 (UTC)Reply

@Rc1959: IMHO, the usability of wikidata as a database is more into the instances instead of the class hierarchy themselves. The classes and their organizations are there as a means to support in connecting the instances of a class with instances of another classes. Pebaryan (talk) 16:10, 6 November 2019 (UTC)Reply
@Pebaryan: Thanks for the answer. On the other hand, some Wikidata definitions are nearly one-to-one equivalent to CIM data. For example: "process" Q205663 is equivalent to CIM_Process. "computer file" Q82753 is equivalent to CIM_DataFile etc... Rc1959 (talk) 16:25, 2 December 2019 (UTC)Reply
@Rc1959: Interesting. In that case, I would suggest to create a reusable identifier/reference property to link the existing classes to CIM. I have two concerns though. first, is there any stable URI for CIM Classes? and second, AFAIK the "CIM" is not only exclusively used in computing (from DMTF) but also it is used in electricity (from IEC). i'm just mention it here but i'm not sure if it's going to be an issue or not.
@Pebaryan:
"CIM…also it is used in electricity". There are two different "CIM" standards which only share the same names, but are completely different and are clearly identified as such, see: Common Information Model, Common Information Model (computing) and Common Information Model (electricity).
""is there any stable URI for CIM Classes"".
Yes, there are stable URIs per schema version, in the DMTF world, for example: https://schemas.dmtf.org/wbem/cim-html/2.51.0/CIM_Process.html
These class names are also mirrored on Microsoft documentation, in the directory of their WMI provider: https://docs.microsoft.com/en-us/windows/win32/cimwin32prov/cim-process
IBM also has its z/OS implementation:
https://www.ibm.com/support/knowledgecenter/en/SSLTBW_2.2.0/com.ibm.zos.v2r2.cfzu100/cim_process.htm
"a reusable identifier/reference property to link the existing classes to CIM"
This is indeed what I had in mind, but I am not sure of the best way to implement it.
Any example in mind, please ? Thanks ! Rc1959 (talk) 15:53, 13 December 2019 (UTC)Reply

Since last year, this MOF2RDF specification provides an approach to convert MOF-compliant metamodels, that is, CIM models, into OWL Ontologies and vice-versa: About the MOF to RDF Mapping Specification Version 1.0. (I do not know yet if there is a standard implementation). This would automatically provide RDF definitions of all objects defined in WMI and more generally in WBEM, that is, thousands of classes. Rc1959 (talk) 18:39, 16 November 2021 (UTC)Reply

There is a few of items with both of these statements, 53 to be exact according to this query:

SELECT ?item WHERE 
{
  ?item wdt:P31 wd:Q18343316 .
  ?item wdt:P361 wd:Q1348204 .
}
Try it!

However, that doesn't make sense IMO. If we take for instance cat (Q300918), then it is definitely true that cat (Q300918)instance of (P31)standard UNIX utility or command (Q18343316). However, it may be GNU cat, OpenBSD cat, Busybox cat or even the original Unix cat. part of (P361)GNU Core Utilities (Q1348204) should thus be removed from all of the items found by the aforementioned query. Eventually, we may also want to create specific Qitems for every common implementation of the Unix utilities. Could I get a sanity check on this before going forward with the removing?

--Adrijaned (talk) 13:40, 30 March 2020 (UTC)Reply

  WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.

grep (Q283302)instance of (P31)application programming interface (Q165194)
grep (Q283302)described by source (P1343)POSIX (Q14658)
GNU Grep (Q21282417)instance of (P31)GNU package (Q20825628)
GNU Grep (Q21282417)implementation of (P4428)grep (Q283302)
Note that POSIX (Q14658) is versioned, so there are potentially many different versions of the grep (Q283302) application programming interface (Q165194) corresponding to different versions of POSIX (Q14658). -- Dhx1 (talk) 23:04, 30 March 2020 (UTC)Reply
I agree: we need an item for every notable implementation as well as an item for the general concept. That's the only way to handle version numbers and different commandline flags. I also agree that application programming interface (Q165194) isn't suited for this. Not sure which is. Also don't forget to to link the implementations with the general item in both ways with different from (P1889) otherwise people won't notice that there are different items and might mix them up or try merge them. -- MichaelSchoenitzer (talk) 21:20, 31 March 2020 (UTC)Reply
@Adrijaned: @MichaelSchoenitzer: I think that generic items such as grep (Q283302) should at least be a subclass of technical specification (Q20819677). This is due to POSIX (Q14658) being a subclass of technical specification (Q20819677), and grep (Q283302) is a specification for software that is also part of POSIX (Q14658). Dhx1 (talk) 21:45, 31 March 2020 (UTC)Reply

Data types edit

  WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.

I'd like to discuss data types and digital representation of information in general, but there doesn't seem to be a corresponding sub-project for data, only separate pages for character encodings and the data structure infobox template, both of which may fit under a 'Data' umbrella. Should one be created?

In particular, I'm right now looking into what may be a bit of ontological confusion as to which items relate to digital data structures and which to the real-world information they represent. Some time ago, text (Q234460) was made a subclass of (P279) string (Q184754) which resulted in a loop as string (Q184754) was already a subclass of (P279) text (Q234460). The loop has since been resolved by the latter (older) sub-class statement being removed, but this sort of thing is bound to happen again, especially when the items involved aren't clearly (and correctly) labelled as well as described in several languages.

I believe that in general, the data structures or data types should not be sub-classes of the corresponding information items (an audio file format is not a sound) or the other way around (a picture is not an array of pixels), as the properties differ between the domains. They do however map to each other in certain respects, and they may have shared, intersecting sub-classes, especially with the commercial distribution of fully digital information products (e-books, streaming media services). The following class pairs deserve to be reviewed as to their mutual relationships (what sub-classes, instances, parts, or other properties go where):

So far I have no indication that computer font (Q1147633) and font (Q4868296) are confused as well (except that the former is a subclass of (P279) the latter, but that just may be ok). However, as we develop tests and criteria to eliminate confusion, the typography domain may of course benefit from those too. Maybe you can find other class pairs to investigate. --SM5POR (talk) 07:00, 22 June 2020 (UTC)Reply

I forgot to mention File formats and Filesystem as those pages may well fit under a 'Data' umbrella page too, but I don't propose moving anything as I'm unfamiliar with the consequences, and that is not the issue I'm concerned about right now anyway. --SM5POR (talk) 07:36, 22 June 2020 (UTC)Reply
By 'this project' I can only see that you are referring to Wikidata:WikiProject_Informatics (where I would suggest creating a 'Data' sub-project for the general topic of representing real-world information in digital data formats). Are you saying that file formats and file systems may even be outside the scope of 'Informatics'? I don't see why. We do need to coordinate with other projects to the extent we see issues with the types of information we discuss, but I think the general issue of representing information belongs in Informatics, from where we will find connections elsewhere as needed. I'd illustrate it with a table, the ultimate form of structured expression. --SM5POR (talk) 08:44, 22 June 2020 (UTC)Reply
I'm working on some related class queries in my User:SM5POR/Informatics#Data section, in case you want examples of what I'm referring to (such as what sub-classes are found under string (Q184754) and text (Q234460), respectively). As for characters, I assume any work on those could benefit from coordination with Wikidata:WikiProject_Alphabet. --SM5POR (talk) 08:19, 22 June 2020 (UTC)Reply
When I said, 'this project', I meant the sub-project Wikidata:WikiProject_Informatics/Data_Structure for handling cases like text, string. However, seeing the query results from User:SM5POR/Informatics#Data, I would suggest creating a sub-project in Wikidata:WikiProject_Informatics to talk about different data types. You can also take a look at sub-projects for file systems, file formats etc.: Wikidata:WikiProject_Informatics/Filesystem, Wikidata:WikiProject Informatics/File formats. John Samuel (talk) 09:40, 22 June 2020 (UTC)Reply

Questions regarding P31 edit

Hi, I saw that many softwares have their use (automotive navigation system (Q27731), media player software (Q210337), ...) stated in P31. I think that those items should be moved to has use (P366), leaving some basic items like software (Q7397) or mobile app (Q620615) as statement for P31. Moreover, F(L)OSSs have also instance of (P31) free software (Q341) as written here but I think that this is an incorrect use of P31 as being free software is a intrinsic characteristic of the software so I think that it should be moved to has characteristic (P1552). P31 is not a dump to describe an item. What do you think? --★ → Airon 90 08:36, 27 July 2020 (UTC)Reply

Regarding the usage of has characteristic (P1552), I don’t think this works at all. First, being free is not really an « intrinsic » characteristic of the software, you can release the exact same source code or binary, with the exact same features, as a free or not as a free software. This would not change the functioning of the software a bit. An intrinsic characteristic would be the fact that he has some feature. Second, to me has characteristic (P1552) is really very much related to instance of (P31). The difference is that « instance of » domain is a class of object with some characteristic, and has characteristic (P1552) is more the definition of a characteristic. There could basically always be a class associated to some characteristic, for example « bald person » could be associated to baldness (Q181391)     
Second, as always, there is a compromise in granularity, and many software types have their own dedicated classes, which have their own wikipedia articles, for example video editing software (Q1373429). You can note that there is a claim video editing software (Q1373429) so its usage ever can help saving statements and editing time. If you want to query all software that have some usage a query like
select ?software { ?software wdt:P31/wdt:P279* wd:Q7397 . ?software (wdt:P31/wdt:P279*)?/wdt:P366 wd:Q1154312 . }
Try it!
will find them, wether there is a class or not. I don’t think it’s really a good idea to forbid using such classes, they exists anyway. author  TomT0m / talk page 15:04, 27 July 2020 (UTC)Reply
Last, there may be cases where a « feature » property could conflict with « usage » in the case of software, so using classes might help. author  TomT0m / talk page 15:04, 27 July 2020 (UTC)Reply

malloc, printf, fopen ... and callable symbols. Which type ? edit

Any suggestion of an instance for callable and linkable symbols like malloc(), fopen(), etc ... in C libraries ? Same question for methods of C++ classes. Also, a similar type would be needed for Python functions and classes. It is close to subroutine (Q190686) but handled by linker (Q523796).

Rc1959 (talk) 08:09, 17 March 2022 (UTC)Reply

Comment from WikiProject Video Games edit

I asked about software modeling at WikiProject Video games and got a comment.

There is a lot to consider here. The video game project has a large number of contributors and in some important ways, profiling game software is comparable to profiling other kinds of software.

I do not have a deep insight to share right now but I did want to share the thought that I am thinking of making a comparison between the projects' data models. Bluerasberry (talk) 13:47, 22 March 2022 (UTC)Reply

Wikidata software profiling hackathon, June 6&8 edit

Those interested in software + Wikidata are invited to the Scholia Hackathon 6&8 June 2022.

WD:Scholia is a Wikidata front end which does scholarly profiling, and is best known as tool for browsing the WikiCite collection of WD:WikiProject Source Metadata.

An example Scholia profile for the software Stata (Q1204300) is

Anyone interested in examining any part of Wikidata connecting to software is welcome. Bluerasberry (talk) 20:36, 19 May 2022 (UTC)Reply

Modelling the input & output of functions and function-like software edit

  WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.

My first attempt at proposing two new properties for modelling the input & output of functions and function-like software has been rightfully rejected because "input" and "output" are too vague for property labels.

I have now drafted a new proposal, you can read the draft at: User:Push-f/invoked with & invocation returns (note that the proposal is in my user namespace ... it is not yet ready to be voted on).

I would very much welcome your feedback about this new proposal (ideally here :).

My draft currently suggests qualifying optional inputs with nature of statement (P5102)optional (Q59864995) for the lack of a better alternative, since quantity (P1114) has data type quantity and thus only allows numeric values. I think it would make sense to introduce a new "quanity of object" qualifier property of data type item, to allow statements such as:

  • Xinvoked withAquantity of objectone or zero (A?)
  • Xinvoked withAquantity of objectone or more (A+)
  • Xinvoked withAquantity of objectzero or more (A*)

Something else I am not quite certain about is how to model "either or", e.g. that a function takes either a string or an integer. There are union of (P2737) and disjoint union of (P2738) but these are properties and could not be used as the object.

What do you think about my draft and this qualifier idea?

Cheers, --Push-f (talk) 09:13, 15 November 2022 (UTC)Reply

To define ranges of numeric values, there are the properties lower limit (P5447) and upper limit (P5448). Not sure if a range would be applicable here though. I'd like to consider some other approach to the "quantity of object" idea, so I'm deferring this issue until later.
I suppose programming language functions could be modeled on mathematical functions, where the input values belong to a definition domain (P1568) or input set (P1851) and the output values to a codomain (P1571).
It's true that the property union of (P2737) cannot be used directly as a value (or even a qualifier) of a statement, but it's used to form combined class items such as goods and services (Q2897903) for use in a business context. I think you could define a similar "combined data type" item such as "string or integer" is the same fashion, though its of (P642) qualifier should be replaced by something else, say has part(s) (P527).
Note also that if you want to qualify either the string or the integer further in some way (such as requiring the integer to be non-negative), you should not put that qualifier on the definition domain (P1568) statement as it would be applied also to any string values, but you will probably have to create yet another item for that particular case ("string or non-negative integer").
If the function has multiple positional input parameters, use multiple data type declarations qualified with series ordinal (P1545); same thing with multiple output values if supported by the programming language (Lisp, Python). For keyword parameters, state the keyword somehow (you had another proposal about that, I think).
While optional (Q59864995) may be an acceptable attribute on the input side, I find it less appropriate on the output side, where a string value may be expected in most cases, but that could be described using a "string or novalue" combined class item. If such an item exists anyway, it could be used on the input side as well. Also, take a look at option type (Q7099015).
In particular, I intuitively object to the nature of statement (P5102) qualifier being assigned to a positional parameter, as it's almost as if that statement could simply have been left out. It may work with a keyword parameter though. SM5POR (talk) 07:14, 17 November 2022 (UTC)Reply

Multiple logic family concepts edit

I didn't find a WikiProject dedicated to electronics, so I guess the following question belongs here:

I have marked logic family (Q173359) as a Wikipedia article covering multiple topics (Q21484471) since the article en:Logic family states that the term refers to one of two related concepts (type of circuitry vs design methodology), though I haven't created items for those concepts yet. Do you agree that they are indeed distinct enough to warrant separate items, and is there maybe already an item for either concept without English labels that has been created for a WP article in some other language (so that it can be used right away rather than merged at some later time)? SM5POR (talk) 09:46, 18 November 2022 (UTC)Reply

Okay, I eventually found Wikidata:WikiProject Electronic Components but it seems dormant right now, no active participants. Then there is Wikidata:WikiProject Physics which is however focused on particle physics. Pinging @Vladimir Alexiev who I hope may have some idea on my question above. SM5POR (talk) 01:54, 21 November 2022 (UTC)Reply
I'm honored by your ping, but I don't think I'm any authority on this.
You seem to know a lot about electronics, so if you think these are 2 distinct items, go ahead and split them.
  • Pick one of them to correspond to the existing WP articles
  • Link both to each other using "different from"
  • fix the description: "monolithic digital integrated circuit devices" seems completely wrong to me
Vladimir Alexiev (talk) 13:10, 25 November 2022 (UTC)Reply
Thank you for responding! Well, I took a guess as I saw you mention IEEE Thesaurus (Q113673155) in a comment on Wikidata:WikiProject Electronic Components. In any case, I ended up creating two new items logic circuit technology family (Q115616667) and logic circuit design methodology (Q115616690), both with "logic family" as an alias, as I prefer the primary labels to be unique (I made up those labels; if they aren't perfect I invite anyone to fix them). I haven't re-linked any of the WP articles to either sub-item as most of them probably cover both topics as well. I have also added most of the technologies and methodologies as instances of either topic.
I have no idea what to replace the description "monolithic digital ..." with. SM5POR (talk) 07:33, 7 December 2022 (UTC)Reply

I/O method vs device edit

  WikiProject Informatics has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead.

Was there any discussion as to the desirability of limiting the properties input device (P479) and output device (P5196) (devices or methods) to just deal with hardware devices, thereby excluding user interface methods that are not hardware devices, such as command-line interface (Q189053)? I'm not necessarily opposed to the change, but since it affects property labels and descriptions in multiple languages as well as the constraints, I would have expected careful consideration of the implications such as the need for new properties to deal with the non-hardware methods, as well as notification of the groups affected by the change (listed in the input device (P479) property documentation under "This property is being used by", a list which for to me unknown reasons doesn't include WikiProject Informatics or even Wikidata:WikiProject Video games).

Wikidata:Database reports/Constraint violations/P479 and Wikidata:Database reports/Constraint violations/P5196 now each list a number of value type constraint violations, but I haven't had the opportunity to investigate to what extent they existed previously or have been added due to this action.

I'm not myself asking for an immediate reversal, but I would really like to see some comments on how this should be handled before we proceed in either direction. --SM5POR (talk) 11:26, 1 December 2022 (UTC)Reply

I merely adjusted the labels and constraints of both properties to match what was intended in their proposals, see Wikidata:Property_proposal/Archive/8#P479 and Wikidata:Property proposal/output method.
The broadening of these properties to user interfaces in general has not been discussed as far as I can tell. And I don't think it's a good idea because contrary to "input devices" software input methods can either be part of the software in question or an external software component, so I think it's better to express that with either has part(s) (P527) or uses (P2283) accordingly, perhaps qualified with object has role (P3831).
--Push-f (talk) 11:40, 1 December 2022 (UTC)Reply

XML-based formats identification pattern / digital signatures edit

Hi, I've a question about the identification of XML-based formats with file format identification pattern (P4152). Please take a look: Property talk:P4152#Value for XML-based formats. Dipsode87 (talk) 10:16, 13 December 2022 (UTC)Reply

Model monorepos? edit

I decided to make an item for the emotion monorepo, and link all the packages that it contain back to it (emotion (Q116036073)). Is this a good idea to do with other monorepos? RPI2026F1 (talk) 17:57, 5 January 2023 (UTC)Reply

"Model for", property for sciences and computing edit

If you see this please review the property proposal for model for (2) that can link a model or theory to the real world (or not) object it models. author  TomT0m / talk page 16:34, 15 April 2024 (UTC)Reply

Return to the project page "WikiProject Informatics".