User:RPI2026F1Bot/Task1

The main task of User:RPI2026F1Bot is to import information from anime and manga databases. This is a comprehensive list of what is being imported.

Bot Information edit

The bot's source code is located at https://github.com/PythonCoderAS/wikidata-anime-import.

The bot will scan all values of all properties in the list of scanned properties. When a value is found, the bot will create a statement for it if the property is on the list of properties the bot is allowed to create. All imported values that match existing statements will have a reference added to them, even if the bot will not create them.

Reference Spec edit

The bot adds these properties to the reference group created for every matched value:

Property Value
retrieved (P813) The current time in UTC when the bot went to create the reference
stated in (P248) The item of the database the bot imported the value from
reference URL (P854) The URL to the external item on the database. Should be the same value as the search formatter of the associated property.
<property ID> The external ID of the property it imported information from.

To prevent duplicating references, the bot will not add a reference if there exists a reference with the same stated in (P248) value, if there is a reference with reference URL (P854) and the bot-generated reference URL is a substring of that URL, or if the external ID of the database's property is present.

Anime edit

Manga edit

Databased Imported From edit

Database Property Contributes External Links Notes
MyAnimeList (Q4044680) MyAnimeList manga ID (P4087) Yes
AniList (Q86470198) AniList manga ID (P8731) Yes
MangaDex (Q110093307) MangaDex title ID (P10589) Yes May be deleted soon.
MangaUpdates (Q114730827) MangaUpdates manga ID (P11149) No
Anime-Planet (Q112180497) Anime-Planet manga ID (P11348) No
INKR Comics (Q115633593) INKR title ID (P11315) No

Properties Imported edit

Non-external ID properties imported edit

Property Creates Value Shared Notes Database Notes
genre (P136) No Will try to simplify values wherever possible. For example, if both romance anime and manga (Q15637310) and comedy anime and manga (Q15286013) are present, it will store them as romantic comedy anime and manga (Q15712145). There is similar behavior for comedy drama anime and manga (Q15712927).

Note: The bot does not actually modify the existing statements, only the internal representation used for checking which statements to add references to.

MyAnimeList (Q4044680)
No AniList (Q86470198) Uses the official genres rather than user-generated tags
No MangaDex (Q110093307)
No MangaUpdates (Q114730827)
No Anime-Planet (Q112180497)
Yes INKR Comics (Q115633593) Imports are allowed as this is an official, legitimate source.
intended public (P2360) Yes MyAnimeList (Q4044680)
AniList (Q86470198)
MangaDex (Q110093307)
MangaUpdates (Q114730827)
Anime-Planet (Q112180497)
start time (P580) No This usually means the start of serialization for most manga. MyAnimeList (Q4044680) Imports precisions between year and day depending on the number of fields the API returns
AniList (Q86470198) Imports precisions between year and day depending on the number of fields the API returns
MangaDex (Q110093307) Imports precision of year only
MangaUpdates (Q114730827) Imports precision of year only
Anime-Planet (Q112180497) Imports precision of year only
end time (P582) No This usually means the end of serialization for most manga. MyAnimeList (Q4044680) Imports precisions between year and day depending on the number of fields the API returns
AniList (Q86470198) Imports precisions between year and day depending on the number of fields the API returns
Anime-Planet (Q112180497) Imports precision of year only
country of origin (P495) No AniList (Q86470198)
MangaDex (Q110093307) Inferred based on the value of language of work or name (P407)
language of work or name (P407) No AniList (Q86470198) Inferred based on the value of country of origin (P495)
MangaDex (Q110093307)
hashtag (P2572) No AniList (Q86470198)
number of parts of this work (P2635) Yes Imports the number of volume (Q1238720). MyAnimeList (Q4044680)
AniList (Q86470198)
MangaDex (Q110093307)
Anime-Planet (Q112180497)
title (P1476) No AniList (Q86470198) Only imports titles for ja and en

External IDs edit

All external ID properties are always created.

Imported Specifically edit
Property Shared Notes Database Notes
MyAnimeList manga ID (P4087) AniList (Q86470198) A lot of values of AniList manga ID (P8731) are equal to MyAnimeList manga ID (P4087) + 30000 since Anilist initially copied MyAnimeList's database.
MangaDex (Q110093307)
AniList manga ID (P8731) MangaDex (Q110093307)
MangaUpdates manga ID (P11149) MangaDex (Q110093307) Contains both the old schema and new schema IDs. The bot will add a reference to the old schema if encountered.
BookWalker series ID (JP version) (P11259) MangaDex (Q110093307)
BookWalker series ID (global version) (P11363) MangaDex (Q110093307)
Anime-Planet manga ID (P11348) MangaDex (Q110093307) May contain redirects to the true ID
eBookJapan books ID (P11476) MangaDex (Q110093307)
Generally edit

Sources that have nonspecific external links (such as "related links", "link to raw", etc.) contribute towards a link pool. Then the bot uses regexes to extract IDs and sends the leftovers to described at URL (P973) (see #Adding described at URL (P973)).

Property Regex Notes
Niconico ID (P11176) seiga\.nicovideo\.jp/comic/(\d+)
BookWalker series ID (JP version) (P11259) (?:global\.)?bookwalker\.jp/(?:series|book)/(\d+)
BookWalker series ID (global version) (P11363)
INKR title ID (P11315) comics\.inkr\.com/title/(\d+)
Anime News Network manga ID (P1984) animenewsnetwork\.com/encyclopedia/manga\.php\?id=(\d+)
Media Arts Database ID (P7886) mediaarts-db\.bunka.go\.jp/id/C(\d+)
Bangumi subject ID (P5732) bgm\.tv/subject/(\d+)
AnimeClick manga ID (P5849) animeclick\.it/manga/(\d+)
eBookJapan books ID (P11476) ebookjapan\.yahoo\.co\.jp/books/(\d+)

Other edit

Archive URLs edit

If the bot ever runs into a URL for a property that is an archive.org URL, it will extract the information, update the target URL to the real URL, and set qualifiers archive URL (P1065), archive date (P2960), and reason for deprecated rank (P2241)link rot (Q1193907) as well as set the rank of the statement to deprecated.

Adding described at URL (P973) edit

A lot of anime and manga databases contain a collection of external links. Some of these links have properties associated with them, but many don't. Links that have no property associated with them will be added under the property described at URL (P973) if the property is whitelisted.

URL Blacklist edit

The bot will not add any URLs that match these domains/regex under described at URL (P973):

URL/Domain/Regex Reason
twitter.com Social Media Site, not applicable for an anime or manga
youtube.com Social Media Site, not applicable for an anime or manga
instagram.com Social Media Site, not applicable for an anime or manga
pixiv.com Social Media Site, not applicable for an anime or manga
pixiv.net Social Media Site, not applicable for an anime or manga