User:Mahir256/lonelyitems

Since around June 2017, most of my activity on Wikidata has been merging items that have only one sitelink to another wiki (which Bene* has termed "lonely items" in one of his tools) with other items. As such, a chart showing where they occur is definitely helpful.

Where they occur on Wikidata is one thing:

[Graph: number of items (axis ticks from 50,000 to 300,000) by item ID (axis ticks from Q5M to Q100M in steps of 5M), broken down into the following categories:]
  •   non-lonely items
  •   lonely items
  •   items without sitelinks
  •   redirects
  •   (the remainder)

but where they come from is another, especially among Wikipedias:

(Note that, to make the tail of the full graph more readable, I have omitted the six Wikipedias with the most lonely items from these graphs, namely enwiki with 2,749,136 lonely items, cebwiki with 1,913,750, dewiki with 912,844, jawiki with 644,003, frwiki with 568,904, and ruwiki with 508,345, and have split the graph in two at the 80,000 lonely item mark.)
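The omission and split described in the note can be sketched roughly as follows. This is only an illustration, not the code actually used to draw the graphs: the function name `split_for_plotting` and its parameters are hypothetical, and sending a count of exactly `split_at` to the upper graph is an assumption. Because the only per-wiki counts quoted on this page are the six omitted ones, the demonstration uses smaller parameters so that this data alone exercises both halves of the split.

```python
def split_for_plotting(counts, omit_top=6, split_at=80_000):
    """Drop the `omit_top` wikis with the most lonely items, then split the
    remaining wikis into two groups at the `split_at` mark.

    Returns (omitted, upper, lower), each a list of (wiki, count) pairs
    sorted by descending count.
    """
    ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
    omitted, kept = ranked[:omit_top], ranked[omit_top:]
    # Assumption: a count exactly equal to split_at goes to the upper graph.
    upper = [(wiki, n) for wiki, n in kept if n >= split_at]
    lower = [(wiki, n) for wiki, n in kept if n < split_at]
    return omitted, upper, lower


# The six counts quoted in the note above.
quoted = {
    "enwiki": 2749136,
    "cebwiki": 1913750,
    "dewiki": 912844,
    "jawiki": 644003,
    "frwiki": 568904,
    "ruwiki": 508345,
}

# Demonstration only: omit the top two and split at 600,000.
omitted, upper, lower = split_for_plotting(quoted, omit_top=2, split_at=600_000)
print([wiki for wiki, _ in omitted])
print([wiki for wiki, _ in upper])
print([wiki for wiki, _ in lower])
```

With the default parameters (`omit_top=6`, `split_at=80_000`) and a full table of per-wiki counts, the same function would yield the two groups plotted here.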

[Graph 1: lonely items per Wikipedia (axis ticks from 100,000 to 500,000). Wikipedias shown, in the order they appear: it, zh, pl, es, nl, sv, pt, ar, uk, arz, ko, vi, sr, fa, ca, no, fi, cs, tr, id, hu, ce]

[Graph 2: lonely items per Wikipedia (axis ticks from 10,000 to 80,000). Wikipedias shown, in the order they appear: he, et, kk, ro, ms, da, lt, gl, ta, hy, new, bg, te, sk, hi, az, eu, el, hr, sl, th, ur, be, tg, sh, ky, my, lv, sq, min, eo, ka, mk, mr, ml, cy, uz, su, gu, ht, nn, bn, is, fy, jv, la, ku, zh_yue, lb, si, kn, pnb, bs, pa, tt, ne, tl, br, bo, sd, an, wa, am, sah, af, km, sw, ast, ba, mn, oc, mg, nds, glk, ps, gor, ckb, be_x_old, simple, hif, zh_min_nan, os, sn, cv, lrc, cdo, qu, ug, bar, mi, lmo, scn, sa, azb, or, gom, so, mhr, yi, war, pms, bat_smg, ga, ace, map_bms, pag, vo, diq, mai, als, sco, ia, mzn, hsb, shn, gd, li, hyw, dv, zh_classical, fo, lg, yo, xmf, as, bcl, frr, lo, tk, bpy, nah, nv, fiu_vro, gag, ln, lez, gan, kv, to, dty, szl, tyv, hak, myv, vls, tcy, pam, io, co, mrj, se, vec, nds_nl, gv, mt, pdc, sat, pcd, roa_tara, bh, om, koi, bjn, kab, fur, sc, rue, wuu, eml, ang, ary, nso, vep, udm, csb, nap, ksh, krc, atj, wo, ay, av, kaa, frp, lij, mdf, kbd, nrm, rm, stq, avk, ha, na, olo, pfl, xh, rw, nov, dsb, srn, ltg, got, ig, arc, inh, jbo, ab, mwl, kw, ext, lfn, ts, tet, lad, roa_rup, tn, sm, ilo, gn, crh, cbk_zam, ki, pap, zu, lbe, rn, zea, ny, bug, haw, kbp, ady, ban, bxr, bm, chr, pi, kg, bi, ti, ve, ks, tum, ie, ty, ak, tpi, chy, kl, st, ff, jam, ss, iu, dz, rmy, tw, xal, ik, pnt, ee, pih, ch, za, fj, cu, sg, cr, nqo, din, gcr, szy, smn, mnw, mh, mus, ho, awa]

Item statistics

  • Inspired by the graph at User:Succu
  • Both graphs are as of 14 November 2020, ~20:30 UTC
  • Non-lonely and lonely items are counted here. (In the comments is User:MisterSynergy's query for total items.)
  • Redirected items are counted using one of MisterSynergy's queries.
  • Items without sitelinks are inferred from lonely, non-lonely, total, and redirected items.
  • Deleted items are inferred from the total items and the sum of the rest.
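The two inferences in the list above amount to simple subtraction, which might look something like the sketch below. It rests on assumptions the list does not spell out: that the "total" for a bucket of item IDs counts every page that still exists (redirects included), and that deleted items are whatever is left of the bucket's ID span. The class name `BucketCounts`, its field names, and the sample numbers are all hypothetical and are not taken from the graphs.

```python
from dataclasses import dataclass


@dataclass
class BucketCounts:
    """Counts for one range of item IDs (all names here are hypothetical)."""
    width: int       # number of item IDs the bucket spans
    total: int       # pages that still exist; assumed to include redirects
    lonely: int      # items with exactly one sitelink
    non_lonely: int  # items with two or more sitelinks
    redirects: int   # redirected items


def infer_remaining(bucket):
    """Return (items_without_sitelinks, deleted_items) for a bucket."""
    # Whatever exists but is neither a redirect nor an item with sitelinks
    # is taken to be an item without sitelinks.
    without_sitelinks = (
        bucket.total - bucket.redirects - bucket.lonely - bucket.non_lonely
    )
    # Whatever part of the ID span no longer exists is taken to be deleted.
    deleted = bucket.width - bucket.total
    return without_sitelinks, deleted


# Made-up numbers, purely to show the arithmetic.
example = BucketCounts(
    width=250_000, total=240_000, lonely=30_000,
    non_lonely=150_000, redirects=10_000,
)
without_sitelinks, deleted = infer_remaining(example)
print(without_sitelinks, deleted)  # 50000 10000
```

If the total query excludes redirects instead, the redirect count would move from the first subtraction to the second.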

Some comments about the first graph (from 29 June 2018, ~14:15 UTC)

  • The large number of items without sitelinks around Q14M are mostly villages in China (thanks User:Liangent!).
  • The large number of items without sitelinks around Q23M are mostly biological compounds (thanks User:Putmantime, User:Andrawaag, User:Sebotic, and User:Gstupp!).
  • The large number of lonely items between Q30M and Q35M, along with those near Q50M, are due in part to additions of pages from various wikis, especially cebwiki (thanks User:GZWDer!).
  • The masses of items without sitelinks after around Q33M are due mostly to imports of scientific articles (thanks User:Daniel Mietchen and User:Harej!).

Some mysteries of the first graph (from 29 June 2018, ~14:15 UTC)

  • The purge of items near Q11.25M may have been items that were merged (since the redirect facility didn't exist before a certain point), but I recall a notice stating that items previously deleted due to merges were retroactively changed to redirects, so what else could have happened?
  • What were the purges near Q18M, Q27.5M, Q31.5M, and Q47.25M?
  • What has been purged since Q53.25M?