Wikidata offers copies of the available content for anyone to download.
Note that there are also several other methods for accessing structured content from Wikidata, which may not require a complete database dump.
There are several different kinds of data dumps available. Note that while JSON and RDF dumps are considered stable interfaces, XML dumps are not. Changes to the data formats used by stable interfaces are subject to the Stable Interface Policy.
JSON 덤프 (추천)
JSON dumps containing all Wikidata entities in a single JSON array can be found under https://dumps.wikimedia.org/wikidatawiki/entities/. The entities in the array are not necessarily in any particular order, e.g., Q2 doesn't necessarily follow Q1. The dumps are being created on a weekly basis.
This is the recommended dump format. Please refer to the JSON structure documentation for information about how Wikidata entities are represented.
Hint: Each entity object (data item or property) is placed on a separate line in the JSON file, so the file can be read line by line, and each line can be decoded separately as an individual JSON object.
Note that the files are using parallel compression, which means that some decompressors cannot reliably unpack the files. If you are using Windows you can use e.g. Bzip2.
JsonDumpReader is a PHP library for reading the dumps.
First, canonical RDF dumps using the Turtle and NTriples formats can be found under https://dumps.wikimedia.org/wikidatawiki/entities/. The mapping is described here. These full statements are noted as all.
Secondly, so called truthy dumps are provided. They use the nt format. They are in the same format as the full dumps, but limited to direct, truthy statements. Therefore, they do not contain meta data such as qualifier and references.
The complete dumps together contain all entity information in Wikidata with the exception of order (of aliases, of statements, etc.), which is not naturally represented in RDF. Simplified dumps encode statements that have no qualifiers as single RDF triples (references are omitted).
전체 XML 덤프본은 https://dumps.wikimedia.org/wikidatawiki/ 에서 찾으실 수 있습니다.
Warning: The format of the JSON data embedded in the XML dumps is subject to change without notice, and may be inconsistent between revisions. It should be treated as opaque binary data. It is strongly recommended to use the JSON or RDF dumps instead, which use canonical representations of the data!
추가덤프 (또는 변경덤프)도 또한 다운로드하여 사용하실 수 있습니다. 이러한 덤프들은 지난 24시간동안 추가된 데이터들을 포함하고 있으며, 모든 데이터베이스를 다운로드하여 데이터베이스를 사용하는 불편함을 줄여줍니다. 이러한 덤프들은 데이터베이스 전체본보다 훨씬 크기가 적습니다.
이러한 덤프들은 https://dumps.wikimedia.org/other/incr/wikidatawiki/ 에서 확인하실 수 있습니다.
오래된 JSON과 RDF 덤프
오래된 RDF와 JSON 덤프는 Internet Archive (Q461)에서 찾을 수 있습니다:
The data model can be looked up here. The data model describes the fundamental building blocks of Wikidata's data.
An overview over the schema of the database can be found at this page. (This is not the schema of the data in Wikidata.)