Wikidata

From Botanical Knowledge
Jump to: navigation, search
  • Wikidata aims to provide a centralized data repository that will feed all the wikipedias, as well as any other platform that wants to use the data. (Currently, each wikipedia - English, French, Russian, etc. - is maintained independently.)
  • Two main goals:
  1. Ease curation and support smaller wikipedias
  2. Enable queries like "What are the ten largest cities with a female mayor"?


Phase 1: Language links recently: every language links to every other language wikidata: one page for each entity, list representations in each language wikipedias pull language links from wikidata


Phase 2: Infoboxes currently: each article calls an infobox with values wikidata: centralize the values wikipedias will call the infobox, and populate with values from Wikidata


Phase 3: Queries

       currently: http://www.mediawiki.org/wiki/Extension:Wikibase/API


  • Why not use SMW?

i. Centralization

    Wikidata aims to do for data what WikiCommons does for images.

ii. Internationalization

    SMWs assumed to be of one language.
    Possible to build with I18n, but hairy.
    Wikidata has internationalization built in from the beginning.

iii. Reification

    http://meta.wikimedia.org/wiki/Wikidata/Data_model
    http://meta.wikimedia.org/wiki/Wikidata/Notes/Data_model_primer


  • Advantages of SMW:
  1. Querying capability - http://semantic-mediawiki.org/wiki/Help:Inline_queries#Parser_function_.23ask
  2. Semantic Results Format - http://www.semantic-mediawiki.org/wiki/Semantic_Result_Formats
  3. Mixing of structured and unstructured data.


  • Advantages of Wikidata:
  1. Built-in provenance


  • Can we satisfy our primary use case with SMW?

Probably, by using Approved Revs


  • Overlap of data models
  1. One page = one thing.
  2. Information is being stored, mutatis mutandis, as triples: <the subject of a wikipedia page> <has property> <some value>
  • Overlap of personnel: Overlap of personnel: Denny Vrandecic, Jeroen de Dauw, Markus Krotzsch