Skip to content

Judgement Day

Wikipedia label by Gmhofmann on Commons

At the dawn of Wikidata, I wrote a tool called “Terminator”. Not just because I wanted to have one of my own, but as a pun on the term “term”, used in the database table name (“wb_term”) where Wikidata labels, descriptions, and aliases are stored. The purpose of the tool is to find important (by some definition) Wikidata items that lack a label in a specific language. This can be very powerful, especially in languages with low Wikidata participation; setting the label for teacher (Q37226) in a language will immediately allow all Wikidata items using that item (as an occupation, perhaps) to show that label. A single edit can improve hundreds or thousands of items, and make them more accessible in that language.

Well, Wikidata has grown a lot since I started that tool, and the Terminator didn’t cope well with the growth; it was limited to a handful of languages, and the daily update was compute intensive. Plus, the interface was slow and ugly. Time for a rewrite!

So without further ado, I present version 2 of the Terminator tool. Highlights:

  • Now covers all Wikidata languages
  • Get the top items with missing labels, descriptions, or Wikipedia articles
  • Sort items by total number of claims, external IDs, sitelinks, or a compound score
  • The database currently contains the top (by compound score) ~4.1 million items on Wikidata
  • Updated every 10 minutes
  • Search for missing labels in multiple languages (e.g. German, Italian, or Welsh)
  • Only show items that have labels in languages you know
  • Automatically hides “untranslatable” items (scientific articles, humans, Wikipedia-related pages such as templates and categories), unless you want those as well
  • Can use a SPARQL query to filter items (only shows items that match all the above, plus are in the SPARQL result, for results with <10K items or so)
  • Game mode (single, unsorted random result, more details, re-flows on mobile)

Please let me know through the usual channels about bugs and feature requests. I have dropped some functionality from the old version, such as data download; but that version is still linked form the new main page. Enjoy!

6 Comments

  1. Asaf Bartov wrote:

    Wonderful! Thank you very much, this is great!

    The addition of a simple SPARQL query (with ~2500 items) seems to hang, though.

    Wednesday, July 25, 2018 at 17:59 | Permalink
  2. click here wrote:

    I always was interested in this topic and still am, thank you for putting up.

    Friday, August 10, 2018 at 15:06 | Permalink
  3. click wrote:

    Great blog here! Additionally your web site lots up fast! What host
    are you the usage of? Can I am getting your affiliate hyperlink in your host?
    I wish my site loaded up as quickly as yours lol.

    Monday, August 13, 2018 at 13:46 | Permalink
  4. click Here wrote:

    Excellent blog here! Additionally your site rather a lot
    up fast! What web host are you the usage of?
    Can I get your affiliate hyperlink for your host? I wish my web site loaded up
    as quickly as yours lol

    Monday, August 13, 2018 at 14:55 | Permalink
  5. Visit wrote:

    Thanks on your marvelous posting! I definitely enjoyed reading it, you’re a great author.I will make
    certain to bookmark your blog and may come back from
    now on. I want to encourage you to definitely continue your great job, have a nice morning!

    Monday, August 13, 2018 at 16:07 | Permalink
  6. click Here wrote:

    You actually make it appear really easy with your presentation however I to find this matter to be
    actually something which I believe I’d never understand. It kind
    of feels too complex and very extensive for me. I’m taking a look ahead for your
    next put up, I will attempt to get the hold of it!

    Monday, August 13, 2018 at 16:38 | Permalink