Wikidata Usage and Coverage in WMF Projects

The definition of an Article used here is: namespace = 0, no redirects.
The dashboard reports two different measures of Wikidata utilization in the WMF projects: (a) Wikidata usage, and (b) Wikidata Coverage.
Wikidata usage excludes the (S)itelinks usage aspect [see: L, T, O, X, D, and C usage aspects, wbc_entity_usage table in the Wikibase schema]. In this dashboard, data on Wikidata usage are reported in the following columns: Number of Articles that use WD and % of Articles that use WD.
Wikidata coverage includes only the (S)itelinks usage aspect [see: S usage aspect, wbc_entity_usage table in the Wikibase schema]. In this dashboard, data on Wikidata coverage are reported in the following columns: Number of Articles with Sitelinks and% of Articles with Sitelinks.
Explanation. The data presented here refer to Wikidata usage in a particular article of a particular project; i.e. the data refer to Wikidata usage in Infoboxes and Templates. We look into the available wbc_entity_usage tables and count the number of articles in each project that make any use of Wikidata at all. The percents are derived by dividing the number of articles that make use of Wikidata (in the aforementioned sense) by the total number of articles in a particular project.
Note. Only projects with any Wikidata usage or coverage present (as defined above) are considered.
Q: My project is not listed while I am almost certain that we use Wikidata? A: Please see Technical Notes below the table.




Contact: Goran S. Milovanovic, Data Scientist, WMDE
e-mail: goran.milovanovic_ext@wikimedia.de
IRC: goransm


Technical Notes. The usage and sitelinks statistics are obtained from the Wikidata Concepts Monitor. In fact, we search the Hive table produced by the WDCM_Sqoop_Clients.R Module which searches through the wbc_entity_usage tables across the client projects that have client-side Wikidata usage monitoring enabled. Important: This WDCM module updates the usage statistics on weekly basis and approximately five times a month. The reported statistics are thus a compromise between a weekly run across the Wikidata usage statistics and a daily run across the MediaWiki page tables for each project.
Then we iterate through the page tables for all WMF projects and sort out only the content (i.e. namespace is 0 + no redirects) pages. The rest is simply calculating proportions: how many pages in a particular project make any use of (S)itelinks for coverage statistics, and how many pages in a particular project make any use of any other Wikidata usage aspect. The data are finally filtered out from all projects in which we do not find any Wikidata usage.
Again, to understand the Wikidata usage aspects, check out the Wikibase Schema documentation.
If your project is not found in the table above, you need to check if the client-side Wikidata usage tracking is enabled. Also, it happens very rarely that the analytics-mysql cannot find the project\'s database on the expected shard.



Loading...

Note. Percents of articles that make use of Wikidata and percents of articles with Sitelinks, per WMF project type.


Loading...

Note. Percents refer to the count of articles that use WD relative to the total number of articles in a given Project Type.


Loading...

Note. The pie chart represents distribution of total WD usage (in % of total WD usage) across the Project Types.


Loading...

Note. The chart represents the top 20 Wikimedia Projects per WD usage.


Loading...

Note. The chart represents the top 20 Wikimedia Projects per proprotion of WD usage relative to the total number of articles in them.