Package 'WikidataR' reference manual

Title:	Read-Write API Client Library for Wikidata
Description:	Read from, interrogate, and write to Wikidata <https://www.wikidata.org> - the multilingual, interdisciplinary, semantic knowledgebase. Includes functions to: read from Wikidata (single items, properties, or properties); query Wikidata (retrieving all items that match a set of criteria via Wikidata SPARQL query service); write to Wikidata (adding new items or statements via QuickStatements); and handle and manipulate Wikidata objects (as lists and tibbles). Uses the Wikidata and QuickStatements APIs.
Authors:	Thomas Shafee [aut, cre] , Os Keyes [aut] , Serena Signorelli [aut], Alex Lum [ctb] , Christian Graul [ctb], Mikhail Popov [ctb]
Maintainer:	Thomas Shafee <[email protected]>
License:	MIT + file LICENSE
Version:	2.3.3
Built:	2025-03-20 20:41:42 UTC
Source:	https://github.com/ts404/wikidatar

Convert an input to a property PID

Description

Convert an input string to the most likely property PID

Usage

as_pid(x)
as_pid(x)

Arguments

`x`	a vector, data frame, or tibble of strings representing Wikidata properties

Value

If the inputted string is a valid PID, return the string. If the inputted string matches a property label, return its PID. If the inputted string matches multiple labels of multiple properties, return the PID of the first hit.

Examples

# if input string is a valid PID
as_pid("P50")
# if input string matches multiple item labels
as_pid("author")
# if input string matches a single unique label
as_pid("Scopus author ID")
# if input string is a valid PID
as_pid("P50")
# if input string matches multiple item labels
as_pid("author")
# if input string matches a single unique label
as_pid("Scopus author ID")

Convert an input to a item QID.

Description

Convert an input string to the most likely item QID.

Usage

as_qid(x)
as_qid(x)

Arguments

`x`	a vector, data frame, or tibble of strings representing wikidata items

Value

if the inputted string is a valid QID, return the string. If the inputted string matches an item label, return its QID. If the inputted string matches multiple labels of multiple items, return the QID of the first hit.

Examples

# if input string is a valid QID
as_qid("Q42")
# if input string matches multiple item labels
as_qid("Douglas Adams")
# if input string matches a single unique label
as_qid("Douglas Adams and the question of arterial blood pressure in mammals")
# if input string is a valid QID
as_qid("Q42")
# if input string matches multiple item labels
as_qid("Douglas Adams")
# if input string matches a single unique label
as_qid("Douglas Adams and the question of arterial blood pressure in mammals")

Add quotations marks

Description

Add escaped quotation marks around strings that need them ready for submission to an API.

Usage

as_quot(x, format = "tibble")
as_quot(x, format = "tibble")

Arguments

`x`	a vector, data frame, or tibble of strings
`format`	either "tibble" / "csv" to use plain quotation marks (default), or "api" / "website" to use '%22'

Value

A tibble of items inside of escaped quotation marks unless they are already in escaped quotation marks, is a QID, in which chase it is returned unchanged.

Examples

as_quot("text")
as_quot("text")

Convert an input to a source property SID

Description

Convert an input string to the most likely source SID (equivalent to PID.)

Usage

as_sid(x)
as_sid(x)

Arguments

`x`	a vector, data frame, or tibble of strings representing Wikidata source properties.

Value

if the inputted string is a valid SID, return the string. If the inputted string matches a property label, return its SID. If the inputted string matches multiple labels of multiple properties, return the SID of the first hit.

Examples

# if input string is a valid SID
as_pid("S854")
# if input string matches multiple item labels
as_pid("URL")
# if input string matches a single unique label
as_pid("Reference URL")
# if input string is a valid SID
as_pid("S854")
# if input string matches multiple item labels
as_pid("URL")
# if input string matches a single unique label
as_pid("Reference URL")

Generic input checker

Description

Utility function to handle namespaces. Used by get_item and get_property

Usage

check_input(input, substitution)
check_input(input, substitution)

Arguments

`input`	string to check
`substitution`	string for what's been looked for

Value

boolian indicating whether the checked string contains a match for the substitution string

"CREATE" rows

Description

Add in empty lines for QuickStatements CREATE rows that mint new QIDs. This is a slightly messy quirk of the QuickStatements format that mints new QIDs via a line containing only "CREATE", so this function is a way to approximate that behavior in a tibble

Usage

createrows(items, vector)
createrows(items, vector)

Arguments

`items`	a vector, data frame or tibble of items (which may or may not contain the keyword "CREATE")
`vector`	a vector of properties or values which may be expanded based on the items vector

Value

if the vector is NULL, return NULL. Otherwise, if the "CREATE" keyword appears in the items vector, insert blank strings at those positions in the vector.

"CREATE" rows from tidy format

Description

Add in QuickStatements CREATE rows that mint new QIDs from tidy input data. New items are created by any item starting that starts with the text "CREATE" followed by any unique ID.

Usage

createrows.tidy(QS.tib)
createrows.tidy(QS.tib)

Arguments

QS.tib

a tibble of items, values and properties (optionally qualifiers and sources).

Value

a tibble, with items that start with "CREATE" followed by any unique text causing the addition of a "Create" line above, being replaced with "LAST" in the QuickStatements format to create new QIDs.

Disambiguate QIDs

Description

Interactive function that presents alternative possible QID matches for a list of text strings and provides options for choosing between alternatives, rejecting all presented alternatives, or creating new items. Useful in cases where a list of text strings may have either missing Wikidata items or multiple alternative potential matches that need to be manually disambiguated. Can also used on lists of lists (see examples). For long lists of items, the process can be stopped partway through and the returned vector will indicate where the process was stopped.

Usage

disambiguate_QIDs(
  list,
  variablename = "variables",
  variableinfo = NULL,
  filter_property = NULL,
  filter_variable = NULL,
  filter_firsthit = FALSE,
  Q_min = NULL,
  auto_create = FALSE,
  limit = 10
)
disambiguate_QIDs(
  list,
  variablename = "variables",
  variableinfo = NULL,
  filter_property = NULL,
  filter_variable = NULL,
  filter_firsthit = FALSE,
  Q_min = NULL,
  auto_create = FALSE,
  limit = 10
)

Arguments

`list`	a list or vector of text strings to find potential QID matches to. Can also be a list of lists (see examples)
`variablename`	type of items in the list that are being disambiguated (used in messages)
`variableinfo`	additional information about items that are being disambiguated (used in messages)
`filter_property`	property to filter on (e.g. "P31" to filter on "instance of")
`filter_variable`	values of that property to use to filter out (e.g. "Q571" to filter out books)
`filter_firsthit`	apply filter to the first match presented or only if alternatives requested? (default = FALSE, note: true is slower if filter not needed on most matches)
`Q_min`	return only possible hits with QIDs above the provided value
`auto_create`	if no match found, automatically assign "CREATE"
`limit`	number of alternative possible Wikidata items to present if multiple potential matches

Value

a vector of:

QID: Selected QID (for when an appropriate Wikidata match exists)
CREATE: Mark that a new Wikidata item should be created (for when no appropriate Wikidata match yet exists)
NA: Mark that no Wikidata item is needed
STOP: Mark that the process was halted at this point (so that output can be used as input to the function later)

Examples

## Not run: 
#Disambiguating possible QID matches for these music genres
#Results should be:
# "Q22731" as the first match
# "Q147538" as the first match
# "Q3947" as the second alternative match
disambiguate_QIDs(list=c("Rock","Pop","House"),
                 variablename="music genre")

#Disambiguating possible QID matches for these three words, but not the music genres
#This will take longer as the filtering step is slower
#Results should be:
# "Q22731" (the material) as the first match
# "Q147538" (the soft drink) as the second alternative match
# "Q3947" (the building) as the first match
disambiguate_QIDs(list=c("Rock","Pop","House"),
                 filter_property="instance of",
                 filter_variable="music genre",
                 filter_firsthit=TRUE,
                 variablename="concept, not the music genre")

#Disambiguating possible QID matches for the multiple expertise of
#these three people as list of lists
disambiguate_QIDs(list=list(alice=list("physics","chemistry","maths"),
                           barry=list("history"),
                           clair=list("law","genetics","ethics")),
                 variablename="expertise")

## End(Not run)
## Not run: 
#Disambiguating possible QID matches for these music genres
#Results should be:
# "Q22731" as the first match
# "Q147538" as the first match
# "Q3947" as the second alternative match
disambiguate_QIDs(list=c("Rock","Pop","House"),
                 variablename="music genre")

#Disambiguating possible QID matches for these three words, but not the music genres
#This will take longer as the filtering step is slower
#Results should be:
# "Q22731" (the material) as the first match
# "Q147538" (the soft drink) as the second alternative match
# "Q3947" (the building) as the first match
disambiguate_QIDs(list=c("Rock","Pop","House"),
                 filter_property="instance of",
                 filter_variable="music genre",
                 filter_firsthit=TRUE,
                 variablename="concept, not the music genre")

#Disambiguating possible QID matches for the multiple expertise of
#these three people as list of lists
disambiguate_QIDs(list=list(alice=list("physics","chemistry","maths"),
                           barry=list("history"),
                           clair=list("law","genetics","ethics")),
                 variablename="expertise")

## End(Not run)

Extract claims from returned item data

Description

extract claim information from data returned using get_item.

Usage

extract_claims(items, claims)
extract_claims(items, claims)

Arguments

`items`	a list of one or more Wikidata items returned with `get_item`.
`claims`	a vector of claims (in the form "P321", "P12") to look for and extract.

Value

a list containing one sub-list for each entry in items, and (below that) the found data for each claim. In the event a claim cannot be found for an item, an NA will be returned instead.

Examples

# Get item data
adams_data <- get_item("42")
# Get claim data
claims <- extract_claims(adams_data, "P31")
# Get item data
adams_data <- get_item("42")
# Get claim data
claims <- extract_claims(adams_data, "P31")

Extract a paragraph of text

Description

Return the nth paragraph of a section of text Useful for extracting information from Wikipedia or other wikimarkup text

Usage

extract_para(text, para = 1, templ = NULL)
extract_para(text, para = 1, templ = NULL)

Arguments

`text`	the input text as a string
`para`	number indicating which paragraph(s) to return (default=1)
`templ`	an optional string specifying a mediawikitemplate within which to restrict the search restrict search

Value

the nth paragraph of the input text.

Filter QIDs

Description

For a QID or vector of QIDs, remove ones that match a particular statement (e.g. remove all that are instances of academic publications or books).

Usage

filter_qids(
  ids,
  property = "P31",
  filter = c("Q737498", "Q5633421", "Q7725634", "Q13442814", "Q18918145"),
  message = NULL
)
filter_qids(
  ids,
  property = "P31",
  filter = c("Q737498", "Q5633421", "Q7725634", "Q13442814", "Q18918145"),
  message = NULL
)

Arguments

`ids`	QIDs to check
`property`	property to check (default = P31 to filter on "instance of")
`filter`	values of that property to use to filter out (default = Q737498, Q5633421, Q7725634, Q13442814, and Q18918145 to remove academic publications or books)
`message`	message to return (useful for disambiguate_QIDs function)

Details

The Wikidata Q identifier (QID) is the unique identifier (UID) used in Wikidata.

Value

a vector of QIDs that do not match the property filter

Examples

## Not run: 
# Filter three items called "Earth Science" to show only those that aren't
# books, journals or journal articles
filter_qids(ids = c("Q96695546","Q8008","Q58966429"),
            property = "P31",
            filter = c("Q737498","Q5633421","Q7725634","Q13442814","Q18918145"))

## End(Not run)
## Not run: 
# Filter three items called "Earth Science" to show only those that aren't
# books, journals or journal articles
filter_qids(ids = c("Q96695546","Q8008","Q58966429"),
            property = "P31",
            filter = c("Q737498","Q5633421","Q7725634","Q13442814","Q18918145"))

## End(Not run)

Search for Wikidata items or properties that match a search term

Description

find_item and find_property allow you to retrieve a set of Wikidata items or properties where the aliases or descriptions match a particular search term. As with other WikidataR code, custom print methods are available; use str to manipulate and see the underlying structure of the data.

Usage

find_item(
  search_term,
  language = "en",
  limit = 10,
  response_language = "en",
  ...
)

find_property(
  search_term,
  language = "en",
  response_language = "en",
  limit = 10
)
find_item(
  search_term,
  language = "en",
  limit = 10,
  response_language = "en",
  ...
)

find_property(
  search_term,
  language = "en",
  response_language = "en",
  limit = 10
)

Arguments

`search_term`	A term to search for.
`language`	The language to return the labels and descriptions in; this should consist of an ISO language code. Defaults to `"en"`.
`limit`	The number of results to return; set to `10` by default.
`...`	further arguments to pass to dehttr:ink[httr::GET]GET.

Value

A list containing the result of the query.

Examples


#Check for entries relating to Douglas Adams in some way
adams_items <- find_item("Douglas Adams")

#Check for properties involving the peerage
peerage_props <- find_property("peerage")

#Check for entries relating to Douglas Adams in some way
adams_items <- find_item("Douglas Adams")

#Check for properties involving the peerage
peerage_props <- find_property("peerage")

Get an example SPARQL query from Wikidata

Description

Gets the specified example(s) from [SPARQL query service examples page](https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples) using [Wikidata's MediaWiki API](https://www.wikidata.org/w/api.php).

Usage

get_example(example_name)
get_example(example_name)

Arguments

example_name

the names of the examples as they appear on [this page](https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples)

Details

If you are planning on extracting multiple examples, please provide all the names as a single vector for efficiency.

Value

The SPARQL query as a character vector.

Examples

## Not run: 
sparql_query <- extract_example(c("Cats", "Horses"))
query_wikidata(sparql_query)
# returns a named list with two data frames
# one called "Cats" and one called "Horses"
sparql_query <- extract_example("Largest cities with female mayor")
cat(sparql_query)
query_wikidata(sparql_query)

## End(Not run)
## Not run: 
sparql_query <- extract_example(c("Cats", "Horses"))
query_wikidata(sparql_query)
# returns a named list with two data frames
# one called "Cats" and one called "Horses"
sparql_query <- extract_example("Largest cities with female mayor")
cat(sparql_query)
query_wikidata(sparql_query)

## End(Not run)

Get geographic entities based on a bounding box

Description

get_geo_box retrieves all geographic entities in Wikidata that fall between a bounding box between two existing items with geographic attributes (usually cities).

Usage

get_geo_box(
  first_city_code,
  first_corner,
  second_city_code,
  second_corner,
  language = "en",
  ...
)
get_geo_box(
  first_city_code,
  first_corner,
  second_city_code,
  second_corner,
  language = "en",
  ...
)

Arguments

`first_city_code`	a Wikidata item, or series of items, to use for one corner of the bounding box.
`first_corner`	the direction of `first_city_code` relative to `city` (eg "NorthWest", "SouthEast").
`second_city_code`	a Wikidata item, or series of items, to use for one corner of the bounding box.
`second_corner`	the direction of `second_city_code` relative to `city` (eg "NorthWest", "SouthEast").
`language`	the two-letter language code to use for the name of the item. "en" by default.
`...`	further arguments to pass to dehttr:ink[httr::GET]GET.

Value

a data.frame of 5 columns:

item the Wikidata identifier of each object associated with entity.
name the name of the item, if available, in the requested language. If it is not available, NA will be returned instead.
latitude the latitude of item
longitude the longitude of item
entity the entity the item is associated with (necessary for multi-entity queries).

Examples

# Simple bounding box
bruges_box <- get_geo_box("Q12988", "NorthEast", "Q184287", "SouthWest")

# Custom language
bruges_box_fr <- get_geo_box("Q12988", "NorthEast", "Q184287", "SouthWest",
                                        language = "fr")

# Simple bounding box
bruges_box <- get_geo_box("Q12988", "NorthEast", "Q184287", "SouthWest")

# Custom language
bruges_box_fr <- get_geo_box("Q12988", "NorthEast", "Q184287", "SouthWest",
                                        language = "fr")

Retrieve geographic information from Wikidata

Description

get_geo_entity retrieves the item ID, latitude and longitude of any object with geographic data associated with another object with geographic data (example: all the locations around/near/associated with a city).

Usage

get_geo_entity(entity, language = "en", radius = NULL, limit = 100, ...)
get_geo_entity(entity, language = "en", radius = NULL, limit = 100, ...)

Arguments

`entity`	a Wikidata item (`Q...`) or series of items, to check for associated geo-tagged items.
`language`	the two-letter language code to use for the name of the item. "en" by default, because we're imperialist anglocentric westerners.
`radius`	optionally, a radius (in kilometers) around `entity` to restrict the search to.
`limit`	the maximum number of results to return.
`...`	further arguments to pass to dehttr:ink[httr::GET]GET.

Value

a data.frame of 5 columns:

item the Wikidata identifier of each object associated with entity.
name the name of the item, if available, in the requested language. If it is not available, NA will be returned instead.
latitude the latitude of item
longitude the longitude of item
entity the entity the item is associated with (necessary for multi-entity queries).

Examples

# All entities
sf_locations <- get_geo_entity("Q62")

# Entities with French, rather than English, names
sf_locations <- get_geo_entity("Q62", language = "fr")

# Entities within 1km
sf_close_locations <- get_geo_entity("Q62", radius = 1)

# Multiple entities
multi_entity <- get_geo_entity(entity = c("Q62", "Q64"))

# All entities
sf_locations <- get_geo_entity("Q62")

# Entities with French, rather than English, names
sf_locations <- get_geo_entity("Q62", language = "fr")

# Entities within 1km
sf_close_locations <- get_geo_entity("Q62", radius = 1)

# Multiple entities
multi_entity <- get_geo_entity(entity = c("Q62", "Q64"))

Retrieve specific Wikidata items or properties

Description

get_item and get_property allow you to retrieve the data associated with individual Wikidata items and properties, respectively. As with other WikidataR code, custom print methods are available; use str to manipulate and see the underlying structure of the data.

Usage

get_item(id, ...)

get_property(id, ...)
get_item(id, ...)

get_property(id, ...)

Arguments

`id`	the ID number(s) of the item or property you're looking for. This can be in various formats; either a numeric value ("200"), the full name ("Q200") or even with an included namespace ("Property:P10") - the function will format it appropriately. This function is vectorized and will happily accept multiple IDs.
`...`	further arguments to pass to dehttr:ink[httr::GET]GET.

Examples


#Retrieve a specific item
adams_metadata <- get_item("42")

#Retrieve a specific property
object_is_child <- get_property("P40")

#Retrieve a specific item
adams_metadata <- get_item("42")

#Retrieve a specific property
object_is_child <- get_property("P40")

Get names of properties

Description

For a claim or set of claims, return the names of the properties

Usage

get_names_from_properties(properties)
get_names_from_properties(properties)

Arguments

properties

a claims list from extract_claims

Value

tibble of labels for each property for a set of claims

Retrieve randomly-selected Wikidata items or properties

Description

get_random_item and get_random_property allow you to retrieve the data associated with randomly-selected Wikidata items and properties, respectively. As with other WikidataR code, custom print methods are available; use str to manipulate and see the underlying structure of the data.

Usage

get_random_item(limit = 1, ...)

get_random_property(limit = 1, ...)
get_random_item(limit = 1, ...)

get_random_property(limit = 1, ...)

Arguments

`limit`	how many random items to return. 1 by default, but can be higher.
`...`	arguments to pass to dehttr:ink[httr::GET]GET.

Examples

## Not run: 
#Random item
random_item <- get_random_item()

#Random property
random_property <- get_random_property()

## End(Not run)
## Not run: 
#Random item
random_item <- get_random_item()

#Random property
random_property <- get_random_property()

## End(Not run)

Identifier from identifier

Description

Convert unique identifiers to other unique identifiers.

Usage

identifier_from_identifier(
  property = "ORCID iD",
  return = "IMDb ID",
  value = "0000-0002-7865-7235"
)
identifier_from_identifier(
  property = "ORCID iD",
  return = "IMDb ID",
  value = "0000-0002-7865-7235"
)

Arguments

`property`	The identifier property to search (for caveats, see `as_pid`)
`return`	The identifier property to convert to
`value`	The identifier value to match.

Value

A vector of identifiers corresponding to identifiers submitted.

Examples

identifier_from_identifier(property ='ORCID iD',
                           return = 'IMDb ID',
                           value = c('0000-0002-7865-7235','0000-0003-1079-5604')
                           )
identifier_from_identifier(property ='ORCID iD',
                           return = 'IMDb ID',
                           value = c('0000-0002-7865-7235','0000-0003-1079-5604')
                           )

Format short form person names

Description

Converting names into first initial and surname, or just initials

Usage

initials(x, format = "FLast")
initials(x, format = "FLast")

Arguments

`x`	a vector of people's names as strings
`format`	a vector of strings of either "FLast" or "FL" to indicate the output format

Value

the inputted name strings with first names shortened based on the selected format.

List properties of a Wikidata item

Description

for a downloaded wikidata item, list the properties of all statements

Usage

list_properties(item, names = FALSE)
list_properties(item, names = FALSE)

Arguments

`item`	a list of one or more Wikidata items returned with `get_item`.
`names`	a boolian for whether to return property names, or just P numbers and extract.

Value

a list containing one sub-list for each entry in items, and (below that) the found data for each claim. In the event a claim cannot be found for an item, an NA will be returned instead.

Examples

# Get item data
adams_data <- get_item("42")
# Get claim data
claims <- extract_claims(adams_data, "P31")
# Get item data
adams_data <- get_item("42")
# Get claim data
claims <- extract_claims(adams_data, "P31")

Print method for find_item

Description

print found items.

Usage

## S3 method for class 'find_item'
print(x, ...)
## S3 method for class 'find_item'
print(x, ...)

Arguments

`x`	find_item object with search results
`...`	Arguments to be passed to methods

Print method for find_property

Description

print found properties.

Usage

## S3 method for class 'find_property'
print(x, ...)
## S3 method for class 'find_property'
print(x, ...)

Arguments

`x`	find_property object with search results
`...`	Arguments to be passed to methods

Print method for Wikidata objects

Description

print found objects generally.

Usage

## S3 method for class 'wikidata'
print(x, ...)
## S3 method for class 'wikidata'
print(x, ...)

Arguments

`x`	Wikidata object from get_item, get_random_item, get_property or get_random_property
`...`	Arguments to be passed to methods

QID from DOI

Description

simple converter from DOIs to QIDs (for items in Wikidata)

Usage

qid_from_DOI(DOI = "10.15347/WJM/2019.001")
qid_from_DOI(DOI = "10.15347/WJM/2019.001")

Arguments

DOI

digital object identifiers submitted as strings

Value

vector of QIDs corresponding to DOIs submitted

QID from identifier

Description

Convert unique identifiers to QIDs (for items in Wikidata).

Usage

qid_from_identifier(
  property = "DOI",
  value = c("10.15347/WJM/2019.001", "10.15347/WJM/2020.002")
)
qid_from_identifier(
  property = "DOI",
  value = c("10.15347/WJM/2019.001", "10.15347/WJM/2020.002")
)

Arguments

`property`	The identifier property to search (for caveats, see `as_pid`.)
`value`	The identifier value to match.

Details

The Wikidata Q identifier (QID) is the unique identifier (UID) used in Wikidata.

Value

A vector of QIDs corresponding to identifiers submitted.

Examples

qid_from_identifier('ISBN-13','978-0-262-53817-6')
qid_from_identifier('ISBN-13','978-0-262-53817-6')

QID from label name

Description

simple converter from label names to QIDs (for items in wikidata). Essentially a simplification of find_item

Usage

qid_from_name(name = "Thomas Shafee", limit = 100, format = "vector")
qid_from_name(name = "Thomas Shafee", limit = 100, format = "vector")

Arguments

`name`	name labels submitted as strings
`limit`	if multiple QIDs match each submitted name, how many to return
`format`	output format ('vector' to return a simple vector, or 'list' to return a nested list)

Value

vector of QIDs corresponding to names submitted. Note: some names may return multiple QIDs.

QID from ORCID

Description

simple converter from ORCIDs to QIDs (for items in wikidata)

Usage

qid_from_ORCID(ORCID = "0000-0002-2298-7593")
qid_from_ORCID(ORCID = "0000-0002-2298-7593")

Arguments

ORCID

digital object identifiers submitted as strings

Value

vector of QIDs corresponding to ORCIDs submitted

Send one or more SPARQL queries to WDQS

Description

Makes a POST request to Wikidata Query Service SPARQL endpoint.

Usage

query_wikidata(sparql_query, format = "tibble", ...)
query_wikidata(sparql_query, format = "tibble", ...)

Arguments

`sparql_query`	SPARQL query (can be a vector of queries)
`format`	'tibble' (default) returns a pure character data frame, 'simple' returns a pure character vector, while 'smart' fetches JSON-formatted data and returns a tibble with datetime columns converted to 'POSIXct'.
`...`	Additional parameters to supply to `httr:POST`.

Value

A 'tibble' or 'vector'. Note: QID values will be returned as QIDs, rather than URLs.

Query limits

There is a hard query deadline configured which is set to 60 seconds. There are also following limits: - One client (user agent + IP) is allowed 60 seconds of processing time each 60 seconds - One client is allowed 30 error queries per minute See query limits section in the Wikidata Query Service User Manual for more information.

Examples

# R's versions and release dates:
sparql_query <- 'SELECT DISTINCT
  ?softwareVersion ?publicationDate
  WHERE {
    BIND(wd:Q206904 AS ?R)
    ?R p:P348 [
      ps:P348 ?softwareVersion;
      pq:P577 ?publicationDate
    ] .
}'
query_wikidata(sparql_query)

## Not run: 
# "smart" format converts all datetime columns to POSIXct
query_wikidata(sparql_query, format = "smart")

## End(Not run)
# R's versions and release dates:
sparql_query <- 'SELECT DISTINCT
  ?softwareVersion ?publicationDate
  WHERE {
    BIND(wd:Q206904 AS ?R)
    ?R p:P348 [
      ps:P348 ?softwareVersion;
      pq:P577 ?publicationDate
    ] .
}'
query_wikidata(sparql_query)

## Not run: 
# "smart" format converts all datetime columns to POSIXct
query_wikidata(sparql_query, format = "smart")

## End(Not run)

Convert an input to a item QID

Description

Convert an input string to the most likely item QID

Usage

searcher(search_term, language, limit, response_language, type, ...)
searcher(search_term, language, limit, response_language, type, ...)

Arguments

`search_term`	a term to search for.
`language`	the language to conduct the search in; this should consist of an ISO language code. Set to "en" by default.
`limit`	the number of results to return; set to 10 by default.
`response_language`	the language to return the labels and descriptions in; this should consist of an ISO language code. Set to "en" by default.
`type`	type of wikidata object to return (default = "item")
`...`	Additional parameters to supply to [httr::POST]

Value

If the inputted string matches an item label, return its QID. If the inputted string matches multiple labels of multiple items, return the QID of the first hit. If the inputted string is already a QID, return the string.

Examples

# if input string is a valid QID
as_qid("Q42")
# if input string matches multiple item labels
as_qid("Douglas Adams")
# if input string matches a single unique label
as_qid("Douglas Adams and the question of arterial blood pressure in mammals")
# if input string is a valid QID
as_qid("Q42")
# if input string matches multiple item labels
as_qid("Douglas Adams")
# if input string matches a single unique label
as_qid("Douglas Adams and the question of arterial blood pressure in mammals")

Download full Wikidata items matching a SPARQL query

Description

Utility wrapper for wikidata spargl endpoint to download items. Used by get_geo_entity and get_geo_box.

Usage

sparql_query(query, ...)
sparql_query(query, ...)

Arguments

`query`	The SPARQL query as a string
`...`	Additional parameters to supply to `httr:POST`.

Value

a download of the full wikidata objects formatted as a nested json list

Remove special characters

Description

Special characters can otherwise mess up wikidata read-writes

Usage

unspecial(x)
unspecial(x)

Arguments

`x`	a vector of strings to check for special characters

Value

the inputted strings with special characters replaced with closest match plan characters.

Extract an identifier from a wikidata URL

Description

Convert a URL ending in an identifier (returned by SPARQL queries) to just the plain identifier (QID or PID).

Convert a URL ending in an identifier (returned by SPARQL queries) to just the plan identifier (QID or PID).

Usage

url_to_id(x)

url_to_id(x)
url_to_id(x)

url_to_id(x)

Arguments

`x`	A vector of strings representing Wikidata URLs.

Details

The Wikidata Q identifier (QID) is the unique identifier (UID) used in Wikidata.

Value

if the URL ends in a QID or PID, return that PID or QID, else return the original string

QID or PID.

Examples

url_to_id("http://www.wikidata.org/entity/42")
url_to_id("http://www.wikidata.org/Q42")
url_to_id("http://www.wikidata.org/entity/42")
url_to_id("http://www.wikidata.org/Q42")

Download a Wikidata item

Description

Utility wrapper for Wikidata API to download item. Used by get_item and get_property.

Usage

wd_query(title, ...)
wd_query(title, ...)

Arguments

`title`	The Wikidata item or property as a string.
`...`	Additional parameters to supply to `httr:POST`.

Value

A downloaded full wikidata object (item or property) formatted as a nested json list.

Download random Wikidata items

Description

Utility wrapper for Wikidata API to download random items. Used by random_item.

Usage

wd_rand_query(ns, limit, ...)
wd_rand_query(ns, limit, ...)

Arguments

`ns`	string indicating namespace, most commonly "Main" for QID items, "Property" for PID properties.
`limit`	How many random object to return.
`...`	Additional parameters to supply to `httr:POST`.

Value

Downloaded full wikidata objects (items or properties) formatted as nested json lists.

Global variables for Wikidata properties

Description

A dataset of Wikidata global variables.

Format

A list of tibbles documenting key property constraints from Wikidata

SID.valid: valid reference source properties
PID.datatype: required data type for each property
PID.constraint: expected regex match for each property
lang.abbrev: language abbreviations
lang.abbrev.wiki: language abbreviations for current wikis
abbrev.wiki: Wikimedia abbreviations for current wikis

...

API client library for Wikidata

Description

This package serves as an API client for reading and writing to and from Wikidata, (including via the QuickStatements format), as well as for reading from Wikipedia.

Write statements to any Wikibase instance

Description

Upload data to a Wikibase instance, including creating items, adding statements to existing items (via the quickstatements format and API).

Usage

write_wikibase(
  items,
  properties = NULL,
  values = NULL,
  qual.properties = NULL,
  qual.values = NULL,
  src.properties = NULL,
  src.values = NULL,
  remove = FALSE,
  format = "tibble",
  format.csv.file = NULL,
  api.username = NULL,
  api.token = NULL,
  api.format = "v1",
  api.batchname = NULL,
  api.submit = TRUE,
  quickstatements.url = NULL,
  coordinate_pid = NULL
)
write_wikibase(
  items,
  properties = NULL,
  values = NULL,
  qual.properties = NULL,
  qual.values = NULL,
  src.properties = NULL,
  src.values = NULL,
  remove = FALSE,
  format = "tibble",
  format.csv.file = NULL,
  api.username = NULL,
  api.token = NULL,
  api.format = "v1",
  api.batchname = NULL,
  api.submit = TRUE,
  quickstatements.url = NULL,
  coordinate_pid = NULL
)

Arguments

`items`	a vector of strings indicating the items to which to add statements (as QIDs or labels). Note: In contrast to `write_wikidata`, this function takes no labels as input, just QIDs. New QIDs can be created by using the "CREATE_xyz", where "_xyz" is any unique string. Using the same id will add additional statements to those new items
`properties`	a vector of strings indicating the properties to add as statements (as PIDs or labels). Note: In contrast to `write_wikidata`, this function takes no labels as input, just PIDs. Four special properties can also be used: labels, aliases, descriptions and sitelinks. See [this link](https://www.wikidata.org/wiki/Help:QuickStatements#Adding_labels,_aliases,_descriptions_and_sitelinks) for the syntax.
`values`	a vector of strings indicating the values to add as statements (as QIDs). Note: if strings are provided, they will be treated as plain text.
`qual.properties`	a vector, data frame, or tibble of strings indicating the properties to add as qualifiers to statements (as PIDs).
`qual.values`	a vector, data frame, or tibble of strings indicating the values to add as statements (as QIDs or strings). Note: if strings are provided, they will be treated as plain text.
`src.properties`	a vector, data frame, or tibble of strings indicating the properties to add as reference sources to statements (as SIDs or labels). Note: if labels are provided, and multiple items match, the first matching item will be used (see `as_sid` function), so use with caution.
`src.values`	a vector, data frame, or tibble of strings indicating the values to add reference sources to statements (as QIDs or strings). Note: if strings are provided, they will be treated as plain text.
`remove`	a vector of boolians for each statemnt indicating whether it should be removed from the item rather than added (default = FALSE)
`format`	output format as a string. Options include: tibble easiest format to further manipulation in R csv can be copy-pasted to the Wikibase QuickStatements website (or manipulated in a spreadsheet programs). In contrast to write_wikidata function the delimiter is 'tab', because Quickstatements expect tab-separated data api a url that can be copy-pasted into a web browser, or automatically submitted (see `api.submit` parameter) website open a [QuickStatements](https://quickstatements.toolforge.org/) web browser window summarizing the edits to be made to Wikidata)
`format.csv.file`	path to save the csv file. If none is provided, then printed to console.
`api.username`	a string indicating your Wikimedia username
`api.token`	a string indicating your api token (the unique identifier that you can find listed at [your user page](https://quickstatements.toolforge.org/#/user))
`api.format`	a string indicating which version of the quickstatement format used to submit the api (default = "v1")
`api.batchname`	a string create a named batch (listed at [your batch history page](https://quickstatements.toolforge.org/#/batches)) and tag in the edit summaries
`api.submit`	boolian indicating whether to submit instruction directly to wikidata (else returns the URL that can be copy-pasted into a web browser)
`quickstatements.url`	url to access quickstatements of the corresponding Wikibase instance.
`coordinate_pid`	PID of a geocoordinates; need to have a different formatting

Value

data formatted to upload to Wikidata (via quickstatemsnts), optionally also directly uploaded to Wikidata (see format parameter).

Examples

# Add a statement to the "Wikidata sandbox" item (Q4115189)
# to say that it is an "instance of" (P31) of Q1 (the universe).
# The instruction will submit directly to Wikidata via the API
# (if you include your Wikibase/Wikimedia username and token)

## Not run: 
write_wikibase(
  items = "Q24",
  properties = "P2",
  values = "Q8",
  format = "api",
  api.username = "myusername",
  api.token = "mytoken",
  api.submit = TRUE,
  quickstatements.url = NULL
)

## End(Not run)
# note:

# Add a statement to the "Wikidata sandbox" item (Q4115189)
# to say that it is an "instance of" (P31) of Q1 (the universe).
# The instruction will submit directly to Wikidata via the API
# (if you include your Wikibase/Wikimedia username and token)

## Not run: 
write_wikibase(
  items = "Q24",
  properties = "P2",
  values = "Q8",
  format = "api",
  api.username = "myusername",
  api.token = "mytoken",
  api.submit = TRUE,
  quickstatements.url = NULL
)

## End(Not run)
# note:

Write statements to Wikidata

Description

Upload data to Wikidata, including creating items, adding statements to existing items (via the quickstatements format and API).

Usage

write_wikidata(
  items,
  properties = NULL,
  values = NULL,
  qual.properties = NULL,
  qual.values = NULL,
  src.properties = NULL,
  src.values = NULL,
  remove = FALSE,
  format = "tibble",
  api.username = NULL,
  api.token = NULL,
  api.format = "v1",
  api.batchname = NULL,
  api.submit = TRUE
)
write_wikidata(
  items,
  properties = NULL,
  values = NULL,
  qual.properties = NULL,
  qual.values = NULL,
  src.properties = NULL,
  src.values = NULL,
  remove = FALSE,
  format = "tibble",
  api.username = NULL,
  api.token = NULL,
  api.format = "v1",
  api.batchname = NULL,
  api.submit = TRUE
)

Arguments

`items`	a vector of strings indicating the items to which to add statements (as QIDs or labels). Note: if labels are provided, and multiple items match, the first matching item will be used (see `as_qid` function), so use with caution. New QIDs can be created by using the "CREATE_xyz", where "_xyz" is any unique string. Using the same id will add additional statements to those new items
`properties`	a vector of strings indicating the properties to add as statements (as PIDs or labels). Note: if labels are provided, and multiple items match, the first matching item will be used (see `as_pid` function), so use with caution. Four special properties can also be used: labels, aliases, descriptions and sitelinks. See [this link](https://www.wikidata.org/wiki/Help:QuickStatements#Adding_labels,_aliases,_descriptions_and_sitelinks) for the syntax.
`values`	a vector of strings indicating the values to add as statements (as QIDs or strings). Note: if strings are provided, they will be treated as plain text.
`qual.properties`	a vector, data frame, or tibble of strings indicating the properties to add as qualifiers to statements (as PIDs or labels). Note: if labels are provided, and multiple items match, the first matching item will be used (see `as_pid` function), so use with caution.
`qual.values`	a vector, data frame, or tibble of strings indicating the values to add as statements (as QIDs or strings). Note: if strings are provided, they will be treated as plain text.
`src.properties`	a vector, data frame, or tibble of strings indicating the properties to add as reference sources to statements (as SIDs or labels). Note: if labels are provided, and multiple items match, the first matching item will be used (see `as_sid` function), so use with caution.
`src.values`	a vector, data frame, or tibble of strings indicating the values to add reference sources to statements (as QIDs or strings). Note: if strings are provided, they will be treated as plain text.
`remove`	a vector of boolians for each statemnt indicating whether it should be removed from the item rather than added (default = FALSE)
`format`	output format as a string. Options include: tibble easiest format to further manipulation in R csv can be copy-pasted to [the QuickStatements website](https://quickstatements.toolforge.org/) (or manipulated in a spreadsheet programs) api a url that can be copy-pasted into a web browser, or automatically submitted (see `api.submit` parameter) website open a [QuickStatements](https://quickstatements.toolforge.org/) web browser window summarizing the edits to be made to Wikidata)
`api.username`	a string indicating your Wikimedia username
`api.token`	a string indicating your api token (the unique identifier that you can find listed at [your user page](https://quickstatements.toolforge.org/#/user))
`api.format`	a string indicating which version of the quickstatement format used to submit the api (default = "v1")
`api.batchname`	a string create a named batch (listed at [your batch history page](https://quickstatements.toolforge.org/#/batches)) and tag in the edit summaries
`api.submit`	boolian indicating whether to submit instruction directly to wikidata (else returns the URL that can be copy-pasted into a web browser)

Value

data formatted to upload to wikidata (via quickstatemsnts), optionally also directly uploaded to wikidata (see format parameter).

Examples

# Add a statement to the "Wikidata sandbox" item (Q4115189)
# to say that it is an "instance of" (P31) of Q1 (the universe).
# The instruction will submit directly to wikidata via the API
# (if you include your Wikimedia username and token)

## Not run: write_wikidata(items        = "Wikidata Sandbox",
               properties   = "instance of",
               values       = "Q1",
               format       = "api",
               api.username = "myusername", 
               api.token    = , #REDACTED#
               )
## End(Not run)
#note: 

# Add a statement to the "Wikidata sandbox" item (Q4115189)
# to say that it is an "instance of" (P31) of Q1 (the universe).
# The instruction will submit directly to wikidata via the API
# (if you include your Wikimedia username and token)

## Not run: write_wikidata(items        = "Wikidata Sandbox",
               properties   = "instance of",
               values       = "Q1",
               format       = "api",
               api.username = "myusername", 
               api.token    = , #REDACTED#
               )
## End(Not run)
#note:

Package 'WikidataR'

Help Index

Convert an input to a property PID

Description

Usage

Arguments

Value

Examples

Convert an input to a item QID.

Description

Usage

Arguments

Value

Examples

Add quotations marks

Description

Usage

Arguments

Value

Examples

Convert an input to a source property SID

Description

Usage

Arguments

Value

Examples

Generic input checker

Description

Usage

Arguments

Value

"CREATE" rows

Description

Usage

Arguments

Value

"CREATE" rows from tidy format

Description

Usage

Arguments

Value

Disambiguate QIDs

Description

Usage

Arguments

Value

Examples

Extract claims from returned item data

Description

Usage

Arguments

Value

Examples

Extract a paragraph of text

Description

Usage

Arguments

Value

Filter QIDs

Description

Usage

Arguments

Details

Value

Examples

Search for Wikidata items or properties that match a search term

Description

Usage

Arguments

Value

See Also

Examples

Get an example SPARQL query from Wikidata

Description

Usage

Arguments

Details

Value

See Also

Examples