Research in programming Wikidata/Academic institution
This research studies academic institutions as represented in Wikidata. Using SPARQL queries against Wikidata, the following tasks are solved:
- Building a list of all academic institutions;
- Building a histogram of academic institutions by date of foundation;
- Displaying all academic institutions around the world on a map.
In the course of the work, the coordinate location of a number of Wikidata objects (academic institutions) was filled in on Wikidata. Conclusions were drawn about the completeness and incompleteness of the information presented in Wikipedia and Wikidata: the number of academic institutions with a filled "coordinate location" property is reported. The years in which the largest number of academic institutions were founded were also identified.
List of academic institutions
Given:
- Item: academic institution (Q4671277).
- Property: instance of (P31).
Let's build a list of all academic institutions.
#added 2017-02
#List of `instances of` "academic institution"
SELECT ?academic_institution ?academic_institutionLabel
WHERE
{
?academic_institution wdt:P31 wd:Q4671277. #academic institution
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
SPARQL query, 3892 records.
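The query above can also be run outside the web interface. The following is a minimal Python sketch, assuming the public Wikidata Query Service endpoint at https://query.wikidata.org/sparql and the standard SPARQL JSON results format. The helper names `build_request_url` and `extract_rows` and the sample payload (including its item identifiers) are hypothetical; they only illustrate the shape of the data and are not actual query results.

```python
import json
from urllib.parse import urlencode

# Public SPARQL endpoint of the Wikidata Query Service (assumed here).
ENDPOINT = "https://query.wikidata.org/sparql"

QUERY = """
SELECT ?academic_institution ?academic_institutionLabel
WHERE
{
  ?academic_institution wdt:P31 wd:Q4671277. #academic institution
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
"""


def build_request_url(query: str) -> str:
    """Return a GET URL that asks the endpoint for JSON results."""
    return ENDPOINT + "?" + urlencode({"query": query, "format": "json"})


def extract_rows(payload: dict) -> list:
    """Turn a SPARQL JSON result into (item URI, label) pairs."""
    rows = []
    for binding in payload["results"]["bindings"]:
        uri = binding["academic_institution"]["value"]
        # The label variable may be absent in a row; fall back to "".
        label = binding.get("academic_institutionLabel", {}).get("value", "")
        rows.append((uri, label))
    return rows


# Hypothetical sample in the SPARQL JSON results format (not real output).
SAMPLE_PAYLOAD = json.loads("""
{
  "head": {"vars": ["academic_institution", "academic_institutionLabel"]},
  "results": {"bindings": [
    {"academic_institution": {"type": "uri", "value": "http://www.wikidata.org/entity/Q0000001"},
     "academic_institutionLabel": {"type": "literal", "value": "Example Institute"}},
    {"academic_institution": {"type": "uri", "value": "http://www.wikidata.org/entity/Q0000002"}}
  ]}
}
""")

if __name__ == "__main__":
    print(build_request_url(QUERY)[:60])
    for uri, label in extract_rows(SAMPLE_PAYLOAD):
        print(uri, label)
```

The sketch deliberately stops short of sending the request, so it can be tried without network access; the URL it builds could be fetched with any HTTP client.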
👍> The most complete and well-developed academic institutions on Wikimedia are: Balashov branch of Saratov State University, Moscow State Institute of Culture, Samara State Regional Academy.
👎> Almost empty and uninformative academic institutions were: Novorossiysk Polytechnic Institute, Rostov College of Arts, Omsk State Institute of Service.
The date of the founding of academic institutions in countries around the world
Let's build a histogram of academic institutions in different countries by date of foundation.
Given:
- Item: academic institution (Q4671277).
- Property: instance of (P31), inception (P571), country (P17).
#defaultView:BarChart
SELECT DISTINCT (SAMPLE(?year) AS ?year) (COUNT(?year) AS ?count) (SAMPLE(?_academicLabel) AS ?_academicLabel) WHERE {
?object wdt:P31 wd:Q4671277. #academic institution
?object wdt:P571 ?_inception. #date of foundation
BIND(str(YEAR(?_inception)) AS ?year) #year of foundation, computed once ?_inception is bound
?object wdt:P17 ?_country. #country
?_country rdfs:label ?_academicLabel.
FILTER((LANG(?_academicLabel)) = "en")
}
GROUP BY ?_inception ?_country
ORDER BY ?year ?_inception
As of May 2017, the SPARQL query returned 381 records. By December 2017 the number of records had grown to 1097, almost three times as many. The first histogram shows the data as of 18.05.2017.
According to the first histogram (May 2017), the largest number of academic institutions (15) was established in 2002. Let's build a new histogram for 07.12.2017.
According to the second histogram (December 2017), the years with the largest number of founded academic institutions (35) are 2002 and 2007.
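Because the query groups by inception date and country, a single year can be spread over several result rows. One way to find the peak years is to sum the counts per year after downloading the results. The Python sketch below assumes the rows are already available as (year, count) pairs; the function name `peak_years` and the sample rows are hypothetical and do not reproduce the actual 2017 results.

```python
from collections import Counter


def peak_years(rows):
    """Sum counts per year; return (max_total, sorted years reaching it)."""
    totals = Counter()
    for year, count in rows:
        totals[year] += count
    if not totals:
        return 0, []
    best = max(totals.values())
    return best, sorted(y for y, total in totals.items() if total == best)


# Hypothetical rows: one (year, count) pair per (inception date, country) group.
# Years are strings because the query wraps YEAR() in str().
SAMPLE_ROWS = [
    ("2002", 2), ("2002", 1),  # same year split over two groups
    ("2007", 3),
    ("1995", 1),
]

if __name__ == "__main__":
    print(peak_years(SAMPLE_ROWS))
```

Returning a list of years rather than a single year keeps ties visible, as with the two leading years in the December histogram.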
Academic institutions of Russia
Let's build a SPARQL query to find out how much information about Russian academic institutions is available in Wikidata.
Given:
- Item: academic institution (Q4671277).
- Property: instance of (P31), country (P17).
#Number of academic institutions in Russia, USSR and Russian Empire.
SELECT ?academic_institution ?academic_institutionLabel ?countryLabel WHERE {
?academic_institution wdt:P31 wd:Q4671277. #academic institution
?academic_institution wdt:P17 ?country. #country
FILTER(?country = wd:Q34266 || ?country = wd:Q15180 || ?country = wd:Q159).
#Russian Empire, USSR, Russia
SERVICE wikibase:label { bd:serviceParam wikibase:language "ru". }
}
SPARQL query, 27 records.
The category "Academic institutions in Russia" in English Wikipedia contains information about several dozen academic institutions located in Russia. Russian Wikipedia does not have pages associated with the concept of an academic institution. Thus, most of the information about academic institutions in Russia is found in English Wikipedia.
Academic institutions on the world map
Let's display academic institutions and higher educational institutions around the world on a map.
Given:
- Item: academic institution (Q4671277), higher educational institution (Q38723).
- Property: coordinate location (P625), instance of (P31).
#List of `coordinate location` "academic institution and higher educational institution"
#defaultView:Map
SELECT ?academic_institutionLabel ?academic_institutionDescription ?coord
WHERE
{
{ ?academic_institution wdt:P31 wd:Q4671277; wdt:P625 ?coord } UNION #coordinate location of academic institutions
{ ?academic_institution wdt:P31 wd:Q38723; wdt:P625 ?coord } #coordinate location of higher education institutions
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
SPARQL query, 1603 records.
Completeness of Wikidata: academic institutions around the world
According to English Wikipedia, an academic institution allows a person to obtain an academic degree or graduate qualification (such as Bachelor or Master) on completing their studies. Thus, all higher educational institutions can be considered academic institutions.
Statistics of universities:
- According to the Webometrics Ranking of World Universities[1], a world ranking of universities, there are more than 19000 universities in the world.
- According to the category "Universities in alphabetical order" of Russian Wikipedia, there are more than 2500 universities around the world.
As can be seen from the last SPARQL query, only 1603 academic and higher educational institutions have the coordinate location (P625) property filled in. This is because only a small share of the corresponding Wikidata objects, about 8%, are filled in correctly and fully.
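The "about 8%" figure can be checked with a short calculation. The Python sketch below assumes that the share is the 1603 records with coordinates divided by the roughly 19000 universities counted by Webometrics; this reading of the figure is an assumption, and `coverage_percent` is a hypothetical helper name.

```python
def coverage_percent(with_coordinates: int, total: int) -> float:
    """Percentage of `total` institutions that have coordinates."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * with_coordinates / total


RECORDS_WITH_COORDINATES = 1603   # result size of the map query
# "More than 19000" is a lower bound, so the computed share is an upper bound.
WEBOMETRICS_UNIVERSITIES = 19000

if __name__ == "__main__":
    share = coverage_percent(RECORDS_WITH_COORDINATES, WEBOMETRICS_UNIVERSITIES)
    print(f"{share:.1f}%")  # prints 8.4%
```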
A list of academic institutions with the empty property coordinate location
Let's list the academic institutions and higher educational institutions whose coordinate location (P625) property is empty, in order to show the incompleteness of Wikidata.
Given:
- Item: academic institution (Q4671277), higher educational institution (Q38723).
- Property: coordinate location (P625), instance of (P31).
#List of academic institutions and higher education institutions without a `coordinate location`
SELECT ?academic_institution ?academic_institutionLabel ?academic_institutionDescription
WHERE
{
{?academic_institution wdt:P31 wd:Q4671277} UNION #academic_institution
{?academic_institution wdt:P31 wd:Q38723} #higher education institution
MINUS { ?academic_institution wdt:P625 [] } . #exclude items that have a coordinate location
SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}
SPARQL query, 3058 records.
Incompleteness of Wikidata: academic institutions around the world
As can be seen from the SPARQL query, only 1603 academic institutions and higher educational institutions have the coordinate location property filled in. The data is shown in the version 1.0 image.
Let's improve this situation by filling in the "coordinate location" property for one hundred "academic institution" objects, thereby increasing the number of records returned by the SPARQL query from 1603 to 1703. The data is shown in the version 1.1 image.
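The effect of this edit on completeness can be estimated from the two record counts reported above. The following Python sketch assumes that the 1603 records with coordinates and the 3058 records without them together cover all matching items, and that the 3058 figure was measured before the hundred items were filled in. Both are assumptions (an item may appear more than once in a result set), and `share_with_coordinates` is a hypothetical helper.

```python
def share_with_coordinates(with_coord: int, without_coord: int) -> float:
    """Percentage of items that have a coordinate location."""
    total = with_coord + without_coord
    if total == 0:
        raise ValueError("no items")
    return 100.0 * with_coord / total


WITH_COORD_V1_0 = 1603  # map query, version 1.0
WITHOUT_COORD = 3058    # query for items lacking P625
FILLED = 100            # items whose coordinates were added

if __name__ == "__main__":
    before = share_with_coordinates(WITH_COORD_V1_0, WITHOUT_COORD)
    after = share_with_coordinates(WITH_COORD_V1_0 + FILLED, WITHOUT_COORD - FILLED)
    print(f"{before:.1f}% -> {after:.1f}%")  # prints 34.4% -> 36.5%
```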
Future work
- Find out the most popular language among academic institutions (property "language used"). For clarity, build a bubble chart.
- Find academic institutions with a website (property "official website"). Mark them on the map.
- Rank academic institutions by the number of awards recorded in Wikidata (property "award received").
Tests
SPARQL-queries with answers:
- Logos of academic institutions
- The dates of the founding of academic institutions
- Academic institutions of Russia
Notes
References
- "Ranking of World Universities". Retrieved 2017-05-13.
- Krizhanovsky A., Horolskiy E. Foundation dates and geography of academic institutions // Authorea
- Krizhanovsky A., Horolskiy E. Foundation dates and geography of academic institutions // Nauchkor