This notebook queries the FREYA PID Graph via Datacite's GraphQL API to retrieve works created by a person. It takes an ORCID URL as input which is used to filter for all works registered at Datacite and some registered at Crossref where 'creator.nameIdentifier
' matches the given ORCID URL. From the resulting list of works we output all DOIs.
# Prerequisites:
import requests # dependency to make HTTP calls
from benedict import benedict # dependency for dealing with json
The input for this notebook is an ORCID URL, e.g. 'https://orcid.org/0000-0003-2499-7741
'.
# input parameter
example_orcid="https://orcid.org/0000-0003-2499-7741"
We use it to query Datacite's GraphQL API for the person's metadata and all works connected to them. Since the API uses pagination, we need to loop through all pages to get the complete result set.
# Datacite's GraphQL endpoint for the FREYA PID Graph
DATACITE_GRAPHQL_API = "https://api.datacite.org/graphql"
# GraphQL query to retrieve a person and all their created works
QUERY_PERSON2WORKS = """query person($orcid :ID!, $after:String){
person(id: $orcid) {
works(first:1000, after: $after) {
pageInfo {
endCursor
hasNextPage
}
nodes {
doi
titles {
title
}
versions {
nodes {
doi
}
}
}
}
}
}"""
# query for all works connected to given ORCID
def query_freya_for_person2works(orcid):
continue_paginating = True
cursor=""
while continue_paginating:
vars = {'orcid': orcid, 'after': cursor}
response = requests.post(url=DATACITE_GRAPHQL_API,
json={'query': QUERY_PERSON2WORKS, 'variables': vars},
headers={'Accept': 'application/json'})
response.raise_for_status()
result=response.json()
if 'errors' in result:
raise requests.exceptions.HTTPError(result)
# check if next page exists and set cursor to next page
cursor = next_cursor(result)
continue_paginating = has_next_page(result)
yield result
# check if there is another page with results to query
def has_next_page(response_data):
resp_dict = benedict.from_json(response_data)
has_next_page = resp_dict.get("data.person.works.pageInfo.hasNextPage")
return has_next_page
# set cursor to next value
def next_cursor(response_data):
resp_dict = benedict.from_json(response_data)
cursor = resp_dict.get("data.person.works.pageInfo.endCursor")
return cursor
#--- example execution
list_of_pages=query_freya_for_person2works(example_orcid)
From the returned pages we
versions.nodes.doi
' contains a DOI for the successing work,Note:
While we are able to filter some versions of a work if they are linked via the metadata field 'versions.nodes.doi
', others would need advanced filters (for example based on name similarity) which is out of scope for our project.
# from the result pages we get from the GraphQL API, extract the data about the works
def extract_works_from_page(page):
page_dict=benedict.from_json(page)
return [work for work in page_dict.get('data.person.works.nodes') or []]
# remove old versions from the list of works
def filter_older_versions(works):
return [work for work in works if not benedict.from_json(work).get('versions.nodes[0].doi')]
# extract DOI from work
def extract_doi(work):
work_dict = benedict.from_json(work)
doi = work_dict.get('doi')
title = work_dict.get('titles[0].title')
return doi, title
#--- example execution
for page in list_of_pages or []:
works=extract_works_from_page(page)
print(f"Complete number of works: {len(works)}")
works_filtered=filter_older_versions(works)
print(f"Filtered number of works: {len(works_filtered)}")
for work in works_filtered or []:
doi, title = extract_doi(work)
print(f"{doi}, {title}")
Complete number of works: 105 Filtered number of works: 91 10.6084/m9.figshare.647329, Members of Deutscher Bibliotheksverband e. V. (dbv) 10.2314/coscv1.1, Literatur recherchieren und verwalten 10.2314/coscv1.2, Organisieren 10.2314/coscv1.3, Daten sammeln und verarbeiten 10.2314/coscv1, CoScience - Gemeinsam forschen und publizieren mit dem Netz 10.2314/coscv2, CoScience - Gemeinsam forschen und publizieren mit dem Netz 10.2314/coscv2.2, Oganisieren 10.2314/coscv2.3, Daten sammeln und verarbeiten 10.6084/m9.figshare.647329.v1, Members of Deutscher Bibliotheksverband e. V. (dbv) 10.6084/m9.figshare.5271943, TIB-FIS-Discovery - VIVO at the German National Library of Science and Technology (TIB) 10.6084/m9.figshare.5271943.v1, TIB-FIS-DISCOVERY VIVO AT THE GERMAN NATIONAL LIBRARY OF SCIENCE AND TECHNOLOGY (TIB) 10.6084/m9.figshare.5271943.v2, TIB-FIS-Discovery - VIVO at the German National Library of Science and Technology (TIB) 10.6084/m9.figshare.5285743, Lost in translation – challenges of tailoring VIVO to the needs of the German scholarly landscape 10.6084/m9.figshare.5285743.v1, Lost in translation – challenges of tailoring VIVO to the needs of the German scholarly landscape 10.6084/m9.figshare.6465149, User perceptions, feedback, and stories.pdf 10.6084/m9.figshare.6465149.v1, User perceptions, feedback, and stories.pdf 10.5281/zenodo.1287885, Supporting A Vivo Regional Community 10.6084/m9.figshare.6819971, Information Security Challenges in VIVO - Adapting the BSI IT Security Catalog Standards 10.6084/m9.figshare.6819971.v1, VIVO_Conference_2018_BSI-IT-Security.pdf 10.6084/m9.figshare.6820217, Challenges and opportunities of using VIVO as a reporting tool 10.6084/m9.figshare.6820217.v1, Challenges and opportunities of using VIVO as a reporting tool 10.6084/m9.figshare.6819971.v2, Information Security Challenges in VIVO - Adapting the BSI IT Security Catalog Standards 10.6084/m9.figshare.6819722.v2, VIVO-DE: Collaborative ontology editing & management with VoCol 10.5281/zenodo.1445521, Vivo 2018 - Ein Rückblick 10.5281/zenodo.1464108, Vivo - Eine Einführung 10.5281/zenodo.1464897, Reporting / Kdsf Und Vivo 10.5281/zenodo.1478571, Forschungsevaluation Und Visualisierung Von Zitationsnetzwerken 10.6084/m9.figshare.6819722, VIVO-DE: Collaborative ontology editing & management with VoCol 10.6084/m9.figshare.6819722.v1, VIVO-DE: Collaborative ontology editing & management with VoCol 10.5281/zenodo.1689951, Referenzimplementierung für offene szientometrische Indikatoren (ROSI) 10.5281/zenodo.2613539, ROSI – Reference Implementation for Open Scientometric Indicators 10.5281/zenodo.2615675, The ROSI Project 10.5281/zenodo.2639714, VIVO software 1.10.0 release 10.5281/zenodo.2639713, VIVO software 1.10.0 release 10.5281/zenodo.3242680, ROSI – Open Metrics for Open Repositories 10.5281/zenodo.3243485, Registry of [Open] Scientometric Data Sources – Technische Evaluierung von Offenen Datenquellen 10.6084/m9.figshare.9756770, Identifying Ontological Domains Related to VIVO 10.6084/m9.figshare.9756770.v1, Identifying Ontological Domains Related to VIVO 10.6084/m9.figshare.9771701, VIVO Ontology Version 2 10.6084/m9.figshare.9771701.v1, VIVO Ontology Version 2 10.5281/zenodo.3407681, Visualising open scientometric data in VIVO 10.5281/zenodo.3407680, Visualising open scientometric data in VIVO 10.6084/m9.figshare.9897008, A Library of Queries and Reports: Introducing the Vitro Query Tool 10.6084/m9.figshare.9897008.v1, A Library of Queries and Reports: Introducing the Vitro Query Tool 10.5281/zenodo.3518713, confIDent - for FAIR conference metadata 10.5281/zenodo.3518714, confIDent - for FAIR conference metadata 10.5281/zenodo.3564456, Trends und Entwicklungen rund um VIVO 10.5281/zenodo.3564457, Trends und Entwicklungen rund um VIVO 10.21105/joss.01182, VIVO: a system for research discovery 10.31263/voebm.v72i2.2808, Open Science und die Bibliothek – Aktionsfelder und Berufsbild 10.3897/rio.4.e31656, Reference implementation for open scientometric indicators (ROSI) 10.6084/m9.figshare.11961699, Ontological Domains for Representing Scholarship 10.6084/m9.figshare.11961699.v1, Ontological Domains for Representing Scholarship 10.5281/zenodo.3862804, Bibliometrische Visualisierungen auf dem Prüfstein – Versuch einer bibliothekarischen Perspektive 10.5281/zenodo.3896517, AEON - Towards an Academic Events Ontology 10.5281/zenodo.3900193, Research Profile Ownership through User Studies: A Case Study in the German National Research System 10.25815/cex2-cc69, Virtual Conferences Require Dedicated Time, Too 10.6084/m9.figshare.13645577.v1, VIVO Ontology Development: Why, What, and How 10.6084/m9.figshare.13645577, VIVO Ontology Development: Why, What, and How 10.25968/opus-837, Roving Librarians in der Zentralbibliothek der Hochschule Hannover: ein Experiment 10.25968/opus-6, Libworld. Biblioblogs global 10.25968/opus-1, Teaching Information Literacy with the Lerninformationssystem 10.25968/opus-2, Personalisiertes Lernen in der Bibliothek: das Düsseldorfer Online-Tutorial (DOT) Informationskompetenz 10.25968/opus-198, LibWorld - library blogging worldwide 10.25968/opus-521, CoScience : gemeinsam forschen und publizieren mit dem Netz 10.25968/opus-102, 13 Dinge 10.25968/opus-330, Die Online-Auskunft der Universitäts- und Landesbibliothek Düsseldorf : erste Evaluation eines Erfolgsmodells 10.25968/opus-272, Working Paper: Open Government Data 10.25968/opus-303, Lernen 2.0 : Bericht aus der Praxis 10.25968/opus-453, Empfehlungen zur Öffnung bibliothekarischer Daten 10.25798/qsdh-en13, 12th Annual / International VIVO Conference 2021 10.6084/m9.figshare.14842221, Geospatial information in VIVO -- Thoughts, Ideas, Suggestions 10.6084/m9.figshare.14842221.v1, Geospatial information in VIVO -- Thoughts, Ideas, Suggestions 10.5281/zenodo.5031664, Der Forschungsatlas - a community-oriented research profile system in the making 10.5281/zenodo.5031663, Der Forschungsatlas - a community-oriented research profile system in the making 10.5281/zenodo.5040057, Towards FAIR research information - insights from expert workshops 10.5281/zenodo.5040056, Towards FAIR research information - insights from expert workshops 10.5281/zenodo.5060204, Forschungsevaluierung FAIR(ER) gestalten 10.5281/zenodo.5060203, Forschungsevaluierung FAIR(ER) gestalten 10.5281/zenodo.5075098, Sharing Queries and Reports with the Reporting Marketplace 10.5281/zenodo.5075108, Creating a Semantic Catalogue of Architectural Drawings 10.5281/zenodo.5075107, Creating a Semantic Catalogue of Architectural Drawings 10.5281/zenodo.5082237, Priorities of a VIVO Community - results of a survey at the 5. VIVO-Workshop 2021 10.5281/zenodo.5082236, Priorities of a VIVO Community - results of a survey at the 5. VIVO-Workshop 2021 10.5281/zenodo.5082370, Geospatial information in VIVO - thoughts, ideas, suggestions 10.5281/zenodo.5082369, Geospatial information in VIVO - thoughts, ideas, suggestions 10.5281/zenodo.5526786, Das Projekt OPTIMETA – Stärkung des Open-Access-Publikationssystems durch offene Zitationen und raumzeitliche Metadaten 10.5281/zenodo.5526785, Das Projekt OPTIMETA – Stärkung des Open-Access-Publikationssystems durch offene Zitationen und raumzeitliche Metadaten 10.5281/zenodo.5767060, How even small and independent journals can contribute to the citation commons 10.5281/zenodo.5767059, How even small and independent journals can contribute to the citation commons 10.25968/opus-2135, Are Conference Posters Being Cited?