DataCite Publisher

Overview

The DataCite Publisher acts as the intermediary between the DiSSCo PID API and the DataCite API. The Publisher receives messages from the DiSSCo PID API through a RabbitMQ Queue. These messages contain the records of 200-400 PIDs minted through the specimen ingestion process, and may either be a batch of PIDs for Digital Specimens or Media Objects. These messages are mapped from the DiSSCo FDO Profile to the DataCite metadata schema. Once messages are mapped, they are sent, one at a time, to the DataCite API via POST request. This process informs DataCite of the minting of new DOIs that must be available within their system.

Thanks to the RabbitMQ queue, this process is done asynchronously from the rest of the ingestion process. The PID records are created in the DiSSCo PID API during ingestion, and are upgraded to DataCite APIs when the message reaches this Publisher.

Infrastructure Diagram

Profiles

There are three profiles:

PUBLISH: This publishes messages to DataCite (test or production environment, depending on configuration)
TEST: The service formats a request, but does not publish messages to DataCite.
WEB: Exposes a controller to recover from errors (see "Error Recovery")

Error Recovery

To recover from errors, we include the WEB profile. This exposes a controller which accepts a list of DOIs to re-send a message to DataCite.

The recovery service reads the FDO record for each handle and sends a request to DataCite, either an update or a create.

If Event Type is Unknown: If it is unknown if DataCite has a record of the DOI, we may send multiple requests to recover from the error. First, we send a POST to DataCite. If DataCite already has a copy of this record, they will return a 422 UNPROCESSABLE ENTITY and an error message indicating the DOI is already taken. In that case, we recover from this error and send an update message to DataCite instead. Only in the WEB profile is this error recovery flow implemented; in the regular flow, we assume we know if it is an update or a new DOI record, and structure the message to DataCite accordingly.

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
.github/workflows		.github/workflows
docs		docs
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
lombok.config		lombok.config
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DataCite Publisher

Overview

Profiles

Error Recovery

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

DiSSCo/datacite-publisher

Folders and files

Latest commit

History

Repository files navigation

DataCite Publisher

Overview

Profiles

Error Recovery

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages