Welcome to epmc-tools’s documentation!

epmc-tools is a Python package that provides a command-line interface (epmc-cli) and a library for working with scientific literature. With it you can:

  • Process JATS XML from local files or URLs and convert it to JSON.

  • Extract accession numbers from text.

  • Split text into sentences.

  • Access the Europe PMC APIs for searching articles, grants, and annotations.

  • Harvest metadata via the OAI-PMH service.
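
To make the first item concrete, here is a minimal sketch of what a JATS-to-JSON conversion can look like. It uses only the Python standard library and does not call epmc-tools itself, since this page does not show the package's API. The function names (jats_to_dict, jats_to_json), the sample document, and the shape of the JSON output are all assumptions made for illustration; the output produced by epmc-tools may well differ.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical sample input; a real JATS article carries far more metadata.
SAMPLE_JATS = """<article>
  <front>
    <article-meta>
      <title-group>
        <article-title>An example article</article-title>
      </title-group>
      <abstract><p>First abstract paragraph.</p></abstract>
    </article-meta>
  </front>
  <body>
    <sec><title>Introduction</title><p>Some body text.</p></sec>
  </body>
</article>"""


def jats_to_dict(xml_text):
    """Collect the title, abstract paragraphs and body paragraphs of a JATS article.

    The returned structure is an illustrative assumption, not the epmc-tools format.
    """
    root = ET.fromstring(xml_text)

    def text_of(element):
        # itertext() also picks up text nested in inline markup such as <italic>;
        # split/join collapses runs of whitespace into single spaces.
        return " ".join("".join(element.itertext()).split())

    title = root.find(".//article-title")
    return {
        "title": text_of(title) if title is not None else None,
        "abstract": [text_of(p) for p in root.findall(".//abstract//p")],
        "body": [text_of(p) for p in root.findall(".//body//p")],
    }


def jats_to_json(xml_text):
    """Serialize the extracted fields as a JSON string."""
    return json.dumps(jats_to_dict(xml_text), indent=2)


if __name__ == "__main__":
    print(jats_to_json(SAMPLE_JATS))
```

The sketch reads the XML from a string; for a local file, the same function could be fed the file's contents after reading it with open().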
