Documentation

The entire Epigraf application is an Application Programming Interface (API). Epigraf allows data retrieval and modification by external programs via defined endpoints. In principle, Epigraf can also be used as a backend for independently hosted front ends. Data can be exchanged in a structured format (JSON). Further, rendered HTML components, such as the article list (without the layout), can be embedded in a front end via API access.

The endpoints correspond to the URLs that are visible when navigating the pages in the browser. To output structured formats via these endpoints instead of HTML web pages, append the desired format extension to the path:

The XML format includes all content on a page and is structured hierarchically. For example, article elements contain all sections. XML is also a valuable starting point for export pipelines.
The JSON format has a hierarchical structure, but is generally more compact than XML and easier to process for other applications.
The CSV format contains tabular data and is compatible with many statistical programs. Epigraf outputs entities from different hierarchical levels (e.g. articles and their sections) in rows below each other. The hierarchy of entities is stored in id columns.

You will find buttons for different formats at the bottom of the pages. Alternatively, the URL displayed in the browser can be modified so that the path ends with the file format (.xml, .json, .csv).

Example: The article list of a database can be accessed in the browser via /epi/public/articles as a webpage.

The same list can be retrieved in JSON format via /epi/public/articles.json.
The data is served in XML format via /epi/public/articles.xml.
A CSV table is generated via /epi/public/articles.csv.

Table pages such as the projects table or the articles table are served in a Hydra-compatible format, enabling harvesting of collections. If the appropriate configuration is set up, articles can be output in RDF-compatible triple formats.

JSON-LD (jsonld extension)
Turtle (ttl extension)
RDF/XML (rdf extension)

In addition to retrieving structured data, the following rendered views can be generated:

HTML (without extension): Interactive webpage to be displayed in browsers. To embedd components into another webpage, the show parameter can be used to select specific elements. The theme parameter determines the CSS styling.
Markdown (md extension): Presentation of article content in rendered markdown.

All endpoints return UTF-8 encoded data. Non-printing characters such as the unit separator control character are filtered out.

An access token is required to access non-public data and for write access. This token is appended to the URL or sent as a bearer token, for example: /epi/stage/articles.csv?token=ABCDEFG. The token is generated in Epigraf's user account.

To use the access token, API permissions must be granted both for accessing the relevant database and for accessing the endpoint. For example, to use the articles/index endpoint with a token, the following permissions are required:

User: \<username>
Role: remains empty
Requested by: "api"
Permission type = "access"
Entity Type: "databank"
Entity name: \
Entity ID: can remain empty, optionally the ID of the databank
Permission name = "epi/articles/index"

In all endpoints, the path parameter <db> must be replaced by the selected project database, for example by "public".

For index and view endpoints, the optional idents query parameter determines how IDs are composed in the ID fields:

id: By default, the IDs are generated from the table name and entity ID, for example articles-551.
tmp: Temporary IDs are created. When used in subsequent imports, new entities will be created for temporary IDs. Example: articles-tmp551.
iri: Replace IDs by IRI paths, for example articles/epi-article/dresden~551.

Pagination and order

Each query returns as many entities as requested in the limit parameter. Additional entities can be retrieved using the page parameter.

Query Parameter	Explanation
limit	Optional. Default: Number of entities per page. The default varies depending on the endpoint; typically, 100 entities are returned.
page	Optional. Default: 1. Page number.
total	Optional. Maximum number of entities returned (even if more are available)
sort	Optional. Specifies the field by which the data is sorted. The first entities are returned; further entities can be retrieved by pagination. Multiple fields are separated by commas.
direction	Optional. In conjunction with the sort parameter, defines the sort direction. asc: Ascending order desc: Descending order If sorting is performed on multiple fields, the sort order for each field can be defined in a comma-separated list.
id	Optional: Restricts the result to a single entity.

Some endpoints support cursor-based pagination. With cursor-based pagination, a reference entity is defined as a reference point for the result. Cursor-based pagination can be used to narrow down a category tree (epi/<db>/properties/index) to specific segments. The seek parameter can be used to jump to specific nodes within the result list. For the properties/index endpoint, the entity hierarchy is expanded: ancestors of the filtered entities are always included in the result. In addition, when seeking a specific entity, the previous entities in the result set are returned.

Query Parameter	Explanation
cursor	ID of a reference entity. The entity that follow in the sorted result set are returned. For the `properties/index` endpoint, the sorting is fixed to the tree structure.
direction	Sorting direction: ascending (default value asc) or descending (desc).
limit	Number of entities returned by a query. In the `properties/index` endpoint, the number of entities may be larger, as ancestor nodes are always returned.
children	In combination with `cursor`, the `properties/index` endpoint determines whether the returned entities start at the next sibling node (default value 0) or at the next child node (value 1). This parameter is only taken into account for ascending sort order (direction=asc).
collapsed	In combination with `cursor`, the `properties/index`-endpoint determines whether the result set should be restricted to nodes on the same level (value 1) or whether all descendants should be returned (default value 0). This parameter depends on the `children` and `cursor` parameters: if child nodes are requested, the result set is restricted to the child node level. Otherwise, the result set is restricted to the cursor node level. If no cursor parameter is provided, the result set is restricted to the first level.
seek	ID of a reference entity. Unlike the `cursor` parameter, the number of entities does not refer directly to the reference entity. Instead, additional preceding (sort direction desc) or following (sort direction asc) entities are returned, with the number depending on the `limit` parameter. This allows to query the previous and next nodes of an entity. This parameter excludes using the parameters `cursor`, `collapsed` and `children`.

The index endpoints support selecting columns to be returned by using the columns parameter. Multiple columns are selected as a comma-separated list.

If the columns parameter is empty, the default fields will be returned.
Additional columns are configured in the types configuration by the columns key.
Registered users can also request ad hoc columns using extraction keys. An ad hoc column consists of the caption, followed by an equal sign and the extraction key. The parameter must be URL-encoded because it contains an equal sign.
Settings the columns parameter to false (or 0) on the index endpoints will result in full entity data (e.g. articles and all contained data) being returned instead of just the flat table.

The extraction key can adress simple fields of the article object (e.g. title) or nested fields using dot notation (e.g. project.shortname). This follows the conventions of the types configuration. To access lists, an asterisk placeholder can be used (e.g. items.{\*}.value). Lists can be filtered with conditions in square brackets (e.g. items.{\*}[itemtype=conditions].date). For further details, see the CakePHP documentation on hash syntax.

By default, all ID fields contain IRI paths. To request IDs, use the idents parameter.

For HTML output, specific components can be explicitly requested using the show parameter. This allows, for example, the main content to be embedded in external pages without sidebars. The parameter supports a comma-separated list of the following elements:

mainmenu: The main menu.
searchbar: The search bar in table views.
content: The main content area.
footer: The footer area.
leftsidebar: The left sidebar.
rightsidebar: The right sidebar.

Styling of the rendered view depends on the theme parameter. The minimal theme creates a compact view with reduced outer margins, which is particularly useful when embedding content in iframes.

The flow parameter determines the click target of entities in a collection (e.g. table views). The options include frame, popup or tab.

All endpoints are structured according to the same pattern. There are two different endpoint routes:

Global endpoints access application level actions. They consist of a controller, which usually corresponds to a database table, the action to be executed and, if necessary, additional parameters: /<controller>/<action>?<parameter>
Project endpoints additionally contain the path segment epi and a database name: <db>:/epi/<db>/<controller>/<action>?<parameter>

To find out the available endpoints, simply browse Epigraf in a webbrowser and pay attention to the adress bar. Each URL is an endpoint, consisting of a controller path (e.g. projects, articles, properties) and an action path. Typically, a controller provides the following actions

index
view
show
edit
add
delete
import
transfer
mutate

Note: The following documentation is not complete yet. But most endpoints can be inferred from the URLs in the browser address bar.

A list of all projects. Example: /epi/public/projects?term=greifswald.

Query Parameter	Explanation
term	Optional. Only projects whose name, description or IRI contain the search term are returned.
projecttypes	Optional. A comma-separated list, restricts the result to projects with the specific types.

The content of a single project entity.

Parameter	Explanation
<id>	Mandatory. The ID of the project.

A paginated list of articles. Identical to the short form GET /epi/<db>/articles.

Query Parameter	Description
term	Optional. In conjunction with the field parameter, restricts the articles to articles matching the search term.
field	Optional. In conjunction with the term parameter, specifies which fields to search. captions: Search by article number and title. articlenumber: Search by article number. title: Search by article title. status: Search by the status field of the article. text: Searches the full text index of articles. The searchable full text must be indexed, which means that items of type `search` must be created that contain the indexed text in their content field.
articletypes	Optional. Restricts the results by article types. A comma-separated list of article types as defined in the configuration.
projects	Optional. Restricts the results to articles belonging to the given projects. A comma-separated list of project IDs.
properties.<propertytype>.selected	Optional. Restricts the results to articles with the given categories assigned via links or items. A comma-separated list of category IDs. The `<propertytype>` placeholder has to be replaced by the name of the property configuration. Example: `/epi/public/articles.json?properties.objecttypes.selected=975`
properties.<propertytype>.flags	Optional. A comma-separated list of options related to the category filter. Available options: all: Select all categories. des: Select descendants, i.e. sub categories. inv: Reverse the selection. grp: Outputs grouped or enriched results. This generates pie charts in maps, groups the tile view into lanes, and displays annotated text passages below the articles in the table.
lat	Optional. Used in combination with the parameters lng and zoom. When a map is displayed, it is centered on the given point (at and lng parameters) and zoom level (zoom parameter). In addition, items can be sorted according to their distance from this point by setting the sort parameter to `distance`.
lng	Optional. Used in combination with the parameters lat and zoom.
zoom	Optional. Used in combination with the parameters lat and lng.
tile	Optional. Returns only articles containing coordinates in the given tile. Tiles are formed according to the pattern <zoom>/<x>/<y>, see https://wiki.openstreetmap.org/wiki/Slippy_map_tilenames. The slash is URL-encoded as %2F. Example: `/epi/public/articles.json?tile=11%2F676%2F1066`
quality	Optional. Used in conjunction with the `tiles` parameter. When querying geocoded data, the quality level or validity can be restricted: 0 = Unchecked: All coordinates are taken into account. 1 = Uncertain: The coordinates must at least have been automatically checked for plausibility. 2 = Verified: Only manually verified coordinates are taken into account. Depending on the accuracy of the data, points may be approximate and only represent the area most likely to be related to an article.
template	Optional. Layout of the HTML output. table: The basic tabular template. map: A map with geocoded articles. tiles: A tile view with article previews. lanes: Lanes grouping articles by categories. coding: A view optimized for quick editing. In the sidebar, articles are always opened in edit mode.
columns	Optional. Comma-separated list of fields to be displayed in the table. By default the fields articlenumber, title and norm_iri are displayed. Available fields are configured in the columns key of article types. Authenticated users can define ad hoc fields. An ad hoc field consists of a column name followed by an equal sign and an extraction key. The field must be URL-encoded because it contains an equal sign. You can set the columns parameter to `false` or `0` to get full entity data instead of a table with extracted columns. Extraction keys are article fields (e.g., `title`) or nested fields with dot notation (e.g., `project.shortname`). To access lists, an asterisk placeholder can be used (e.g., `items.{}.value`). Lists can be filtered with conditions in square brackets (e.g., `items.{}[itemtype=conditions].date`). For further details, see the [extraction key documentation](/user/coreconcepts/keys/).
published	Optional. Only articles that meet or exceed the given publication level will be returned: 0 = Drafted 1 = In progress 2 = Completed 3 = Published 4 = Searchable
snippets	Optional. A comma-separated list of snippets to be included in the result. A snippet is a predefined compilation of content: paths: The result contains detailed information on the hierarchical structure of sections and lemmas. search: Return the full text index content. indexes: Category indexes are generated. iris: Return IRIs. published: Include the publication status field. tags: Extract tags from XML fields. warnings: Output warnings regarding the structure of entity content.
details	Optional (experimental). A comma-separated list of extraction keys. The extracted content is displayed in the table view as search results. Used to show segments annotated by categories. Example: `/articles/index?details=items..tags.`.
idents	Optional. Determines how IDs are composed in the ID fields: id: By default, the IDs are generated from the table name and entity ID, for example `articles-551`. tmp: Temporary IDs are created. When used in subsequent imports, new entities will be created for temporary IDs. Example: `articles-tmp551`. iri: Replace IDs by IRI paths, for example `articles/epi-article/dresden~551`.

The content of a single article entity.

Parameter	Explanation
id	Mandatory. The ID of the article.

Import articles including all associated data. The data structure must correspond to the Relational Article Model. Three variants are supported: 1. Provide a file to be uploaded and imported. 2. Provide the name of a file available on the server. 3. Provide JSON data.

Example: /epi/epi_movies/articles/import?filename=newmovies.csv

Parameter	Explanation
filename	When importing files or folders already present on the server: file name or folder name on the server, relative to the import folder of the database. The files must contain XML or CSV. Folders can contain multiple XML files.
file	When uploading a file: The file content following the Relational Article Model in CSV or XML format.
data	When posting JSON data: The data array according to the Relational Article Model.
pipeline_id	In case the data should be transformed using a pipeline, provide the pipeline ID. Only available in combination with the filename parameter.
tree	By default (value '1'), the tree structure of sections is regenerated after each import step. Disable with '0'.
solved	Set to '1' to return a mapping between input IDs and resulting database IDs. Default is '0'. Handle with care, the result ist stored in the job's result field in the database and may grow large.

Delete an article, including its sections, items, footnotes and links.

Parameter	Explanation
<id>	Numeric article ID (e.g. 1).

Move sections within an article after or before a reference section.

Parameter	Explanation
<id>	ID of the section to move.
reference_id	ID of a reference section.
reference_pos	Desired relation of the reference section to the moved section: `parent` or `preceding`.

List of categories of a category type.

Parameter	Explanation
<propertytype>	Mandatory. Name of the category system. You can provide it as path parameter or as query parameter.

The content of a single property entity.

Parameter	Explanation
<id>	Mandatory. The ID of the property.

A GET request results in a preview of how the target category would look after merging with source categories.

The endpoint can be used with query or with path parameters. Example: GET /epi/<db>/properties/merge/<source>/<target>.

Parameter	Explanation
source	A comma separated list of IDs determining the source categories to be merged and deleted.
target	ID of the target category into which source categories will be merged.
concat	If set to '1', content from text fields (lemma, sortkey) is concatenated, if the source entities differ.By default ('0') only the references are rewired.

Merges multiple categories by transferring the contents of one or more source categories to a target category. If the concat parameter is set to '1', contents from text fields is merged. All references to the source categories (e.g. from articles) are rewired to the target category. The source categories are then deleted.

Example: POST /epi/public/properties/merge?source=2,3,4&target=1

The endpoint can alternatively be used with path parameters: POST /epi/<db>/properties/merge/<source>/<target>

If the tree structure is affected by the merge, a follow-up POST request should be executed on the endpoint /epi/<db>/properties/mutate/<propertytype> to recover the tree by a sort operation on the lft field.

Parameter	Explanation
source	A comma separated list of IDs determining the source categories to be merged and deleted.
target	ID of the target category into which source categories will be merged.
concat	If set to '1', content from text fields (lemma, sortkey) is concatenated, if the source entities differ.By default ('0') only the references are rewired.

The mutate endpoint can be used to perform batch processing tasks.

For categories, the sort task can be used to reorder the category tree. It is further used to recover the tree structure in case it gets invalid. Tree structures are invalid if the lft/rght values are no longer correct, for example after import processes or after merging categories.

Parameter	Explanation
task	The task: `sort`: Reorder or recover the tree.
<propertytype>	The property type, for example `fonttypes`.

Payload	Explanation
config.params.sortby	Task option are provided in the payload of the POST request: `lemma` Sort by lemma field. `sortkey` Sort by sortkey field. `lft` Order by existing lft values to repair the tree. All lft and rght values are regenerated. `sortno` Order by sort numbers used by EpiDesktop to repair the tree. All lft/rght values are regenerated.

The types endpoint returns the database configuration. Each type defines what kind of projects, articles, sections, items, annotations (links and footnotes) and categories occur in a database.

Parameter	Explanation
scope	Optional. Limit the returned configuration entities. A comma separated list containinge one or more of `projects`, `articles`, `sections`, `items`, `links`, `footnotes` or `properties`.
term	Optional. Filter the result by entities contain the search term in their name, caption, description or IRI.

The content of a single type entity.

Parameter	Explanation
<id>	Mandatory. The ID of the type.

Access to endpoints

General parameters of all endpoints

Common parameters

Fields, columns and blocks

Available endpoints

List of endpoints

Projects

GET /epi/\<db>/projects/index

GET /epi/\<db>/projects/view/\<id>

Articles

GET /epi/\<db>/articles/index

GET /epi/\<db>/articles/view/\<id>

POST /epi/\<db>/articles/import

DELETE /epi/\<db>/articles/delete/

Sections

POST /epi/\<db>/sections/move/\<id>

Properties

GET /epi/\<db>/properties/\<propertytype>/index

GET /epi/\<db>/properties/view/\<id>

GET /epi/\<db>/properties/merge

POST /epi/\<db>/properties/merge

POST /epi/\<db>/properties/mutate/\<propertytype>

Types

GET /epi/\<db>/types/\<scope>/index

GET /epi/\<db>/types/view/\<id>