Merative Annotator for Clinical Data Container Edition

Prerequisites

Annotator for Clinical Data (ACD) Container Edition has the following prerequisites:

  • Red Hat OpenShift Container Platform (OCP) 4.10 or later
  • Share storage for ACD configuration storage - This can be:
    • A persistent volume claim against a ReadWriteMany shared file system
      • This is most often used with on-premise, cloud-based clusters based on VMWare or OpenStack with an NFS file system. If using a shared file system ensure it supports persistent volume claims of ReadWriteMany (RWX) access mode across all zones and nodes of the cluster.
    • An object bucket
      • On IBM Cloud - IBM Cloud Object Storage (COS) with a Regional bucket in the same cloud region as the cluster using Standard Storage Class
      • On AWS - S3 Bucket in the same region
  • Command line tools
    • oc - OpenShift CLI for interacting with the cluster
  • Login credentials and other cluster connection details from your cluster administrator
  • A dedicated OpenShift project (namespace) per ACD deployment
  • Purchased ACD registry credentials for pulling images from the ACD registry
  • A secret for accessing a storage bucket, if planning to use an object storage bucket

Obtain the connection details for your OpenShift Container Platform cluster from your administrator. For additional planning and installation details, see:

Resources required

By default, an ACD installation requires the following minimum resources:

ComponentNumber of worker nodesCPU/nodeMemory/node (G)
ACD38 min (16 recommended )64

For high availability, run 3 replicas of the ACD service on a minimum of 3 worker nodes that have 16 CPU/node and 64 GB of memory (or 6+ 8x32 nodes). For a development or test environment, 1 or 2 replicas can be configured and 8 CPU/node may be used with 64 GB of memory per replica. See performance and capacity planning for more information.

By default, the ACD pods may use all of the CPUs on a node. If needed, you can limit the ACD deployment CPU usage (see Configuration).

These are the requirements for ACD. The cluster itself has additional requirements for master, infrastructure and possible additional worker node for monitoring, logging and other components or applications being run. Please see the OpenShift recommended host practices for guidance on adding infrastructure nodes and moving resources to those nodes.