Link

Setup Pre-annotator

The pre-annotator is an agent running in the background to make a best guess of un-reviewed data before human annotators start to annotate the samples. A good preannotator can potentially speed up annotation process and improve annotation quality.

SmartAnno uses adaptive models in the backend to complete the pre-annotations:

  • At the beginning of the annotation, it uses rule-base pre-annotator (built on top of the ConText algorithm), which can efficiently rule out negated samples.
  • After annotators complete some annotations, it will switch to using a machine-learning-based model to complete the pre-annotations. This machine learning-based-model can be replaced or customized, as long as it follows the required interface

Configure rule-based pre-annotator

At the beginning, you will need to configure this rule-based pre-annotator by adding keywords to each annotation type. add keywords

The UMLS-based synonym expander

If you have configured UMLS API key, you can now use UMLS- provided REST service to expand your keywords.

First, you can select which keywords you want to discover using embedding: select keywords

Then you will be prompted to choose which candidates are the synonyms to the selected keywords: Animation of UMLS-based synonym expander

Note: Because UMLS REST service is internet-based and the service can be overwhelmed by too many requests, you will often encounter lag when using this UMLS-based synonym expander.

If you have configured the word embedding model, you can now use the embedding model to expand your keywords.

First, you can select which keywords you want to discover using embedding: select keywords

Then you will be prompted to choose which candidates are relevant to the selected keywords: Animation of embedding-based synonym expander

SmartAnno will automatically merge the duplicated keywords. All these keywords will be used to pre-annotate the incoming samples.

Machine-learning-based pre-annotator

The machine-learning-based pre-annotator will be configured after some manual annotations are completed.