Collection Vocabulary Extraction Workflow

Workflow Inputs

Name                      Description
collection-analysis-item  The item representing the analysis that is to be created
collection-item           The item representing the collection from which the vocabulary is to be extracted
document-analysis-type    The type of analysis to be performed on the documents (this is always Documents Vocabulary Terms Extraction)
read-level                The read access level that the vocabulary is to have


Workflow Outputs

Name                      Description
analysed-collection-item  The item representing the collection, once the vocabulary extraction is complete


Work Items

Name: collection parsing
Description: Parsing of the HTML files of the documents in the collection
Action: [Parse HTML]|parse collection
Duration: 10
Run Locally: false

Name: vocabulary extraction from collection documents
Description: Extraction of the key terms from the parsed documents
Action: [Analyse]|term extraction from collection documents
Duration: 10
Run Locally: false

Name: documents vocabulary combination
Description: Combine the document vocabularies to create a collection vocabulary
Action: [Analyse]|collection documents term combination
Duration: 10
Run Locally: false


Work Item Workflow Input Connections

Work Item Input                                                  Workflow Input
[collection parsing]|collection-item                             collection-item
[documents vocabulary combination]|collection-analysis-item      collection-analysis-item
[documents vocabulary combination]|document-analysis-type        document-analysis-type
[documents vocabulary combination]|read-level                    read-level
[vocabulary extraction from collection documents]|analysis-type  document-analysis-type
[vocabulary extraction from collection documents]|read-level     read-level


Work Item Interconnections

Work Item Input                                                           Preceding Work Item Output
[documents vocabulary combination]|analysed-documents-collection-item     [vocabulary extraction from collection documents]|analysed-documents-collection-item
[vocabulary extraction from collection documents]|parsed-collection-item  [collection parsing]|parsed-collection-item
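
The interconnections above imply the order in which the three work items can run. The following is a minimal sketch of deriving that order, assuming Python 3.9 or later and the standard-library graphlib module. The dictionary layout and the function name execution_order are illustrative assumptions and are not part of the workflow system itself.

```python
from graphlib import TopologicalSorter

# Each work item maps to the set of work items whose outputs it consumes,
# as listed in the Work Item Interconnections table.
# This structure is a hypothetical representation chosen for illustration.
PREDECESSORS = {
    "collection parsing": set(),
    "vocabulary extraction from collection documents": {
        "collection parsing",
    },
    "documents vocabulary combination": {
        "vocabulary extraction from collection documents",
    },
}


def execution_order(predecessors):
    """Return the work items in an order that respects their dependencies."""
    return list(TopologicalSorter(predecessors).static_order())


order = execution_order(PREDECESSORS)
for position, work_item in enumerate(order, start=1):
    print(position, work_item)
```

Because the three work items form a single chain, only one valid order exists: parsing, then extraction, then combination.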


Work Item Workflow Output Connections

Workflow Output           Work Item Output
analysed-collection-item  [documents vocabulary combination]|analysed-collection-item


Work Item Inputs Check

Work Item                                        Work Item Input           Connection Id               Workflow Input            Workflow
collection parsing                               collection-item           -1011_-9223372036854457271  collection-item           -1011_-9223372036854457299
documents vocabulary combination                 collection-analysis-item  -1011_-9223372036854438661  collection-analysis-item  -1011_-9223372036854457299
documents vocabulary combination                 document-analysis-type    -1011_-9223372036854457274  document-analysis-type    -1011_-9223372036854457299
documents vocabulary combination                 read-level                -1011_-9223372036854457272  read-level                -1011_-9223372036854457299
vocabulary extraction from collection documents  analysis-type             -1011_-9223372036854457270  document-analysis-type    -1011_-9223372036854457299
vocabulary extraction from collection documents  read-level                -1011_-9223372036854457269  read-level                -1011_-9223372036854457299
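
A check of this kind can be reproduced by hand. The sketch below is a hypothetical consistency check, not the tool that produced the table above: it confirms that every work item input points at a declared workflow input and that no declared workflow input is left unused. All names in the code (WORKFLOW_INPUTS, INPUT_CONNECTIONS, and both functions) are assumptions made for illustration.

```python
# Workflow inputs as declared in the Workflow Inputs section.
WORKFLOW_INPUTS = {
    "collection-analysis-item",
    "collection-item",
    "document-analysis-type",
    "read-level",
}

# (work item, work item input) -> workflow input,
# copied from the Work Item Workflow Input Connections section.
INPUT_CONNECTIONS = {
    ("collection parsing", "collection-item"): "collection-item",
    ("documents vocabulary combination", "collection-analysis-item"):
        "collection-analysis-item",
    ("documents vocabulary combination", "document-analysis-type"):
        "document-analysis-type",
    ("documents vocabulary combination", "read-level"): "read-level",
    ("vocabulary extraction from collection documents", "analysis-type"):
        "document-analysis-type",
    ("vocabulary extraction from collection documents", "read-level"):
        "read-level",
}


def dangling_connections(connections, workflow_inputs):
    """Return connections whose target is not a declared workflow input."""
    return sorted(
        key for key, target in connections.items()
        if target not in workflow_inputs
    )


def unused_workflow_inputs(connections, workflow_inputs):
    """Return declared workflow inputs that no work item input consumes."""
    return sorted(workflow_inputs - set(connections.values()))


dangling = dangling_connections(INPUT_CONNECTIONS, WORKFLOW_INPUTS)
unused = unused_workflow_inputs(INPUT_CONNECTIONS, WORKFLOW_INPUTS)
print("dangling connections:", dangling)
print("unused workflow inputs:", unused)
```

For the connections listed in this document, both lists come back empty, which matches the table: each of the six work item inputs resolves to one of the four workflow inputs.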

Workflow Outputs Check

Workflow Output           Connected Work Item Output
analysed-collection-item  [documents vocabulary combination]|analysed-collection-item