Using the Enrich and Clean feature, it is possible to standardize metadata.
If different content publishers spelled an author's name in different ways, finding the author's content becomes tedious.
In this example, the author is Alan Smithee, and the name of the author has been spelled in the following ways:
- Alan Smithee
- Allen Smithee
- Alan Smithy
- Allen Smithy
In this case, every variation of the name becomes a filter and a tag, although the name refers to the same author.
To standardize metadata values, proceed as follows:
-
Declare Allen Smithee, Alan Smithy, and Allen Smithy as synonyms of Alan Smithee.
-
Upload the CSV vocabulary file.
-
Add an Enrich and Clean rule by selecting the New rule button.
-
Select the
Author
metadata. -
Select the vocabulary file with the synonyms of Alan Smithee.
-
Name the generated metadata, for example
Author_cleaned
. -
Select the Create button.
-
Select the Save button.
-
Select the Reprocess button.
-
Go the Homepage section of the Portal tab in the Administration menu.
-
Select the Search tab.
-
Add the
Author_cleaned
metadata under the Select and order facets section. -
Select the cogwheel icon next to
Author_cleaned
, and rename the facetAuthor
. -
Select the Save button.
Once the reprocessing over, all content published with Alan Smithee, Allen Smithee, Alan Smithy, or Allen Smithy as an author is under the Allan Smithee name.