Content auditor - reports are missing
Problem description
The reading grade level, undesirable text or duplicate titles reports are missing.
Solution
This is usually caused by customization of the filters that are run for a data source.
Check the data source configuration for each data source that is part of your search package.
-
Check to see if you have
filter.classesset in your data source configuration. If this is set in your configuration then you need to ensure it includes theJSoupProcessingFilterProviderwithin the listed set of filters.if filter.classes is not listed then your data source is configured with the default filter chain and Jsoup filtering will be enabled. e.g. this is the default filter chain:
filter.classes=TikaFilterProvider,ExternalFilterProvider:JSoupProcessingFilterProvider:DocumentFixerFilterProvider
-
Check to see if you have
filter.jsoup.classesset in your data source configuration. If this is set in your configuration then you need to ensure it includes theContentGeneratorUrlDetection,FleschKincaidGradeLevel,UndesirableTextand`TitleDuplicatesfilters within the listed set of filters.filter.jsoup.classes=ContentGeneratorUrlDetection,FleschKincaidGradeLevel,UndesirableText,TitleDuplicates