Indexing fails for large XLSX items in Veritas Enterprise Vault

book

Article ID: 100052429

calendar_today

Updated On:

Description

Error Message

Slow performance during indexing.

Cause

While processing the XLSX files (individual XLSX files or XLSX files in zipped folders) for archiving, Oracle Content Converter generates large converted content for the XLSX files, and the same is passed to the WEX Indexing Engine. The WEX Indexing Engine consumes resources for indexing these large contents and ultimately fails due to the known limitations of the Indexing Engine.

Resolution

Warning: Incorrect use of the Windows registry editor may prevent the operating system from functioning properly. Great care should be taken when making changes to a Windows registry. Registry modifications should only be carried-out by persons experienced in the use of the registry editor application. It is recommended that a complete backup of the registry and workstation be made prior to making any registry changes.

Enable Content Conversion Optimization for the XLSX files by adding the following registry keys:

Name: OptimizeContentConversionForExcelSheets
Location: [HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\KVS\Enterprise Vault\Storage\Content Conversion]
Type: DWORD
Value: “0” Optimization disabled/ “1” Optimization enabled

Name: MinimumSizeOfLargeExcelFile
Location: [HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\KVS\Enterprise Vault\Storage\Content Conversion]
Type: DWORD
Value: 1 to 5 (in Mbs)

Steps:

  1. Create the above-mentioned registry keys on the Enterprise Vault index server.

  2. Change the registry OptimizeContentConversionForExcelSheets value to 1.

  3. Adjust the size of the large XLSX file using MinimumSizeOfLargeExcelFile.

  4. Restart the Enterprise Vault Storage service.

Remarks:

  • If the XLSX file size is less than the size set in the registry key MinimumSizeOfLargeExcelFile, then there is no change in the processing of the file.

  • If the XLSX file size is more than the size set in the registry MinimumSizeOfLargeExcelFile, then it is converted into text instead of HTML. The text conversion reduces the Converted Content size significantly.

  • All contents of the Excel files get completely indexed. However, the preview of text converted Excel files is not as readable as the HTML converted Excel files.

Issue/Introduction

Indexing takes a long time and ultimately fails for large XLSX items.

Additional Information

JIRA: CFT-3879