Searching for the word "false" in "Subject or Content" will return messages containing the word false including every message with an attachment embedded or not.

book

Article ID: 100041652

calendar_today

Updated On:

Cause

The OCR conversion in Enterprise Vault provides metadata on embedded objects, with this metadata indexed by default.  This metadata can contain information such as the following:

Dimensions‪300 x 200‬
Width‎300 pixels
Height‎200 pixels
Bit depth24
Media class primary ID{6FB2E74A-B8CB-40BB-93F3-FAC5F00FA203}
Horizontal resolution‎96 dpi
Vertical resolution‎96 dpi
System.MIMETypeimage/png
System.Media.DlnaProfileIDPNG_LRG
{13795680-4F98-11D3-96DD-00C04F6888FF}\29False
Pages1
{60D6BD9A-FD54-46C0-9511-F276B66E0F94}\2Tagged Image File Format (TIFF)
{13795680-4F98-11D3-96DD-00C04F6888FF}\61

With this metadata indexed, searches with terms such as "false" or "media" will return items with embedded objects where the search terms cannot be seen.

Resolution

This issue has been addressed in the following releases available from Downloads:

Enterprise Vault 12.2.3

Enterprise Vault 12.3

 

The solution in these and future EV releases is to configure a new setting in the Vault Admin Console as follows:

  1. Launch the Vault Admin Console using an account with EV Administrator permissions.
  2. Expand the left tree view to see the Site folder.
  3. Right click on the Site folder and select the Properties option.
  4. Click the Advanced tab in the Site Properties window.
  5. Ensure the List settings from: option in top of the Advanced tab is set to Content Conversion.
  6. Scroll down to and select the Include metadata properties option.
  7. Click on the Modify button.  This will open a new window in which the setting of this option can be changed.
  8. Change the setting to Off and click the OK button to save this change and close this window.
  9. Click the OK button to acknowledge the pop-up alert stating the change will become effective after the appropriate task or service is restarted and to close the Site Properties window.
  10. Restart all EV services to ensure this change becomes effective.

After these steps have been completed, any new content that is archived will not have embedded objects' metadata indexed.  Caution is advised with making this configuration change as there could be text within the embedded objects' metadata that would be wanted for searching as this change will not make that data available in the index.

To set the configuration back to indexing the metadata of embedded objects, follow Steps 1 through 7 above, click the Reset button in Step 8 and then complete Steps 9 and 10.

Issue/Introduction

Searching for the word "false" in "Subject or Content" will return messages containing the word false including every message with an attachment embedded or not. This behavior can also be observed with other terms, such as "media".

Additional Information

JIRA: CFT-821 JIRA: CFT-926