File types that are supported for ingestion into eDiscovery Platform

book

Article ID: 100038417

calendar_today

Updated On:

Description

Description

This document includes a comprehensive reference of all the file formats supported by the application. The file formats are categorized by file type.

Note:  This document assumes that the original file extensions have not been renamed to something other than what is on the supported list below.  Clearwell will automatically determine the file type by the type of data within the file to be ingested.  If the original file type is not on the supported list, then the file may be discovered but not ingested into a case.

Click on the links below to move to each section of the document.

Supported Email File Types


Supported Loose File and Email Attachment Types


Supported Container Extraction File Type

File Type Mapping

 

Supported Email File Types

Supported File Types (EMail)

Email File Extension Version
Microsoft Outlook PST PST Versions 97-2016
Microsoft Outlook MSG MSG  
Microsoft Outlook Express EML EML  
Lotus Notes NSF
(8.x - Windows only, with Domino 8.x Server or Notes 8.x Client - Extraction, conversion, viewing)
NSF Version 6.0 and higher
Apple OS X Mail EMLX EMLX V5.5 and later

 

Supported Through Conversion to PST

Type File Extension Version
Microsoft Outlook OST OST V8.3 CHF2 and later
mbox files MBOX V5.5 and later

Starting with version 8.3 CHF2 and later, the platform converts OST files to PST files. If you are on a pre-8.3 CHF2 version, please refer to this technical article: www.veritas.com/docs/000109381.

 

Supported Loose File and Email Attachment Types

All Word Processing Documents

Generic Text File Extension Type or Version
ANSI Text  TXT 7 and 8 bit
ASCII Text  TXT 7 and 8 bit
DOS character set  TXT  
EBCDIC   All versions
Hypertext Markup Language HTML HTML HTM Through 4.0 (some limitations)
IBM DCA  DCA  
IBM Revisable Form Text  varies All versions
Macintosh character set  varies  
Microsoft Rich Text Format (RTF)  RTF All versions
Unicode Text (including .c and .h)  varies 3.0, 4.0
UTF-8  varies  
WML  WML  
XHTML  XHTML File ID only
XML  XML Text only

 

DOS Word Processors

Name File Extensions  Type or Version
DEC DX  DX Through 4.0
DEC DX Plus  DX 4.0, 4.1
Enable   3.0 - 4.5
First Choice  FOL 1.0, 3.0
Framework  framework 3.0
IBM DCA/FFT  DCA  
IBM DisplayWrite  RFT, DCA 2.0 – 5.0
IBM Writing Assistant  IWA 1.01
Lotus Manuscript  MANU Through 2.0
MASS11  M11 Through 8.0
Microsoft Word DOC, DOCX 4.0 – 2016
Microsoft Works  WPS 2.0
MultiMate  PAT Through 4.0
MultiMate Advantage  MM 2.0
Navy DIF  DIF All versions
Nota Bene  NB 3.0
Novell PerfectWorks  WPW 2.0
Office Writer   4.0 – 6.0
PC-File  PC 5.0
PFS:Write  PFS A, B
Professional Write for DOS  PW 1.0, 2.
Q&A Write  QW 2.0, 3.0
Samna Word  SM Versions through Samna Word IV+
Signature   1.0
SmartWare II  SMD 1.02
Sprint  SPR 1.0
Total Word  TW 1.2
Wang IWP  DOC Through 2.6
WordMarc  WMC Through Composer Plus
WordPerfect  WP4 4.2
WordStar  WS 3.0 – 7.0
WordStar 2000  WS2 Through 3.0
XyWrite  XYW Through III Plus

 

Windows Word Processors

Name File Extension Type or Version
Adobe FrameMaker (MIF)

 FM, MIF,
BOOK

3.0 – 6.0
Adobe Illustrator Postscript  PPD Level 2
Hangul Version 97, 2002  HWP 97 – 2007
JustSystems Ichitaro  JTD 5.0, 6.0, 8.0–13.0, 2004
JustWrite  JW Through 3.0
KingSoft WPS Writer
(2010 – Extraction, conversion, viewing)

 WPS
 WPT

2010
 
Legacy  CHP 1.1
Lotus AMI/AMI Professional  SAM 2.0, 3.0
Lotus WordPro  LWP 9.7, 96 – Millennium 9.6
Microsoft Publisher  PUB 2003 – 2007
Microsoft Word
(2016 – Extraction, conversion, viewing)

 DOC, DOCX

98-J, Through 2016
Microsoft WordPad  DOC All versions
Microsoft Works  WKS 3.0, 4.0
Microsoft Write  WRI 1.0 – 3.0
Novell PerfectWorks  WPW 2.0
Novell/Corel WordPerfect
(X4 – Extraction, conversion, viewing)
 WPD 5.1 – X4
OpenOffice Document  ODF  
OpenOffice Writer   1.1, 2.0, 3.x
Professional Write Plus  PFS 1.0
Q&A Write  QW 2.0, 3.0
StarOffice Writer
(v9 – Extraction, conversion, viewing)
 SXW 5.2 – 9
 
WordStar  DOC, WS 1.0

 

Mac Word Processors

Name File Extension  Type or Version
MacWrite II  MCW 1.1
Microsoft Word (Mac)   4.0 – 6.0, 98 – 2008
Microsoft Works (Mac)   2.0
Novell WordPerfect   1.02 – 3.1

 

All Spreadsheets

Name File Extension Type or Version
Enable Spreadsheet  SSF 3.0 – 4.5
First Choice SS  PFS Through 3.0
Framework SS  CSS 3
IBM Lotus Symphony Spreadsheets  ODF 1.x
KingSoft WPS Spreadsheets
(Extraction, conversion, viewing)
 WPS 2010
Lotus 1-2-3  WK4 Through Millennium 9.6
Lotus 1-2-3 Charts (DOS and Windows)   Through 5.0
Lotus 1-2-3 for OS/2   2
Microsoft Excel Charts
(2010 – Extraction, conversion, viewing)
  2.x – 2016
Microsoft Excel for Macintosh   98 – 2008
Microsoft Excel for Windows
(2010 – Extraction, conversion, viewing)
XLS, XLSX 3.0 – 2016
Microsoft Excel for Windows (File ID only)
(2010 – Extraction, conversion, viewing, no
graphics)
  2007/2013 Binary
Microsoft Works SS for DOS   2
Microsoft Works SS for Macintosh   2
Microsoft Works SS for Windows  WKS 3.0, 4.0
Multiplan  MP 4
Novell PerfectWorks Spreadsheet  WPW 2
OpenOffice Calc (3.x – Extraction, conversion,
viewing)
  1.1 – 3.x
Openoffice Spreadsheet  ODF  
PFS Plan  PFS 1
QuattroPro for DOS  WQ Through 5.0
QuattroPro for Windows  WQ Through X3
SmartWare II SS   1.02
SmartWare Spreadsheet    
StarOffice Calc
(v9 – Extraction, conversion, viewing)
  5.2 – 9
SuperCalc   5
Symphony   Through 2.0
VP Planner   1
WordPerfect Spreadsheets
(X4 – Extraction, conversion, viewing)
  X4

 

All Presentations

Name File Extensions Type or Version
Corel Presentations PQW, PQF  
Harvard Graphics Presentation DOS  PLY 3.0
IBM Lotus Symphony Presentations   1.x
KingSoft WPS Spreadsheets
(Extraction, conversion, viewing)
 WPS 2010
Lotus Freelance  DRW 1.0–Millennium 9.6
Lotus Freelance for OS/3   2
Lotus Freelance for Windows   95, 97
Microsoft PowerPoint for Macintosh   4.0 – 2008
Microsoft PowerPoint for Windows
(2010 – Extraction, conversion, viewing)
PPT, PPTX 3.0 –2016
Novell Presentations   3.0, 7.0
OpenOffice Impress
(3.x – Extraction, conversion, viewing)
  1.1, 2.0, 3.x
StarOffice Impress
(v9 – Extraction, conversion, viewing)
  5.2 – 9
WordPerfect Presentations
(X4 – Extraction, conversion, viewing)
  6.0 – X4

 

All Images

Name File Extension Type or Version
Adobe Illustrator  AI 4.0 - 7.0, 9.0
Adobe Illustrator (XMP only)   11 – 13 (CS 1 – 3))
Adobe InDesign (XMP only)  INDD 3.0 – 5.0 (CS 1 - 3)
Adobe InDesign Interchange (XMP only)    
Adobe PDF   1.0 – 1.7 (Acrobat 1 - 9)
Adobe PDF Package
(Extraction, conversion, viewing)
 PDF  
Adobe PDF Portfolio
(Extraction, conversion, viewing)
   
Adobe Photoshop  PSD 4.0
Adobe Photoshop (XMP only)   8.0 – 10.0 (CS 1 – 3)
Ami Draw  AMI SDW
AutoCAD Drawing  DWG 2.5, 2.6
AutoCAD Drawing  DWG 9.0 – 14.0
AutoCAD Drawing  DWG 2000 - 2007
AutoShade Rendering  RND 2
CALS Raster (GP4)  CAL Type I
CALS Raster (GP4)  CAL Type II
Computer Graphics Metafile  CGM ANSI
Computer Graphics Metafile  CGM CALS
Computer Graphics Metafile  CGM NIST
Corel Draw  CDR 2.0 – 9.0
Corel Draw Clipart   5.0, 7.0
Encapsulated PostScript (EPS)  EPS TIFF header Only
Enhanced Metafile (EMF)  EMF  
Escher graphics  EGR  
FrameMaker Graphics (FMV)  FMV 3.0 – 5.0
Gem File (Vector)    
GEM Image (Bitmap)    
Graphics Interchange Format (GIF)  GIF  
Harvard Graphics Chart DOS  CH3 2.0 – 3.0
Harvard Graphics for Windows  CH3  
HP Graphics Language  HPGL 2.0
IBM Graphics Data Format (GDF)  GDF 1.0
IBM Picture Interchange Format  PCX 1.0 s
IGES Drawing   5.1 – 5.3
JBIG2  JB2 Graphic Embeddings in PDF
JFIF (JPEG not in TIFF format)  JFIF  
JPEG  JPEG  
JPEG 2000  JP2 JP2
Kodak Flash Pix  FPX  
Kodak Photo CD  KODAK 1.0
Lotus PIC  PIC  
Lotus Snapshot    
Macintosh PIC   BMP only
Macintosh PICT2   BMP only
MacPaint    
Micrografx Designer  DSF Through 3.1
Micrografx Designer  DSF 6.0
Micrografx Draw  DRW Through 4.0
Microsoft Windows Bitmap  BMP  
Microsoft Windows Cursor  CUR  
Microsoft Windows Icon  ICO  
Microsoft XPS (Text only)  XPS  
Novell PerfectWorks Draw   2
OpenOffice Draw   1.1 – 3.x
OS/2 Bitmap    
OS/2 Warp Bitmap    
Paint Shop Pro (Win32 only)   5.0, 6.0
PC Paintbrush (PCX)  PCX  
PC Paintbrush DCX (multi-page PCX)  PCX  
Portable Bitmap (PBM) PMB  
Portable Graymap PGM  PGM  
Portable Network Graphics (PNG)  PNG  
Portable Pixmap (PPM)  PPM  
Progressive JPEG  JPG  
StarOffice Draw
(v9 – Extraction, conversion, viewing)
 SDA 6.x – 9
Sun Raster  RAS  
TIFF Group  TIFF, TIF Group 5 & 6
TIFF CCITT Group   Group 3 & 4
TruVision TGA (Targa)  TGA 2.0
Visio  VSX 5.0 - 2007
Visio (Page Preview mode WMF/EMF)   4.0
Visio XML VSX (File ID only)   2007
WBMP wireless graphics format  WBMP  
Windows Metafile  WMF  
Word Perfect Graphics
(X4 – Extraction, conversion, viewing)
 WPG 1.0, 2.0 – 10.0, X4
X-Windows Bitmap  XBM x10 compatible
X-Windows Dump   x10 compatible
X-Windows Pixmap   x10 compatible

 

Database

 

Name File Extension Type or Version
DataEase  DBA 4.x
DBase  DBF III, IV, V
First Choice DB  PFS Through 3.0
Framework DB  FW 3.0
Microsoft Access  MDB 1.0, 2.0
Microsoft Works DB for DOS  WDB 2.0
Microsoft Works DB for DOS   1.0
Microsoft Works DB for Macintosh   1.0
Microsoft Works DB for Windows   3.0, 4.0
Paradox for DOS  DB 2.0 – 4.0
Paradox for Windows   2.0 – 4.0
Q&A Database   Through 2.0
R:Base   R:Base 5000
R:Base   R:Base System V
Reflex   2.0
SmartWare II DB  DB 1.02

 

All Multimedia (Sound and Video)

 

Name File Extension  Type or Version
AVI (Metadata extraction only)  AVI  
Flash (text extraction only)  FLA 6.x, 7.x, Lite
MP3 (ID3 metadata only)  MP3  
MPEG-1 Audio layer 3 V ID3 v1 (File ID only)  MPG  
MPEG-1 Audio layer 3 V ID3 v2 (File ID only)    
MPEG-1 Video V 2 (File ID only)    
MPEG-1 Video V 3 (File ID only)    
MPEG-2 Audio (File ID only)    
MPEG-4 (Metadata extraction only)    
MPEG-7 (Metadata extraction only)    
QuickTime (Metadata extraction only)  QTFF  
Real Media - (File ID only)  RM  
WAV (Metadata extraction only)  WAV  
Windows Media ASF (Metadata extraction only)  ASF  
Windows Media Audio WMA (Metadata extraction only)  WMA  
Windows Media DVR-MS (Metadata extraction only)  DVR-MS  
Windows Media Video WMV (Metadata extraction only)  WMV  
All Multimedia (Sound and Video)    

 

Other Types

 

Name File Extension Type or Version
Microsoft Project (File ID only)  PRJ 2007
Microsoft Project (text only)   98 – 2003
Microsoft Windows Executable    
vCalendar  VCAL 2.1
vCard  VCAR 2.1
Yahoo! Messenger  YMG 6.x – 8

 

Supported Container Extraction File Types

By default, the application will extract the contents of the following container file types during processing. Container extraction can be disabled during case setup.

 

Name File Extension Type or Version File ID Cannot Exclude
ZIP  ZIP   1802  
RAR RAR   1821  
TAR  TAR   1807  
LZH (and LHA)  LZH   1813, 1814  
JAR  JAR   1802  
GZIP  GZIP   1815  
Self-Extracting ZIP files (.exes)  EXE   1803  
Self-Extracting RAR files (.exes)  EXE   1822  
UNIX_COMP  LIB   1806  
BZ2 (bzip2)  BZIP2   65537  
7Zip  7Z   65538,1826, 1827  
LEF (.L01)  L01 V5.1 and later  
E01  E01 V5.1 and later  
MBOX  MBOX   1817
OST  OST  V8.3 CHF2+ 65545  

Detection of supported container files is performed by looking at the actual file content, not simply by file extension. As a result, it is possible that additional formats are also supported because they are in fact identical to the officially supported formats. For example, DEB and AR files are usually similar enough to TAR that they can be extracted.

If an unsupported container format is encountered, it will be treated as a loose file/attachment during processing.

Note: Container File IDs may be useful for the "Not Processed Documents", "Other Type - Extensions", "Processing Reconciliation" reports.

 

File Type Mapping

 

File Type Mapping
Adobe Acrobat PDF, PDFIMAGE
Microsoft Word WORD4, WORD5, MACWORD3, MACWORD4, WINWORD1, WINWORD1COMPLEX, WINWORD2, MACWORD5, WORD6, WINWORD6, WINWORD1J, WINWORD5J, WINWORD2_OLECONV, WINWORD7, MACWORD6, WINWORD97, MACWORD97, WINWORD2000, WINWORD2002, WINWORD2003, WORDXML12, WINWORD2007, ENCRYPTED_WORD2007, WINWORDTEMPLATE2007, DRM_WORD, DRM_WORD2007
Microsoft Excel EXCEL, EXCEL3, EXCEL4, EXCEL5, MACEXCEL4, MACEXCEL5, EXCEL97, EXCEL3WORKBOOK, EXCEL4WORKBOOK, MACEXCEL4WORKBOOK, REGMACEXCEL4WB, EXCEL2000, EXCEL2002, EXCEL2003, EXCEL2007, ENCRYPTED_EXCEL2007, EXCEL2007_BINARY, DRM_EXCEL, SSEND
Microsoft Power Point POWERPOINT4, POWERPOINT3, POWERPOINT7, POWERPOINTMAC3, POWERPOINTMAC4, EXTPOWERPOINT4, EXTPOWERPOINTMAC4, POWERPOINTMACB3, POWERPOINTMACB4, POWERPOINT97, POWERPOINT9597, POWERPOINT2000, POWERPOINT2, POWERPOINT2007, ENCRYPTED_PPT2007, DRM_POWERPOINT, DRM_POWERPOINT2007
Email (.eml file) MIMEOUTLOOKEML, TEXTMAIL, MIMEMAIL, EMLX
Email (.msg file) OUTLOOK_MSG
All word processing documents WORD4, WORD5, WORDSTAR5, WORDSTAR4, WORDSTAR2000, WORDPERFECT5, MULTIMATE36, MULTIMATEADV, RFT, TXT, SMART, SAMNA, PFSWRITEA, PFSWRITEB, PROWRITE1, PROWRITE2, IBMWRITING, FIRSTCHOICE, WORDMARC, DIF, VOLKSWRITER, DX, SPRINT, WORDPERFECT42, TOTALWORD, IWP, WORDSTAR55, WANGWPS, RTF, MACWORD3, MACWORD4, MASS11PC, MACWRITEII, XYWRITE, FFT, MACWORDPERFECT, DISPLAYWRITE4, MASS11VAX, WORDPERFECT51, MULTIMATE40, QAWRITE, MULTIMATENOTE, PCFILELETTER, MANUSCRIPT1, MANUSCRIPT2, ENABLEWP, WINWRITE, WORKS1, WORKS2, WORDSTAR6, OFFICEWRITER, MACWORD4COMPLEX, DISPLAYWRITE5, WINWORD1, WINWORD1COMPLEX, AMI, AMIPRO, FIRSTCHOICE3, MACWORDPERFECT2, MACWORKSWP2, PROWRITEPLUS, LEGACY, SIGNATURE, WINWORDSTAR, WINWORD2, JUSTWRITE, WORDSTAR7, WINWORKSWP, JUSTWRITE2, AMICLIP, LEGACYCLIP, PROWRITEPLUSCLIP, MACWORD5, ENABLEWP4, WORDPERFECT6, WORD6, DX31, WPFENCRYPT, QAWRITE3, MACWORDPERFECT3, CEOWORD, WINWORD6, WORDPERFECT51J, ICHITARO3, ICHITARO4, WINWORD1J, WINWORD5J, MATSU4, MATSU5, P1, RTFJ, CEOWRITE, WINWORKSWP3, WORDPAD, WPFUNKNOWN, WINWORD2_OLECONV, WORDPERFECT61, FTDF, WORDPERFECT5E, WORDPERFECT6E, HTML, WINWORD7, AREHANGEUL, HANA, WINWORKSWP4, PERFECTWORKS1, WORDPERFECT7, WORDPRO, HTML_LATIN2, HTML_JAPANESESJIS, HTML_JAPANESEEUC, HTML_CHINESEBIG5, HTML_CHINESEEUC, HTML_CHINESEGB,
HTML_KOREANHANGUL, HTML_CYRILLIC1251, HTML_CYRILLICKOI8,CYRILLIC1251, CYRILLICKOI8, WWRITE_SHIFTJIS, WWRITE_CHINESEGB, WWRITE_HANGEUL, WWRITE_CHINESEBIG5, WPSPLUS,  ACWORD6, WINWORD97, RAINBOW, INTERLEAF, MACWORD97, INTERLEAFJ, WORDPERFECT8, ICHITARO8, VCARD, HTML_CSS, POCKETWORD, WORDPRO97, WINWORD2000, W2KHTML, XL2KHTML, PP2KHTML, XML, WML, WMLB, HTML_JAPANESEJIS, WML_CHINESEBIG5, WML_CHINESEEUC, WML_CHINESEGB, WML_CYRILLIC1251, WML_CYRILLICKOI8, WML_JAPANESEJIS, WML_JAPANESESJIS, WML_JAPANESEEUC, WML_KOREANHANGUL, WML_LATIN2, WML_CSS, STAROFFICEWRITER52, MIFF6, MIFF6J, MIFF, JAVASCRIPT, TEXT, HDML, CHTML, XHTMLB, HTMLAG, HTMLWCA, SEARCHML, POCKETWORD20, WIRELESSHTML, HANGULWP97, HANGULWP2002, HTMLUNICODE, XML_DOCTYPE_HTML, PAGEML, EBCDIC, WINWORD2002, WINWORD2003, MIME, STAROFFICEWRITER6, OUTLOOK_PST, XHTML, MSWORKS2000, MIMENEWS, MIMEOUTLOOKNEWS, VCAL, TNEF, MHTML, WPEND, SMARTDATA, FRAMEWORKIII, WORKSDATA, DATAEASE, MSPROJECT98, MSPROJECT2000, SEARCHTEXT, PSTF, PST_2003, PAB_2002, SEARCHML20, SEARCHML30, YAHOOIM, WORDXML2003, WORDXML12, STAROFFICEWRITER8, SEARCHML31, OUTLOOK_OFT, WINWORD2007, ENCRYPTED_WORD2007, WINWORDTEMPLATE2007, SEARCHML32,  RM_UNKNOWN, DRM_WORD, DRM_WORD2007
All spreadsheets SYMPHONY1, 123R1, 123R2, 123R3, SMARTSHEET, EXCEL, ENABLESHEET, WORKSSHEET, VPPLANNER, TWIN, SUPERCALC5, QUATTROPRO, QUATTRO, PFS_PLAN, FIRSTCHOICE_SS, EXCEL3, GENERIC_WKS, MACWORKSSS2, WINWORKSSS, EXCEL4, QUATTROPROWIN, 123R4, QUATTROPRO1J, CEOSS, EXCEL5, MULTIPLAN4, WINWORKSSS3, QUATTROPRO4, QUATTROPRO5, QUATTROPRO6, 123R2OS2, 123R2OS2CHART, WINWORKSSS4, QUATTROPRO7NB, QUATTROPRO7GR, 123R6, MACEXCEL4, MACEXCEL5, EXCEL97, EXCEL3WORKBOOK, EXCEL4WORKBOOK, MACEXCEL4WORKBOOK, REGMACEXCEL4WB, 123R9, QUATTROPRO8, QUATTROPRO9NB, EXCEL2000, QUATTROPRO10NB, EXCEL2002, STAROFFICECALC52, QUATTROPRO11NB, EXCEL2003, STAROFFICECALC6, QUATTROPRO12NB, STAROFFICECALC8, EXCEL2007, EXCEL2007_BINARY, DRM_EXCEL, SSEND
All images BMP, TIFF, PCX, GIF, EPSTIFF, CCITTGRP3, MACPICT2, WPG, WINDOWSMETA, LOTUSPIC, MACPICT1, AMIDRAW, TARGA, GEMIMG, OS2DIB, WINDOWSICON, WINDOWSCURSOR, MICROGRAFX, MACPAINT, WPG2, CGM, CANDY4, HANAKO1, HANAKO2, JPEGFIF, DCX, OS2METAFILE, DXFA, DXFB, DXB, OS2WARPBMP, WPG7, SUNRASTER, KODAKPCD, ENHWINDOWSMETA, GEM, IGES, IBMPIF, XBITMAP, XPIXMAP, CALSRASTER, PNG, XDUMP, GDF, DESIGNER, PBM, PGM, PPM, ADOBEPHOTOSHOP, PAINTSHOPPRO, FLASHPIX, PROGRESSIVEJPEG, DGN, BMP5, WBMP, MIFFG, WPG10, EXPORTIMAGE, OS2V2BMP
All multimedia (sound and video) RIFFWAVE, RIFFAVI, MIDI, DIRECTOR, FLASH6, QUICKTIME, MP3_ID31, MP3_ID32, ID31, ID32, MP3, MPGAV1L1, MPGAV1L2, MPGAV2L1, MPGAV2L2, MPGAV2L3, ASF, WMV, WMA, DVR_MS, REALMEDIA, MPEG1, MPEG2, ISOBASEMEDIAFILE, MPEG4, MULTIMEND
All programs EXECUTABLE, COM, ZIPEXE, MSCAB
Other types (file types not found above)

 

Issue/Introduction

File types that are supported for ingestion into eDiscovery Platform