Input Formats
Name | File Extension | MIME Type | Description |
---|---|---|---|
PLAIN | .txt | text/plain | Plain text (UTF-8) |
XLINE | .xline | text/x-line | Plain text, one sentence per line |
HTML | .htm, .html, .xhtml | text/html | HTML |
XML | .xml | text/xml | XML |
SDLXML | .sdlxml | text/sdlxml | Treats every closing XML tag in the input as the end of a segment. The XML format, in contrast, does not make this assumption. |
TMX | .tmx | text/x-tmx | Translation Memory eXchange |
XLIFF | .xliff | application/x-xliff | XML Localization Interchange File Format |
BCM | .bcm | application/x-json-bcm | Proprietary format |
PDF | .pdf | application/pdf | Adobe Acrobat (PDF) |
RTF | .rtf | application/rtf | Rich Text Format (RTF) |
DOCX | .docx, .dotx, .docm, .dotm | application/vnd.openxmlformats-officedocument.wordprocessingml.document | Microsoft Word (Office Open XML) |
XLSX | .xlsx, .xltx, .xlsm, .xltm, .xlam, .xlsb | application/vnd.openxmlformats-officedocument.spreadsheetml.sheet | Microsoft Excel (Office Open XML) |
PPTX | .pptx, .potx, .ppsx, .pptm, .potm, .ppsm | application/vnd.openxmlformats-officedocument.presentationml.presentation | Microsoft PowerPoint (Office Open XML) |
DOC | .doc, .dot | application/msword | Microsoft Word (97-2003) |
XLS | .xls, .xlt, .xla | application/vnd.ms-excel | Microsoft Excel (97-2003) |
PPT | .ppt, .pot, .pps | application/vnd.ms-powerpoint | Microsoft PowerPoint (97-2003) |
ODT | .odt | application/vnd.oasis.opendocument.text | OpenDocument Text |
ODS | .ods | application/vnd.oasis.opendocument.spreadsheet | OpenDocument Spreadsheet |
ODP | .odp | application/vnd.oasis.opendocument.presentation | OpenDocument Presentation |
GIF | .gif | image/gif | Graphics Interchange Format (GIF) |
JPG | .jpg, .jpeg | image/jpeg | JPEG |
PNG | .png | image/png | Portable Network Graphics (PNG) |
TIF | .tif | image/tif | Tagged Image File (TIF) |
TIFF | .tiff | image/tiff | Tagged Image File Format (TIFF) |
EML | .eml | message/rfc822 | E-Mail Message |
MSG | .msg | application/vnd.ms-outlook | Outlook Message Item File |
Note
Image input formats are not available for language detection.
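
As an illustration of how a client might use the table above, the following is a minimal sketch that maps a file name to its format name and MIME type, and checks whether the format can be used for language detection. The names `INPUT_FORMATS`, `lookup_input_format`, and `supports_language_detection` are hypothetical helpers and not part of any documented API. Only a subset of the table is included, and treating every `image/` MIME type as an image input format is an assumption drawn from the table rows.

```python
from pathlib import Path

# Hypothetical lookup table: a subset of the rows above,
# keyed by file extension -> (format name, MIME type).
INPUT_FORMATS = {
    ".txt": ("PLAIN", "text/plain"),
    ".xline": ("XLINE", "text/x-line"),
    ".htm": ("HTML", "text/html"),
    ".html": ("HTML", "text/html"),
    ".xhtml": ("HTML", "text/html"),
    ".xml": ("XML", "text/xml"),
    ".tmx": ("TMX", "text/x-tmx"),
    ".docx": (
        "DOCX",
        "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
    ),
    ".gif": ("GIF", "image/gif"),
    ".jpg": ("JPG", "image/jpeg"),
    ".jpeg": ("JPG", "image/jpeg"),
    ".png": ("PNG", "image/png"),
    ".eml": ("EML", "message/rfc822"),
}


def lookup_input_format(filename):
    """Return (format name, MIME type) for a file name, based on its extension."""
    ext = Path(filename).suffix.lower()  # extensions are matched case-insensitively here
    if ext not in INPUT_FORMATS:
        raise ValueError(f"Unsupported or unlisted input extension: {ext!r}")
    return INPUT_FORMATS[ext]


def supports_language_detection(filename):
    """Assumption: image formats are the rows whose MIME type starts with 'image/'."""
    _, mime_type = lookup_input_format(filename)
    return not mime_type.startswith("image/")


if __name__ == "__main__":
    for name in ("notes.txt", "report.DOCX", "scan.png"):
        fmt, mime = lookup_input_format(name)
        print(name, fmt, mime, supports_language_detection(name))
```

A lookup keyed by extension keeps the check local to the client. A real integration would presumably extend the dictionary to every row of the table and confirm the accepted values against the service itself.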