Release Notes: DocumentCloud Beta now supports over 70 document types

Release Notes: DocumentCloud Beta now supports over 70 document types

The update allows uploading Word documents, images, and many more formats

Written by
Edited by Michael Morisy

DocumentCloud has historically offered its users the ability to upload documents in many different file formats. The latest upgrades to the DocumentCloud Beta bring this functionality to our new platform, allowing users to upload nearly any document file with ease. Whether your documents are PDFs, Microsoft Word files, text files, images, slideshows, or something else, you can now upload them to our web platform by simply dragging them into your document collection or selecting them from the upload dialog. Over 70 formats are supported, spanning document, presentation, spreadsheet, graphics, and other formats.

For previous site improvements, check out all of MuckRock’s release notes, and if you’d like updates emailed to you — along with ways to help contribute to the site’s development yourself — subscribe to our developer newsletter here.

Powerful, simple document conversion

PDF, or portable document format, is the de facto file format of DocumentCloud — it’s a universal way to display documents and capture the nuances contained within. When you upload non-PDF documents (e.g. Microsoft Word files) into DocumentCloud, they are processed behind the scenes and converted to PDF documents. With the new update, this happens seamlessly and is as easy as merely dragging the files you want into the upload file dialog and hitting “Begin upload.”

null

This fast and efficient document conversion is powered behind the scenes by LibreOffice, an established open source document editing platform. Each non-PDF document can be up to 25 MB (PDF files have a much larger size limit of 500 MB). Here is a comprehensive table of the currently supported file formats:

File format Supported extensions
AbiWord ABW, ZABW
Adobe PageMaker PMD, PM3, PM4, PM5, PM6, P65
AppleWorks word processing CWK
Adobe FreeHand AGD, FHD
Apple Keynote KTH, KEY
Apple Numbers Numbers
Apple Pages Pages
BMP file format BMP
Comma-separated values CSV, TXT
CorelDRAW 6-X7 CDR, CMX
Computer Graphics Metafile CGM
Data Interchange Format DIF
DBase, Clipper, VP-Info, FoxPro DBF
DocBook XML
Encapsulated PostScript EPS
Enhanced Metafile EMF
FictionBook FB2
Gnumeric GNM, GNUMERIC
Graphics Interchange Format GIF
Hangul WP 97 HWP
HPGL plotting file PLT
HTML HTML, HTM
Ichitaro 8/9/10/11 JTD, JTT
JPEG JPG, JPEG
Lotus 1-2-3 WK1, WKS, 123, wk3, wk4
Macintosh Picture File[69] PCT
MathML MML
Microsoft Excel 2003 XML XML
Microsoft Excel 4/5/95 XLS, XLW, XLT
Microsoft Excel 97–2003 XLS, XLW, XLT
Microsoft Excel 2007-2016 XLSX
Microsoft Office 2007-2016 Office Open XML DOCX, XLSX, PPTX
Microsoft PowerPoint 97–2003 PPT, PPS, POT
Microsoft PowerPoint 2007-2016 PPTX
Microsoft Publisher PUB
Microsoft RTF RTF
Microsoft Word 2003 XML (WordprocessingML) XML
Microsoft Word DOC, DOT, DOCX
Microsoft Works WPS, WKS, WDB
Microsoft Write WRI
Microsoft Visio VSD
Netpbm format PGM, PBM, PPM
OpenDocument ODT, FODT, ODS, FODS, ODP, FODP, ODG, FODG, ODF
Open Office Base ODB
OpenOffice.org XML SXW, STW, SXC, STC, SXI, STI, SXD, STD, SXM
PCX PCX
Photo CD PCD
PhotoShop PSD
Plain text TXT
Portable Document Format PDF
Portable Network Graphics PNG
QuarkXPress 3–4 QXP
Quattro Pro 6.0 WB2, wq1, wq2
Scalable vector graphics SVG
SGV SGV
Software602 (T602) 602, TXT
StarOffice StarCalc 3/4/5 SDC, VOR
StarOffice StarDraw/StarImpress SDA, SDD, SDP, VOR
StarOffice StarWriter 3/4/5 SDW, SGL, VOR
Star Writer graphics SGF
Sony Broad Band eBook RLF
SunOS Raster RAS
SVM SVM
SYLK SLK
Tagged Image File Format TIF, TIFF
Truevision TGA (Targa) TGA
Unified Office Format UOF, UOT, UOS, UOP
WordPerfect WPD
WordPerfect Suite 2000/Office 1.0 WPS
X BitMap XBM
X PixMap XPM
Zoner Draw ZMF

A need for speed

We’ve also released a number of caching and speed improvements over the past few weeks, so DocumentCloud’s interface should be more responsive and faster, particularly for searching for organizations and when conducting common queries, such as pulling up all of your own documents.


Image via Wikimedia Commons