DocumentCloud has historically offered its users the ability to upload documents in many different file formats. The latest upgrades to the DocumentCloud Beta bring this functionality to our new platform, allowing users to upload nearly any document file with ease. Whether your documents are PDFs, Microsoft Word files, text files, images, slideshows, or something else, you can now upload them to our web platform by simply dragging them into your document collection or selecting them from the upload dialog. Over 70 formats are supported, spanning document, presentation, spreadsheet, graphics, and other formats.
For previous site improvements, check out all of MuckRock’s release notes, and if you’d like updates emailed to you — along with ways to help contribute to the site’s development yourself — subscribe to our developer newsletter here.
Powerful, simple document conversion
PDF, or portable document format, is the de facto file format of DocumentCloud — it’s a universal way to display documents and capture the nuances contained within. When you upload non-PDF documents (e.g. Microsoft Word files) into DocumentCloud, they are processed behind the scenes and converted to PDF documents. With the new update, this happens seamlessly and is as easy as merely dragging the files you want into the upload file dialog and hitting “Begin upload.”
This fast and efficient document conversion is powered behind the scenes by LibreOffice, an established open source document editing platform. Each non-PDF document can be up to 25 MB (PDF files have a much larger size limit of 500 MB). Here is a comprehensive table of the currently supported file formats:
File format | Supported extensions |
---|---|
AbiWord | ABW, ZABW |
Adobe PageMaker | PMD, PM3, PM4, PM5, PM6, P65 |
AppleWorks word processing | CWK |
Adobe FreeHand | AGD, FHD |
Apple Keynote | KTH, KEY |
Apple Numbers | Numbers |
Apple Pages | Pages |
BMP file format | BMP |
Comma-separated values | CSV, TXT |
CorelDRAW 6-X7 | CDR, CMX |
Computer Graphics Metafile | CGM |
Data Interchange Format | DIF |
DBase, Clipper, VP-Info, FoxPro | DBF |
DocBook | XML |
Encapsulated PostScript | EPS |
Enhanced Metafile | EMF |
FictionBook | FB2 |
Gnumeric | GNM, GNUMERIC |
Graphics Interchange Format | GIF |
Hangul WP 97 | HWP |
HPGL plotting file | PLT |
HTML | HTML, HTM |
Ichitaro 8/9/10/11 | JTD, JTT |
JPEG | JPG, JPEG |
Lotus 1-2-3 | WK1, WKS, 123, wk3, wk4 |
Macintosh Picture File[69] | PCT |
MathML | MML |
Microsoft Excel 2003 XML | XML |
Microsoft Excel 4/5/95 | XLS, XLW, XLT |
Microsoft Excel 97–2003 | XLS, XLW, XLT |
Microsoft Excel 2007-2016 | XLSX |
Microsoft Office 2007-2016 Office Open XML | DOCX, XLSX, PPTX |
Microsoft PowerPoint 97–2003 | PPT, PPS, POT |
Microsoft PowerPoint 2007-2016 | PPTX |
Microsoft Publisher | PUB |
Microsoft RTF | RTF |
Microsoft Word 2003 XML (WordprocessingML) | XML |
Microsoft Word | DOC, DOT, DOCX |
Microsoft Works | WPS, WKS, WDB |
Microsoft Write | WRI |
Microsoft Visio | VSD |
Netpbm format | PGM, PBM, PPM |
OpenDocument | ODT, FODT, ODS, FODS, ODP, FODP, ODG, FODG, ODF |
Open Office Base | ODB |
OpenOffice.org XML | SXW, STW, SXC, STC, SXI, STI, SXD, STD, SXM |
PCX | PCX |
Photo CD | PCD |
PhotoShop | PSD |
Plain text | TXT |
Portable Document Format | |
Portable Network Graphics | PNG |
QuarkXPress 3–4 | QXP |
Quattro Pro 6.0 | WB2, wq1, wq2 |
Scalable vector graphics | SVG |
SGV | SGV |
Software602 (T602) | 602, TXT |
StarOffice StarCalc 3/4/5 | SDC, VOR |
StarOffice StarDraw/StarImpress | SDA, SDD, SDP, VOR |
StarOffice StarWriter 3/4/5 | SDW, SGL, VOR |
Star Writer graphics | SGF |
Sony Broad Band eBook | RLF |
SunOS Raster | RAS |
SVM | SVM |
SYLK | SLK |
Tagged Image File Format | TIF, TIFF |
Truevision TGA (Targa) | TGA |
Unified Office Format | UOF, UOT, UOS, UOP |
WordPerfect | WPD |
WordPerfect Suite 2000/Office 1.0 | WPS |
X BitMap | XBM |
X PixMap | XPM |
Zoner Draw | ZMF |
A need for speed
We’ve also released a number of caching and speed improvements over the past few weeks, so DocumentCloud’s interface should be more responsive and faster, particularly for searching for organizations and when conducting common queries, such as pulling up all of your own documents.
Image via Wikimedia Commons