documentcloud

1 Project

DockIns: Machine Learning on Deadline for Journalists

As journalists dealing with data and document sets, we find that the most interesting information is usually hidden in large, unstructured, and incomplete sets of documents. This is especially true of public contracts: what the government is buying, how much money is being spent, and who the suppliers are. To answer these questions, four media organizations (La Nacion, CLIP, Ojo Público, and MuckRock) joined forces under the JournalismAI Collab and experimented with machine learning tools and techniques to build a platform that helps investigative reporters process unstructured documents and extract useful insights.

75 Articles

An upside-down stock photo of documents in Russian and manila envelopes.

Release Notes: Making it easier to sort, filter and reprocess document OCR

Since our last release notes, we released a new Add-On, OCR Tagger, that lets you tag your documents based on the OCR engine used, and we added better logging for when scheduled Add-Ons like Klaxon or Scraper get disabled. This makes it easier to diagnose and correct outages that affect Add-Ons.

Screenshot of DocumentCloud showing available documents sorted by key value pairs

Release Notes: Improved sorting, new revision control documentation and more

In recent weeks, we’ve rolled out a few updates on DocumentCloud. Users can now sort documents on DocumentCloud by their key/value pairs. We’ve documented API access for document revision control. Finally, a fellow DocumentCloud user contributed a write-up on how to run your own version of Klaxon.

Automate your beat: Unredact documents, monitor websites and much more with DocumentCloud

Ever had a spreadsheet-turned-PDF you’re stuck untangling? Wish story ideas came right to you? Over the past two years, MuckRock’s DocumentCloud tool has built several ways to automate common journalism and research tasks, taking once-cumbersome processes and breaking them down to just a few clicks.

Black redacted bars with the words, For the Record

For the Record: Sunshine Week, in review

Sunshine Week shines a light on the importance of public records and open government.

A picture of a collection of books, organized by color on green square shelves.

Release Notes: Enhanced project management, an easier way to count pages and other MuckRock and DocumentCloud updates

In the last two weeks, the MuckRock team released the ability to pin your projects on DocumentCloud, the ability to update your card on file without making a new purchase, and several Add-On improvements, including two new Add-Ons: OCR Scheduler and Page Counter.
