MuckRock and NYU researchers collaborate to help journalists tap AI to tackle growing document releases

MuckRock and NYU researchers collaborate to help journalists tap AI to tackle growing document releases

Combined with training and financial support, Gumshoe integration with DocumentCloud helps small newsrooms tackle big stories

Written by

In collaboration with MuckRock, NYU Professor of Journalism and Emmy Award-Winning reporter Hilke Schellmann and NYU Senior Research Scientist and Tandon Faculty Member Mona Sloane have been awarded $200,000 grant from the Patrick J. McGovern Foundation to bring an innovative AI tool to under-resourced newsrooms to significantly scale up their investigative capacity and democratize access to FOIA records.

The project will integrate the NYU-developed Gumshoe prototype - a Natural Language Processing Tool that identifies relevant and irrelevant sections in large text corpora – with MuckRock’s DocumentCloud platform to help journalists effectively comb through thousands of Freedom of Information Request (FOIA) releases and other document sets. MuckRock is an open source journalism platform used by tens of thousands of journalists across 4,000 newsrooms to help request, analyze, and publish public documents. The effort will unlock decades of valuable information, data and history contained in federal government records released under the Freedom of Information Act.

“Effective use of the Freedom of Information Act is a key public tool to encourage transparent, accountable government activity,” said Vilas Dhar, president of the Patrick J. McGovern Foundation. “This innovative application of AI by New York University and MuckRock makes important records accessible to under-resourced newsrooms that are shining a light on our country’s most important social justice stories.”

Gumshoe is an innovative NLP-based text analyzing tool that was developed in a collaboration between Schellmann and Sloane and a team of graduate students at the NYU Center for Data Science, under the supervision of NYU Professor of Computer Science Julia Stoyanovich. Gumshoe sorts and ranks content in FOIA requests for journalists. Because it is based on NLP technology, Gumshoe understands the meaning of words in context and “learns” about the individual investigation of the journalist over time - like a shoe that fits better with every walk.

“We built this tool, because as a freelance journalist I was frustrated with combing through large Freedom of Information requests. I knew other journalists had the same problem and I am so delighted the Patrick J. McGovern Foundation is making it possible for us to bring this AI-tool to small newsrooms across America that really need this,” said Hilke Schellmann, Professor of Journalism at NYU’s Graduate School of Arts and Science. “Journalists in underserved newsrooms need this AI tool and the training MuckRock will provide to hold the powerful accountable and help strengthen democracy.”

Through integration with MuckRock’s open source DocumentCloud platform, Gumshoe will help journalists on a wide variety of critical stories.

“We’ve seen a widening disparity between the newsrooms that get access to the advanced tools and support needed to tackle complex analysis and those struggle to fund basic resources and training,” said Michael Morisy, MuckRock’s co-founder and chief executive. “I’m grateful for the McGovern Foundation’s support of this collaboration with Hilke and Mona, which will help us start to address that gap while supporting local accountability reporting.”

Over the past year, MuckRock has been expanding the way that it collaborates with local organizations, particularly those serving communities of color and innovative new outlets helping fill the voids left through news deserts where legacy media no longer exist. This includes extensive editorial partnerships helping dig into the impact of the COVID-19 pandemic as well as a revamped FOIA training and coaching program. These expanded collaborations, which have resulted in tens of thousands of newly released documents, provides the perfect opportunity to rapidly scale the impact of Gumshoe by integrating with MuckRock’s platform while further expanding editorial and investigative support to newsrooms around the country.

Along with access to the tool, the collaboration will offer select newsrooms additional training, editorial guidance, and financial support in the form of microgrants and training stipends to ensure that key stories in communities around the country can be told while journalists from a wide variety of backgrounds have an opportunity to put advanced technology to work in their reporting. These close collaborations will be leveraged to iteratively improve both Gumshoe and MuckRock’s approach to building intuitive and impactful transparency tools.


The Patrick J. McGovern Foundation is a global 21st century philanthropy bridging the frontiers of artificial intelligence, data science and social impact to create a thriving, equitable and sustainable future for all The Foundation’s work focuses on bringing together academia, practitioners, and civil society to pursue the potential of AI and data science to address some of the world’s most urgent challenges.