Technology-driven fraud detection research using publicly available data. We identify and document government program fraud, corporate wage suppression, and healthcare billing abuse through open-source analysis.
Greater corporate transparency is essential to a fairer, more prosperous and sustainable society. We believe open access to public data is the foundation of accountability.
— Adapted from OpenCorporates Public Benefit Statement
DB1K Research is a technology-driven investigative research operation that detects fraud, waste, and abuse in government programs using exclusively public data sources. We support whistleblowers, regulatory agencies, and investigative journalists with data analysis, case documentation, and technical expertise.
All of our work is built on publicly available data. We do not access non-public systems, hack databases, or engage in any activity outside the terms of service of the data platforms we use.
Figures based on detected anomalies in CMS Medicare data, DOL H-1B filings, and SBA PPP loan records across target states.
Cross-referencing DOL OFLC wage data with USCIS H-1B petitions to identify employers systematically underpaying foreign workers relative to prevailing wages — in violation of H-1B program requirements. 22 active cases detected.
Analyzing CMS Medicare billing data against Open Payments records to detect upcoding patterns and kickback correlations where providers increased billing after receiving pharmaceutical payments. 69 provider-level anomaly patterns detected.
Identifying shell companies and newly-formed entities that received suspicious PPP loans based on incorporation date, loan amount relative to business type, and correlation with H-1B filings. 48 high-risk entities identified.
State-level analysis of CCDF (Child Care Development Fund) program vulnerabilities, targeting states with weak oversight ratings and high estimated fraud rates including Minnesota, Kentucky, Ohio, and Maine.
All data sources below are publicly and freely available. We apply for at-scale access to improve coverage and reduce query latency for ongoing monitoring.
OpenCorporates — The world's largest open database of company data, used under the ODbL license. API access begins July 1, 2026.
Our current methodology relies on bulk-downloadable datasets and individual API queries. At-scale access to corporate registry data (via OpenCorporates) and other aggregated databases would allow us to:
Our work directly supports the public benefit missions of open data platforms:
If you represent a data platform with a public-benefit access program, we welcome the opportunity to discuss our methodology and how at-scale data access would enhance our fraud detection research.
✉ kevin@db1k.online