Public Benefit Research · Data Access Application

DB1K Research

Technology-driven fraud detection research using publicly available data. We identify and document government program fraud, corporate wage suppression, and healthcare billing abuse through open-source analysis.

Our Mission

Greater corporate transparency is essential to a fairer, more prosperous and sustainable society. We believe open access to public data is the foundation of accountability.

— Adapted from OpenCorporates Public Benefit Statement

DB1K Research is a technology-driven investigative research operation that detects fraud, waste, and abuse in government programs using exclusively public data sources. We support whistleblowers, regulatory agencies, and investigative journalists with data analysis, case documentation, and technical expertise.

All of our work is built on publicly available data. We do not access non-public systems, hack databases, or engage in any activity outside the terms of service of the data platforms we use.

Impact to Date

$1.1B
Detected Fraud Losses
139
Active Case Files
3
Fraud Vectors Tracked

Figures based on detected anomalies in CMS Medicare data, DOL H-1B filings, and SBA PPP loan records across target states.

Fraud Vectors Under Investigation

1. H-1B Wage Suppression

Cross-referencing DOL OFLC wage data with USCIS H-1B petitions to identify employers systematically underpaying foreign workers relative to prevailing wages — in violation of H-1B program requirements. 22 active cases detected.

2. Medicare/Healthcare Billing Fraud

Analyzing CMS Medicare billing data against Open Payments records to detect upcoding patterns and kickback correlations where providers increased billing after receiving pharmaceutical payments. 69 provider-level anomaly patterns detected.

3. PPP Loan Fraud

Identifying shell companies and newly-formed entities that received suspicious PPP loans based on incorporation date, loan amount relative to business type, and correlation with H-1B filings. 48 high-risk entities identified.

4. Child Care Subsidy Fraud

State-level analysis of CCDF (Child Care Development Fund) program vulnerabilities, targeting states with weak oversight ratings and high estimated fraud rates including Minnesota, Kentucky, Ohio, and Maine.

Data Sources Used

All data sources below are publicly and freely available. We apply for at-scale access to improve coverage and reduce query latency for ongoing monitoring.

CMS Medicare Data FREE
DOL OFLC H-1B Disclosure FREE
SBA PPP Loan Data FREE
USCIS H-1B Employer Data FREE
SAM.gov FREE
USAspending.gov FREE
SEC EDGAR FREE
IRS 990 Filings FREE
State Business Registries FREE
OpenCorporates APPROVED
NPPES (NPI Registry) FREE
FFIEC HMDA FREE

OpenCorporates — The world's largest open database of company data, used under the ODbL license. API access begins July 1, 2026.

Why We Need At-Scale Data Access

Our current methodology relies on bulk-downloadable datasets and individual API queries. At-scale access to corporate registry data (via OpenCorporates) and other aggregated databases would allow us to:

Public Benefit Alignment

Our work directly supports the public benefit missions of open data platforms:

Data Access Inquiries

If you represent a data platform with a public-benefit access program, we welcome the opportunity to discuss our methodology and how at-scale data access would enhance our fraud detection research.

✉ kevin@db1k.online