
The Gold Standard of
Healthcare Workforce Data
Overcoming the disparities of association datasets and the gaps of NPI files by building a comprehensive database through state-mandated licensure processes.
Why State Licensure?
Public information about traditional association-based data collection methodology is often non-existent. The National Provider Identifier (NPI) workforce data misses a large percentage of providers over 40 years old and an even larger percentage of non-physician providers.
The Problem with "Standard" Data
Most healthcare workforce data is either voluntarily updated (membership associations) or incomplete because it was never designed for research (NPI numbers). This leads to "dirty data"—outdated records that result in flawed policy and wasted resources.
The ESL Advantage
NCAHD utilizes state-mandated licensure processes as the basis for our data. Because licensure is legally required to practice, the quality and consistency of our provider data is unmatched in the industry.
16 Distinct Provider Types
We collect a core set of elements including licensure number, name, address, status, and effective dates across a massive spectrum of healthcare professionals.
Our Standardization Process
NCAHD has created a rigorous, standardized process for data collection and management that dramatically improves the quality of raw licensure data through six separate sequential phases.
1. Procurement
We contact licensing entities annually to determine and record any regulatory changes regarding the collection or publication of state licensure data. We then procure the data at least annually from all 50 states.
2. Data Normalization
Attributes are inventoried, format converted, and heavily scrubbed to remove duplicates, retired, deceased, overseas military, and inactive licensees.
3. Data Source Integration
Precise practice addresses from all previous alumni tracking are integrated. We perform automated searches against NPI and other online registries as a secondary validation.
4. Multiple State Licenses
Through this step’s QA/QC process we distinguish the main practice address from home address, especially for providers with multiple state licenses.
5. Spatial Aggregation
Using our Geographic Information System (GIS), we assigned each provider’s practice address a spatial identifier for analysis and aggregation into zip code, county, and state totals.
6. Quality Assurance (QA/QC)
County totals are mathematically compared to previous years. Variances over 5% trigger an exhaustive manual re-processing to ensure absolute data fidelity.
Ready to Explore the Data?
Leverage reliable, state-regulated provider insights. See how our national datasets inform better healthcare education planning and effective policy decisions.