Back to search

claim identifier

clm_id
claims·Updated May 29, 2026

Definition

ISO-11179 Definition

The claim identifier (clm_id) is the unique primary key assigned to each healthcare claim submitted by a provider or facility for reimbursement of services rendered to a patient. In the HIPAA 837 EDI transaction standard, the claim identifier appears in the CLM01 element of the CLM (Claim Information) segment and must be unique within the context of a trading partner relationship — it is the provider's own internal reference number for the claim, not a number assigned by the payer. When the payer adjudicates the claim and returns a remittance (835 transaction), the same claim ID appears in the CLP01 element of the Claim Payment/Adjustment (CLP) segment, enabling reconciliation between the original claim and the payment response.

Different systems may call this field claim number, claim reference number, document control number (DCN), or internal control number (ICN). The claim identifier is the foundational join key in any claims data model, linking the claim header to claim lines, diagnosis codes, remittance records, and authorization data. Poor claim ID management is one of the most common sources of claims data quality failures: duplicate claim IDs from different source systems appearing in the same data warehouse, claim IDs that change when a claim is reprocessed or corrected, and claim IDs that are not globally unique when combining claims from multiple payers or clearinghouses into a single analytical platform.

These problems cause double-counting of paid amounts, incorrect denial rate calculations, and failed revenue cycle reconciliations. Healthcare data engineers store clm_id as VARCHAR(50) to accommodate the full range of claim identifier formats used across payer and provider systems — some are short numeric sequences, others are long alphanumeric strings with date and system prefix components. When combining claims from multiple sources, a composite surrogate key is often necessary: concatenating the source system identifier with the original claim ID creates a globally unique key that prevents collision.

Critical data quality checks include deduplication on (clm_id, svc_from_dt, billed_amt, billing_npi) before loading into the warehouse, validation that clm_id is never null, and a referential integrity check that every claim line record links to an existing claim header. Claim IDs must be preserved through reversals and reprocessing cycles to maintain a complete audit trail of the claim lifecycle from submission through final payment.

Standard Abbreviation

clm_id

Category

claims

Production DDL — FACT_CLAIM_HEADER

FACT_CLAIM_HEADER.sql
CREATE OR REPLACE TABLE FACT_CLAIM_HEADER (
    clm_key        INTEGER        NOT NULL  -- surrogate key,
    clm_id         VARCHAR(50)    NOT NULL  -- claim identifier,
    mbr_key        INTEGER        NOT NULL  -- FK to DIM_MEMBER,
    prvdr_key      INTEGER        NOT NULL  -- FK to DIM_PROVIDER,
    clm_typ_cd     VARCHAR(10)              -- claim type code,
    clm_stat_cd    VARCHAR(10)              -- claim status code,
    svc_from_dt    DATE                     -- service from date,
    svc_to_dt      DATE                     -- service to date,
    diag_cd_1      VARCHAR(10)              -- principal diagnosis,
    proc_cd        VARCHAR(10)              -- procedure code,
    clm_bill_amt   DECIMAL(18,2)            -- billed amount,
    clm_alwd_amt   DECIMAL(18,2)            -- allowed amount,
    clm_pd_amt     DECIMAL(18,2)            -- paid amount,
    denial_rsn_cd  VARCHAR(10)              -- denial reason code,
    load_dt        TIMESTAMP_NTZ  NOT NULL  -- load timestamp
);

Standard Snowflake DDL for the canonical claims table. Convert to BigQuery or Databricks →

Why This Term Matters

Claims data is the financial backbone of the US healthcare system, and understanding claims terminology is essential for building accurate revenue cycle and reimbursement analytics. Data engineers who know this terminology can correctly parse 837 transactions, identify adjudication errors, and model denial patterns that represent real revenue recovery opportunities. A single misunderstood claims field can result in millions in underpayments identified only after external audits.

Looking for more healthcare terms?

Browse our complete library of 100,000+ standardized healthcare data terms

Browse All Terms