Introduction
Healthcare data architects spend days designing schemas from scratch. A Medicare Advantage claims warehouse alone requires 20+ tables, hundreds of columns, carefully defined foreign key relationships, and platform-specific data types. Then the naming debates start: Is it patient_id or mbr_id? Is the service date column date_of_service, svc_dt, or clm_svc_from_dt?
These debates are not academic. Wrong column names fail ISO-11179 validation and create inconsistencies that cost days of remediation when regulators or auditors ask for clean documentation. Wrong data types — using FLOAT for a paid amount instead of DECIMAL(18,2) — introduce rounding errors in financial reconciliation. Wrong platform syntax means DDL that does not execute without manual edits.
While ERwin requires a complex setup for schema generation, you can generate clean DDL in seconds using our free converter.
Convert your first 5 DDLs — No Credit Card RequiredWhat if you could skip all of that and generate a production-ready Star Schema for Snowflake in 30 seconds — with ISO-11179 standard column names built in?
That is exactly what the mdatool AI Data Modeling tool does.
Try it free — no account needed: mdatool.com/tools/modeling
What the Tool Does
The AI Data Modeling tool generates complete database schemas from a plain English description. You make five selections and write one sentence. The tool does the rest.
You select:
- Healthcare domain: Claims & Adjudication, Clinical & EHR, Pharmacy & PBM, Member Enrollment, Provider Network, Quality Measures, Finance & Revenue Cycle, or Population Health
- Output type: Logical model, Physical model, or Both
- Model architecture: Star Schema, Data Vault 2.0, Lakehouse (Delta/Iceberg), One Big Table, 3NF, MongoDB, or Cassandra
- Target platform: Snowflake, BigQuery, Databricks, PostgreSQL, SQL Server, Oracle, Amazon Redshift, or MySQL
- Plain English description of what you need
The tool generates:
- Complete DDL with
CREATE TABLEstatements - ISO-11179 standard column names throughout
- Correct data types for your selected platform
- Foreign key relationships between tables
- Primary keys and surrogate key conventions
- Inline comments on every column
- Platform-specific syntax — not generic SQL renamed
No blank page. No naming debates. No syntax lookups. A complete, executable schema in 30 seconds.
A Real Example — Medicare Advantage Claims Warehouse
Here is what the tool generates for a real Medicare Advantage claims warehouse request.
Input description:
"Medicare Advantage claims data warehouse for a health plan. Include claim header, claim line, member enrollment, provider, and diagnosis tables. Support HEDIS reporting and CMS risk adjustment scoring."
Settings:
- Architecture: Star Schema
- Platform: Snowflake
- Domain: Claims & Adjudication
- Output: Physical Model
Generated DDL:
-- ── FACT TABLE ────────────────────────────────────────────────────────────
CREATE TABLE FACT_CLAIM_HEADER (
CLM_ID VARCHAR(50) NOT NULL,
MBR_KEY INTEGER NOT NULL,
PRVDR_KEY INTEGER NOT NULL,
PLAN_KEY INTEGER NOT NULL,
SVC_FROM_DT_KEY INTEGER NOT NULL,
CLM_TYP_CD VARCHAR(10),
TOT_CHRG_AMT DECIMAL(18,2),
TOT_PAID_AMT DECIMAL(18,2),
TOT_ALWD_AMT DECIMAL(18,2),
CLM_STAT_CD VARCHAR(10),
DIAG_CD_1 VARCHAR(10),
DIAG_CD_2 VARCHAR(10),
PROC_CD VARCHAR(10),
DRG_CD VARCHAR(10),
LOAD_DT TIMESTAMP_NTZ NOT NULL DEFAULT CURRENT_TIMESTAMP,
CONSTRAINT PK_FACT_CLAIM_HEADER
PRIMARY KEY (CLM_ID)
);
-- ── DIMENSION TABLE: MEMBER ───────────────────────────────────────────────
CREATE TABLE DIM_MEMBER (
MBR_KEY INTEGER NOT NULL,
MBR_ID VARCHAR(50) NOT NULL, -- PHI: mask in non-prod
MBR_FIRST_NM VARCHAR(100), -- PHI
MBR_LAST_NM VARCHAR(100), -- PHI
BIRTH_DT DATE, -- PHI
SEX_CD CHAR(1),
STATE_CD CHAR(2),
CNTY_CD VARCHAR(10),
ZIP_CD VARCHAR(10), -- PHI
DUAL_ELIG_IND CHAR(1),
PLAN_ID VARCHAR(20),
EFF_START_DT DATE NOT NULL,
EFF_END_DT DATE,
CURR_ROW_IND BOOLEAN NOT NULL DEFAULT TRUE,
CONSTRAINT PK_DIM_MEMBER
PRIMARY KEY (MBR_KEY)
);
-- ── DIMENSION TABLE: PROVIDER ─────────────────────────────────────────────
CREATE TABLE DIM_PROVIDER (
PRVDR_KEY INTEGER NOT NULL,
PRVDR_NPI VARCHAR(10),
PRVDR_FIRST_NM VARCHAR(100),
PRVDR_LAST_NM VARCHAR(100),
PRVDR_ORG_NM VARCHAR(255),
PRVDR_TYP_CD VARCHAR(20),
PRVDR_SPCLTY_CD VARCHAR(10),
PRVDR_STATE_CD CHAR(2),
PRVDR_ZIP_CD VARCHAR(10),
EFF_START_DT DATE NOT NULL,
EFF_END_DT DATE,
CURR_ROW_IND BOOLEAN NOT NULL DEFAULT TRUE,
CONSTRAINT PK_DIM_PROVIDER
PRIMARY KEY (PRVDR_KEY)
);
-- ── DIMENSION TABLE: DATE ─────────────────────────────────────────────────
CREATE TABLE DIM_DATE (
DT_KEY INTEGER NOT NULL,
FULL_DT DATE NOT NULL,
YR_NBR SMALLINT,
QTR_NBR SMALLINT,
MO_NBR SMALLINT,
WK_NBR SMALLINT,
DAY_OF_WK_NBR SMALLINT,
CONSTRAINT PK_DIM_DATE
PRIMARY KEY (DT_KEY)
);
Key design decisions in this output:
ISO-11179 column names: The tool generates MBR_ID not member_id, PRVDR_NPI not provider_npi_number, CLM_STAT_CD not claim_status. These abbreviations come from the mdatool glossary of 100,000+ healthcare data terms. Any healthcare data engineer picks them up immediately without a decoder ring.
Surrogate keys with _KEY suffix: Dimension tables use integer surrogate keys (MBR_KEY, PRVDR_KEY) that fact tables reference. This is standard Star Schema convention — the natural business key (MBR_ID) is preserved in the dimension table but joins happen on fast integer keys.
DECIMAL(18,2) for amounts, not FLOAT: Financial columns use fixed-precision decimal types. FLOAT introduces rounding errors that cause reconciliation failures in claims financial reporting. The tool knows this distinction and applies it automatically.
TIMESTAMP_NTZ for Snowflake: Snowflake-specific timestamp type that stores no time zone offset — the correct choice for healthcare event timestamps that are already in local time at point of care. Generic SQL would generate TIMESTAMP or DATETIME, which behaves differently across platforms.
SCD Type 2 on dimensions: DIM_MEMBER includes EFF_START_DT, EFF_END_DT, and CURR_ROW_IND for slowly changing dimension tracking. Member demographics — ZIP codes, plan assignments, eligibility status — change over time, and payers need to reconstruct which coverage a member held at the time of any given claim.
Generate your own healthcare schema → mdatool.com/tools/modeling
8 Architectures Supported
The tool supports eight data model architectures, each generating structurally different DDL optimized for its intended use case.
Star Schema Best for analytics and BI reporting. Use when your team uses Tableau, Power BI, or Looker against Snowflake or Redshift. Generates fact and dimension tables with surrogate keys, foreign key relationships, and SCD Type 2 support. The right default for most health plan analytics warehouses, HEDIS measure calculation schemas, and claims financial reporting.
Data Vault 2.0 Best for enterprise data warehouses with multiple source systems and strict audit requirements. Generates hubs (business keys), links (relationships), and satellites (descriptive attributes) with hash keys and immutable append-only loading. The right choice for large health plans aggregating claims + clinical + member data from 10+ source systems where HIPAA audit trail completeness is a compliance requirement.
Lakehouse (Delta Lake / Iceberg) Best for Databricks and Spark-based workloads. Generates Bronze (raw ingestion), Silver (curated), and Gold (aggregated) medallion layer tables with Delta Lake syntax. The right architecture for population health analytics platforms, real-time eligibility streaming pipelines, and ML feature engineering for risk stratification models.
One Big Table (OBT) Best for simple analytics on small teams or single-payer environments. Generates a wide denormalized table optimized for direct query without joins. Useful for rapid prototyping or small-scale claims analysis where schema simplicity matters more than warehouse optimization.
3NF (Third Normal Form) Best for operational transactional systems, not analytics. Generates normalized relational tables without denormalization. Use when building a claims processing system (OLTP) rather than a claims analytics warehouse (OLAP). The tool generates different structures than Star Schema — no fact/dimension split, proper normalization to eliminate redundancy.
MongoDB Document Model Generates JSON schema definitions for document-oriented healthcare data, such as FHIR-native clinical records or unstructured encounter notes where flexible schema is more practical than rigid relational structure.
Apache Cassandra Generates partition-key-optimized table schemas for high-throughput write workloads, such as real-time pharmacy dispensing event streams or IoT health device data ingestion.
Tired of legacy complexity and high pricing?
mdatool offers instant DDL conversion, HL7 support, and AI-driven data modeling for a fraction of the cost of ER/Studio or ERwin.
Try mdatool for Free8 Target Platforms Supported
Each platform gets correct, immediately executable syntax — not generic SQL with a different name at the top.
| Platform | Platform-Specific Output |
|---|---|
| Snowflake | TIMESTAMP_NTZ, VARIANT, NUMBER, clustering key hints, transient table syntax |
| BigQuery | NUMERIC, DATETIME, REPEATED fields, STRUCT types, partition and clustering clauses |
| Databricks | DECIMAL, ARRAY, COMMENT clauses, OPTIMIZE and ZORDER hints, Delta Lake syntax |
| PostgreSQL | SERIAL, JSONB, standard index definitions, sequences |
| SQL Server | IDENTITY, NVARCHAR, clustered index definitions, GO batch separators |
| Oracle | NUMBER types, sequences, tablespace hints |
| Amazon Redshift | DISTKEY, SORTKEY, Redshift-specific compression encodings |
| MySQL | AUTO_INCREMENT, ENGINE=InnoDB, VARCHAR with charset hints |
If you need to convert an existing schema between platforms — for example, migrating a SQL Server claims schema to Snowflake — use the DDL Converter after generating your initial model.
ISO-11179 Naming Built In — No More Naming Debates
Every schema generated by the tool uses ISO-11179 standard healthcare column names automatically. The ISO-11179 standard defines column names as: object class + property + representation class.
The result eliminates the most common and most expensive debate in healthcare data teams.
Instead of:
patient_id
date_of_birth
claim_number
paid_amount
provider_name
member_zip
You get:
MBR_ID
BIRTH_DT
CLM_NBR
CLM_PD_AMT
PRVDR_NM
MBR_ZIP_CD
The tool references the mdatool glossary of 100,000+ healthcare terms to generate compliant abbreviations. MBR is the standard object class for member. DT is the standard representation class for date. AMT for monetary amount. CD for code. NM for name. NBR for number. IND for indicator.
These are not arbitrary abbreviations. They are the recognized standard in healthcare data engineering. Any certified healthcare data engineer reads CLM_PD_AMT and knows it is claim paid amount. Any NPI lookup connects to PRVDR_NPI without documentation. Any HEDIS calculation references DIAG_CD_1 without ambiguity.
For the full standard abbreviation reference, see the mdatool glossary and the ISO-11179 naming guide.
View as Diagram — Instant ER Visualization
After generating a schema, click View as Diagram to instantly visualize your data model as an interactive entity-relationship diagram.
The Data Model Canvas renders your generated schema with:
- All tables and column definitions
- Foreign key relationships drawn as connectable lines
- Color-coding by healthcare domain (claims tables, member tables, provider tables)
- Drag-and-drop table positioning to arrange by logical grouping
- Click-to-expand any table to see all columns and data types
- PDF export for architecture reviews, governance documentation, and team walkthroughs
This is the step that turns a DDL block into a shareable artifact. Paste the schema into the canvas and you have a diagram ready for a data governance review meeting in under two minutes.
See the canvas: mdatool.com/model-diagram
How to Use It — Step by Step
- Go to mdatool.com/tools/modeling
- Select your healthcare domain from the dropdown (e.g., Claims & Adjudication)
- Choose Physical Model as your output type
- Select Star Schema as your architecture (or whichever fits your use case)
- Choose your target platform (e.g., Snowflake)
- Write a plain English description of your data model in the text field
- Click Generate Data Model
- Review the generated DDL — tables, columns, data types, constraints
- Click View as Diagram to visualize the entity relationships
- Copy the DDL into your database tool (Snowflake Worksheet, BigQuery console, etc.)
- Iterate: adjust the description and regenerate to refine
Free tier limits:
- 1 generation without an account — no signup required
- Free account: 3 generations per month
- Pro: 20 generations per month + unlimited canvas exports and PDF export
Real Use Cases
Healthcare data teams use the AI Data Modeling tool across every segment of the industry.
Medicare Advantage health plans Designing a claims warehouse from scratch for a new MA plan is a 3-4 week project for an experienced data architect. The tool generates the core Star Schema in 30 seconds — fact tables for claims headers and lines, dimension tables for members, providers, plans, and dates, with HEDIS-ready table structures and HCC risk adjustment support. See the HCC risk adjustment guide for how these tables connect to CMS risk scoring. For NPI-related provider data, see the NPI reference.
Pharmacy Benefit Managers
PBM data models have distinct schema requirements — NDC codes, formulary tier tables, days-supply calculations, and medication adherence metrics. Selecting the Pharmacy & PBM domain generates columns like NDC_CD VARCHAR(11), DAYS_SPLY_CNT INTEGER, REFILL_NBR SMALLINT, and FORMULARY_TIER_CD VARCHAR(10) — the right names and types without manual lookup.
Hospital systems Clinical data warehouses for hospital systems need EHR integration schemas, quality measure reporting tables, and encounter-level clinical data models. The Clinical & EHR domain generates LOINC code columns, place-of-service codes, DRG tables, and CMS-aligned encounter grains.
Provider networks Provider master data models require credentialing tables, specialty hierarchies, network adequacy schemas, and NPI registry integration. The Provider Network domain generates the right structure for multi-specialty provider directories.
Health IT vendors Vendors building healthcare analytics platforms for multiple clients use the tool to generate client-specific data models rapidly — adapting the same base schema for different platforms (one client on Snowflake, another on BigQuery) by changing only the platform selector. Platform migration projects use it alongside the DDL Converter to translate existing schemas between database platforms.
Try the AI Data Modeling tool free → mdatool.com/tools/modeling
Frequently Asked Questions
Is the AI Data Modeling tool free?
Yes — one free generation requires no account. Free accounts (email signup) get 3 generations per month. Pro accounts get 20 generations per month plus unlimited canvas exports and PDF export of data model diagrams.
What healthcare domains does it support?
Claims & Adjudication, Clinical & EHR, Pharmacy & PBM, Member Enrollment, Provider Network, Quality Measures, Finance & Revenue Cycle, and Population Health. Each domain loads domain-specific column vocabulary, table patterns, and naming conventions that are not present in general-purpose schema generators.
How accurate are the generated schemas?
The schemas are production-ready starting points based on healthcare data standards and ISO-11179 naming conventions. Platform syntax is correct and immediately executable. Column names, data types, and table structures follow established healthcare data modeling patterns. They should be reviewed by a data architect before deployment to production — particularly for business-rule constraints specific to your organization, custom primary key strategies, and edge cases in your specific source system. Treat the output as an expert first draft that eliminates the blank-page problem, not a final artifact that bypasses schema review.
Can I run the generated DDL directly in Snowflake?
Yes. Selecting Snowflake as your target platform generates valid Snowflake DDL that executes in Snowflake Worksheets without modification. This includes Snowflake-specific syntax: TIMESTAMP_NTZ for timestamps, NUMBER for numeric columns, VARIANT for semi-structured fields where applicable, and BOOLEAN for indicator columns. No manual syntax corrections required before running.
What is the difference between Star Schema and Data Vault?
Star Schema is optimized for analytics and reporting. It generates fact tables (one row per business event — a claim, an encounter, a pharmacy fill) surrounded by dimension tables (members, providers, plans, dates). BI tools like Tableau and Power BI connect cleanly to this structure. Query performance is predictable. It is the right default for most healthcare analytics use cases.
Data Vault is optimized for enterprise data warehouses with multiple conflicting source systems, strict HIPAA audit trail requirements, and historical tracking needs that go beyond standard SCD Type 2. It is more complex to build and maintain but provides complete lineage from source to consumption — every row in every satellite is immutable with a source system tag and load timestamp. Use Data Vault when your organization has had compliance findings related to data provenance, or when you are aggregating claims + clinical + member data from 10+ source systems. For most teams building their first healthcare data warehouse, Star Schema is the right starting point.
mdatool Team
The mdatool team builds free engineering tools for healthcare data architects, analysts, and engineers working across payer, provider, and life sciences data.
Related Guides
Key Terms in This Article
More in Healthcare Data Modeling
Enforcing ISO-11179 Healthcare Naming Standards in dbt Projects
Every healthcare data warehouse eventually develops naming drift — DOB in one model, birth_dt in another, member_birth_date in a third. The dbt-healthcare-standards package brings ISO-11179 column naming directly into your dbt project as installable macros and schema tests.
Read moreAI-Powered Healthcare Data Modeling: Generate Production-Ready Schemas in Seconds
Healthcare data models take weeks to design manually — HIPAA constraints, CMS reporting requirements, and ISO-11179 naming standards all have to be applied correctly from the start. AI data modeling changes that. Here is how to generate production-ready schemas for Snowflake, BigQuery, and Databricks in seconds.
Read moreHealthcare Claims Data Model: Complete SQL Schema for Payers
The complete SQL schema for a payer-side healthcare claims data warehouse — including claim header, claim line, adjudication, remittance, and provider tables with DDL for Snowflake and BigQuery.
Read moreFree Tools
Free HCC RAF Score Calculator
Calculate CMS-HCC V28 RAF scores from ICD-10 codes. No account required.
Try it freeFree SQL Linter
Catch SQL bugs, performance issues, and naming violations before production.
Try it freeFree DDL Converter
Translate SQL schemas between Snowflake, BigQuery, Oracle, and SQL Server.
Try it freeReady to improve your data architecture?
Free tools for DDL conversion, SQL analysis, naming standards, and more.
Get weekly healthcare data engineering tips
Practical guides on data modeling, SQL standards, and healthcare domain conventions — straight to your inbox.
No spam. Unsubscribe any time.