top of page

Data profiling on metric tables

Requirement

Requirement: Develop a Databricks PySpark script to generate a data profile table named metric_profile for each column in the metric tables, based on the metric tables health_insurance_claims_metric, d_product_revenue_metric and s_field_reporting_sales_source_customer_metric .

 

 

metric_profile should contain:

 

table_name

 

column_name

 

Data type

 

Null count

 

Distinct count

 

Total count

 

Minimum value

 

Maximum value

 

Mean

Purgo AI Agentic Code

bottom of page