top of page
Anomaly analysis in claims Data
Requirement
Information: Anomaly analysis focuses on identifying unusual patterns or outliers, such as claims that deviate from expected ranges or behaviors. Anomaly detection highlights unexpected or irregular patterns in the data.
Requirement: Create a Pyspark code to find anomaly for the table health_insurance_claimsunder different levels.
- Year level: In a year (in `Service_Date`) if the count of Claim_ID ,sum of Billed_Amount and sum of Allowed_Amount is 1.5 times preceding years values then flag it as anomaly
- Provider level : for each provider (`Provider_ID` ) if the count of claim id ,sum of billed amount and sum of Allowed Amount is 1.5 times preceding years values then flag it as anomaly
- Medication level: for each Medication if the count of Claim_ID ,sum of Billed_Amount and sum of Allowed_Amount is 1.5 times preceding years values then flag it as anomaly
plot 3 line graphs with count of claim id ,sum of billed amount and sum of Allowed Amount to study each of these anomalies
Unity catalog information: health_insurance_claims
Expected output: Databricks Pyspark code, Show the Result and Graphs.
Purgo AI Agentic Code
bottom of page